{
  "biology": {
    "train": {
      "total_tokens": 564610758,
      "example": "# \\[2401.03968\\] scDiffusion: conditional generation of high-quality single-cell data using diffusion model\n\n<sup>†</sup>\n\n<sup>†</sup>footnotetext: <sup>\\#</sup> These authors contributed equally to this work.\n\n<sup>†</sup>\n\n<sup>†</sup>footnotetext: <sup>∗</sup> Corresponding Author. Email: zhangxg@tsinghua.edu.cn\n\n# scDiffusion: conditional generation of high-quality single-cell data using diffusion model\n\nErpai Luo$`^{1,^{\\#}}`$, Minsheng Hao$`^{1,^{\\#}}`$, Lei Wei<sup>1</sup>, Xuegong Zhang$`^{1,2,^{\\ast}}`$\n\n<sup>1</sup>MOE Key Lab of Bioinformatics and Bioinformatics Division of BNRIST,\nDepartment of Automation, Tsinghua University, Beijing 100084, China\n<sup>2</sup>School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China\n\n###### Abstract\n\nSingle-cell RNA sequencing (scRNA-seq) data are important for studying the biology of development or diseases at single-cell level. To better understand the properties of the data, to build controlled benchmark data for testing downstream methods, and to augment data when collecting sufficient real data is challenging, generative models have been proposed to computationally generate synthetic scRNA-seq data. However, the data generated with current models are not very realistic yet, especially when we need to generate data with controlled conditions. In the meantime, the Diffusion models have shown their power in generating data in computer vision at high fidelity, providing a new opportunity for scRNA-seq generation.\n\nIn this study, we developed scDiffusion, a diffusion-based model to generate high-quality scRNA-seq data with controlled conditions. We designed multiple classifiers to guide the diffusion process simultaneously, enabling scDiffusion to generate data under multiple condition combinations. We also proposed a new control strategy called Gradient Interpolation. This strategy allows the model to generate continuous trajectories of cell development from a given cell state.\n\nExperiments showed that scDiffusion can generate single-cell gene expression data closely resembling real scRNA-seq data, surpassing state-of-the-art models in multiple metrics. Also, scDiffusion can conditionally produce data on specific cell types including rare cell types. Furthermore, we could use the multiple-condition generation of scDiffusion to generate cell type that was out of the training data. Leveraging the Gradient Interpolation strategy, we generated a continuous developmental trajectory of mouse embryonic cells. These experiments demonstrate that scDiffusion is a powerful tool for augmenting the real scRNA-seq data and can provide insights into cell fate research.\n\n## 1 Introduction\n\nSingle-cell RNA sequencing (scRNA-seq) data offer comprehensive depictions of the gene expression profile of every single cell, which can help gain a more systematic and precise understanding of the development and function of living organisms \\[[1](#bib.bib1), [2](#bib.bib2)\\]. Although current sequencing technologies have come a long way, the cost and difficulty of sequencing remain high. Besides, the biological samples are sometimes hard to be obtained due to ethical reasons \\[[3](#bib.bib3), [4](#bib.bib4), [5](#bib.bib5)\\], and certain cell types within a sample may be too rare to be analyzed. It is still challenging to obtain enough high-quality scRNA-seq data of interest, which may impede biological discovery as most tools for scRNA-seq analysis require a certain amount of high-quality data.\n\nSome researchers have tried to generate in silico gene expression data in response to the less-than-desirable availability of scRNA-seq data. Unlike single-cell sequencing technologies that need real biological samples, the in silico data generation methods aim to generate pseudo data according to the expression pattern of known data. There are two main types of in silico data generation methods: statistical modelling and deep learning. Statistical modeling methods are guided by well-studied statistical distributions of gene expression profiles such as Zero-inflated Negative Binomial (ZINB) \\[[6](#bib.bib6)\\], and new data are generated by manually setting certain parameters of the distributions \\[[7](#bib.bib7), [8](#bib.bib8), [9](#bib.bib9), [10](#bib.bib10)\\]. However, due to the over simplification of statistical models, these methods can hardly mimic the real gene expression data exactly, but are mainly used as toy data for guiding the development of scRNA-seq analysis algorithms.\n\nThe recent prosperity of deep generative models brings new chances for the in silico transcriptomic data generation \\[[11](#bib.bib11)\\]. Deep generative models can learn the biological patterns of single-cell gene expression without any explicit modeling, which makes it possible to generate realistic scRNA-seq data. The variational autoencoder (VAE) is one of the most prominent deep generative models in the field \\[[12](#bib.bib12)\\]. A classic example of a VAE-based data generation model is the single-cell variational inference (scVI), which guide a VAE to infer underlying data distributions by generating data similar to original data \\[[13](#bib.bib13)\\]. However, these VAE-based models are designed to approximate the data distributions for downstream analysis tasks such as batch correction and clustering, rather than generating pseudo gene expression profiles of cells. Recently, the generative adversary network (GAN) \\[[14](#bib.bib14)\\] was utilized to develop tools specializing in generating new cells. For example, scGAN uses a deep learning model to learn the non-linear gene-gene dependencies from different cell samples, and then generates realistic scRNA-seq data according to the information it learns, achieving very impressive results in many evaluation metrics \\[[15](#bib.bib15)\\]. There are several variations of the scGAN model, such as LSH-GAN\\[[16](#bib.bib16)\\] and scIGAN\\[[17](#bib.bib17)\\], both of which utilize the generative capabilities of GAN to accomplish downstream tasks in single-cell analysis. Though, the scGAN model is specifically designed to generate data from a known distribution, and cannot be used to supplement unmeasured data. Besides, GAN requires careful design and tuning of model architectures and optimization methods to achieve stable training results \\[[18](#bib.bib18)\\], which may cause trouble when generalizing the GAN-based methods to other datasets.\n\nThe diffusion model \\[[19](#bib.bib19)\\] is the most trending generative model at the moment and has demonstrated excellent performance in a number of areas such as generating images and audios \\[[20](#bib.bib20), [21](#bib.bib21)\\]. It has many desirable properties like distribution coverage, a stationary training objective, and easy scalability, and has outperformed previous works in terms of generating significantly higher-quality data \\[[22](#bib.bib22), [23](#bib.bib23), [24](#bib.bib24)\\]. While the diffusion model has yielded great results in many fields, it has few applications in the single-cell area. The primary challenge lies in the fact that the distribution of scRNA-seq data significantly deviates from the Gaussian distribution, unlike image data which inherently exhibit a Gaussian-like distribution. This makes it difficult for diffusion models to generate new gene expression data, as the entire diffusion process is based on Gaussian noise.\n\nIn this paper, we propose scDiffusion, a novel in silico scRNA-seq data generation model based on the structure of the denoising diffusion probability model, to generate single-cell gene expression data with given conditions. scDiffusion mainly consists of three parts, an autoencoder, a diffusion backbone, and a condition controller. The autoencoder helps rectify the raw distribution and reduce the dimensionality of scRNA-seq data, which can make the data amenable to diffusion modeling. The backbone model was redesigned based on a multilayer perceptron (MLP) to make the model adaptable to the unordered nature of gene expression data. The conditional controller is a cell type classifier, enabling scDiffusion to generate data specific to a particular cell or organ type according to diverse requirements. scDiffusion was shown to be able to generate realistic scRNA-seq data which surpassed the data generated by scGAN in various evaluation metrics. We further demonstrated the ability of scDiffusion on conditional generation, multi-conditional generation capacity, and out-of-distribution data generation. We also proposed a novel condition control strategy, Gradient Interpolation, to interpolate continuous cell trajectories from discrete cell states. With the powerful generation ability, scDiffusion has the potential to augment existing scRNA-seq data and could potentially contribute to the investigation of undersampled or even unseen cell states.\n\n## 2 Methods\n\nThe scDiffusion model consists of three parts, an autoencoder, a diffusion backbone network, and a conditional classifier, as depicted in Fig. [1](#S2.F1 \"Figure 1 ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\"). The autoencoder is designed to transform the raw gene expression profiles into latent space embeddings to effectively reduce dimensions and ensure a more suitable distribution for the subsequent diffusion process. The diffusion backbone network is designed to learn the reversed diffusion process. Meanwhile, the conditional classifier provides guidance for the reverse diffusion, allowing the generation of cells under specific conditions such as specific cell types or organs.\n\nAt the training stage, the autoencoder is first trained to embed the gene expression profile of every single cell by using all real gene expression data. After, the diffusion process is applied to each embedding derived by the autoencoder and produces a series of noisy embeddings. These noisy embeddings serve as the training data for the backbone network. Meanwhile, the conditional classifier processes the embeddings to predict associated labels, such as cell types. At the inference stage, the diffusion backbone denoises the input noise embeddings and generates new embeddings. The generation can be guided by the classifier or the Gradient Interpolation strategy. The generated embeddings are finally fed into the decoder to obtain full gene expression. Detailed descriptions of scDiffusion are provided below.\n\n[Refer to caption](/html/2401.03968/assets/figs/model_archi.png)\n\nFigure 1: The overall structure of scDiffusion.\n\n### 2.1 Training the gene expression autoencoder\n\nThe dimensions of gene expression data are extremely high, and the data distribution differs from the Gaussian distribution used in the diffusion process. To solve this problem, we used an autoencoder consisting of two MLPs to encode the gene expression data of every single cell $`S_{o\\hspace{0pt}r\\hspace{0pt}i}`$ into a latent space embedding $`x_{0}`$. The input of the encoder is a gene expression profile that is normalized by 1e4 total counts, and the output is a 1,000-dimension latent space embedding. The decoder subsequently accepts the latent space embedding and generates the corresponding expression profile $`S_{n\\hspace{0pt}e\\hspace{0pt}w}`$ as the output. As shown in Fig. S1, the distribution of gene expression is transformed into a Gaussian-like distribution by the autoencoder, which is in line with the Gaussian distribution used in the diffusion process and makes it much easier for the backbone model to learn the reverse process.\n\n### 2.2 Training the diffusion backbone network\n\nAfter getting embeddings from the encoder, the diffusion process is applied to each embedding. The diffusion backbone network is trained to learn the reversed process. Classical diffusion backbone models such as convolutional neural networks are not applicable to gene expression, as a gene expression profile of scRNA-seq data is a long, sparse, and unordered vector. Thus, we developed a new architecture as the backbone, with fully connected layers and a residual structure (Fig. [1](#S2.F1 \"Figure 1 ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")). The residual structure can help to maintain the characteristics of features at different levels and reduce the loss of information.\n\nIn the diffusion process, the original cell embedding $`x_{0}`$ becomes a noised embedding $`x_{T}`$ by iteratively adding noise through $`T`$ steps. For the $`i`$-th step, the embedding $`x_{i}`$ is sampled from the following distribution:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>q(xi|xi−1)=𝒩(xi|1−βixi−1,βiI),whereβi∈(0,1)q\\left(x_{i}\\middle|x_{i-1}\\right)=\\mathcal{N}\\left(x_{i}\\middle|\\sqrt{1-\\beta_{i}}x_{i-1},\\beta_{i}I\\right),~{}~{}\\mathrm{where}\\quad\\beta_{i}\\in(0,1)</td>\n<td></td>\n<td>(1)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`I`$ stands for standard Gaussian noise. $`\\beta_{i}`$ is a coefficient that varies with time step, and $`\\beta_{m\\hspace{0pt}i\\hspace{0pt}n}`$ and $`\\beta_{m\\hspace{0pt}a\\hspace{0pt}x}`$ are two parameters that control the scale of $`\\beta_{i}`$ in the diffusion process:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>βi=βm​i​nT+i−1T−1​(βm​a​xT−βm​i​nT)subscript𝛽𝑖subscript𝛽𝑚𝑖𝑛𝑇𝑖1𝑇1subscript𝛽𝑚𝑎𝑥𝑇subscript𝛽𝑚𝑖𝑛𝑇\\beta_{i}=\\frac{\\beta_{min}}{T}+\\frac{i-1}{T-1}\\left({\\frac{\\beta_{max}}{T}-\\frac{\\beta_{min}}{T}}\\right)</td>\n<td></td>\n<td>(2)</td>\n</tr>\n</tbody>\n</table>\n\nThe training goal is to learn the reverse diffusion process $`p\\hspace{0pt}{(\\left. x_{i - 1} \\middle| x_{i} \\right.)}`$. In each iteration, $`x_{i - 1}`$ at step $`i - 1`$ is predicted, given an embedding $`x_{i}`$ at step $`i`$. Such process also follows the Gaussian distribution. According to previous works \\[[19](#bib.bib19), [22](#bib.bib22)\\], the mean and variance are parameterized as:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>pθ​(xi−1|xi)=𝒩​(xi−1|μθ​(xi,i),exp​(w​βi)​I)subscript𝑝𝜃conditionalsubscript𝑥𝑖1subscript𝑥𝑖𝒩conditionalsubscript𝑥𝑖1subscript𝜇𝜃subscript𝑥𝑖𝑖exp𝑤subscript𝛽𝑖𝐼p_{\\theta}(x_{i-1}|x_{i})=\\mathcal{N}(x_{i-1}|\\mu_{\\theta}(x_{i},i),\\mathrm{exp}({w\\beta}_{i}){I})</td>\n<td></td>\n<td>(3)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`w`$ in the variance is an adjustable weight that controls the randomness of the reverse process. The mean $`\\mu_{\\theta}\\hspace{0pt}{(x_{i},i)}`$ can be written as:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>μθ​(xi,i)=1αt​(𝐱t−βt1−α¯t​ϵθ​(𝐱t,t))subscript𝜇𝜃subscript𝑥𝑖𝑖1subscript𝛼𝑡subscript𝐱𝑡subscript𝛽𝑡1subscript¯𝛼𝑡subscriptbold-italic-ϵ𝜃subscript𝐱𝑡𝑡\\mu_{\\theta}(x_{i},i)=\\frac{1}{\\sqrt{\\alpha_{t}}}\\left(\\mathbf{x}_{t}-\\frac{\\beta_{t}}{\\sqrt{1-\\bar{\\alpha}_{t}}}\\boldsymbol{\\epsilon}_{\\theta}\\left(\\mathbf{x}_{t},t\\right)\\right)</td>\n<td></td>\n<td>(4)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\alpha_{t} = {1 - \\beta_{t}}`$ and $`{\\overline{\\alpha}}_{t} = {\\prod_{s = 1}^{t}\\alpha_{s}}`$. $`\\epsilon_{\\theta}\\hspace{0pt}\\left( x_{i},i \\right)`$ is the added noise predicted by the backbone network. In other words, the backbone network takes the cell’s latent space embedding $`x_{i}`$ and the timestamp $`i`$ as inputs to predict the noise.\n\nIn the inference process, the diffusion model takes the Gaussian noise as the initial input and denoises it iteratively through T steps. Eventually, we can get the new cellular latent space embedding $`x_{0}`$ and put it into the decoder to get the final gene expression data.\n\n### 2.3 Conditional generation and the Gradient Interpolation strategy\n\nWe use the classifier guidance method to perform conditional generation. This method does not interfere with the training of the diffusion backbone model. Instead, the classifier is first trained separately by using condition labels like cell types and then provides a gradient to guide cell generation. Here, we designed the cell classifier as a four-layer MLP. After generating a series of embeddings from cells with labels, the classifier takes both timestamp $`i`$ and cell embedding $`x_{i}`$ as inputs and predicts the cell labels $`y`$ paired with $`x_{0}`$. The cross entropy loss is used for training. It is worth noting that only the embeddings between step $`0`$ and step $`T/2`$ of the diffusion process are used for training the classifier, considering that the signal in the rest part is too noisy to be predicted.\n\nAs for inference, given each step $`i`$ between the last part of the reverse process (between timestamp 0 and $`T/2`$), the classifier receives the intermediate state $`x_{i}`$ and outputs the predicted probability for every cell type. By computing the cross entropy loss between the predicted and desired condition given by the user, the gradient derived from the classifier can guide the diffusion backbone model to generate a designated endpoint. The new embedding with the guidance is now sampled from:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>pθ(xi−1|xi,y)=𝒩(xi−1|μθ(xi,I)+βiγ∇xilogpϕ(y∣xi),exp(wβi)I)p_{\\theta}(x_{i-1}|x_{i},y)=\\mathcal{N}(x_{i-1}|\\mu_{\\theta}(x_{i},I)+\\beta_{i}\\gamma\\nabla_{x_{i}}{\\log p_{\\phi}}\\left({y\\mid x_{i}}\\right),\\mathrm{exp}({w\\beta}_{i}){I})</td>\n<td></td>\n<td>(5)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`p_{\\phi}\\hspace{0pt}\\left( {y \\mid x_{i}} \\right)`$ stands for the classifier’s result, and $`\\gamma`$ is a weight that controls the effectiveness of the classifier to the reverse process. $`\\phi`$ indicates the trainable parameters in the classifier. This guidance will affect every step of the reverse process and finally help the model’s output reach a certain condition.\n\nSince the classifier is trained aside from the diffusion model and is only used in the inference stage, we can train multiple classifiers {$`\\phi_{1},\\phi_{2},\\ldots`$} to control different conditions separately. The gradient that guides the diffusion process is the summation of all the classifiers’ gradients with different weights {$`\\gamma_{1},\\gamma_{2},\\ldots`$}.\n\nWe proposed a Gradient Interpolation strategy to generate continuous cell condition guidance. A classifier receives two different conditions such as the initial and end state of cell differentiation, and generates two gradients at the same time. These gradients are then integrated to guide the diffusion to an unseen intermediate state. Specifically speaking, the $`\\beta_{i}\\hspace{0pt}\\gamma\\hspace{0pt}{{\\nabla_{x_{i}}\\log}p_{\\phi}}\\hspace{0pt}\\left( {y \\mid x_{i}} \\right)`$ in Eq. [5](#S2.E5 \"In 2.3 Conditional generation and the Gradient Interpolation strategy ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") is replaced by:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>βi​γ​∇xilog⁡pϕ​(y∣xi)→βi​(γ1​∇xilog⁡pϕ​(y1∣xi)+γ2​∇xilog⁡pϕ​(y2∣xi))→subscript𝛽𝑖𝛾subscript∇subscript𝑥𝑖subscript𝑝italic-ϕconditional𝑦subscript𝑥𝑖subscript𝛽𝑖subscript𝛾1subscript∇subscript𝑥𝑖subscript𝑝italic-ϕconditionalsubscript𝑦1subscript𝑥𝑖subscript𝛾2subscript∇subscript𝑥𝑖subscript𝑝italic-ϕconditionalsubscript𝑦2subscript𝑥𝑖\\beta_{i}\\gamma\\nabla_{x_{i}}{\\log p_{\\phi}}\\left({y\\mid x_{i}}\\right)\\rightarrow\\beta_{i}(\\gamma_{1}\\nabla_{x_{i}}{\\log p_{\\phi}}\\left({y_{1}\\mid x_{i}}\\right)+\\gamma_{2}\\nabla_{x_{i}}{\\log p_{\\phi}}\\left({y_{2}\\mid x_{i}}\\right))</td>\n<td></td>\n<td>(6)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\gamma_{1}`$ and $`\\gamma_{2}`$ represent two adjustable coefficients that control the distance between the generated cells and the two target cell states. By tuning these coefficients, scDiffusion can decide which cell state the generated cell is closer to, thus generating cells with continuous states. With this strategy, the initial state of the diffusion generation process is changed from pure Gaussian noise to the latent space embedding of cells of the initial condition, following a noise addition process:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>xi​n​i​t=αt​x0+1−αt​ϵsubscript𝑥𝑖𝑛𝑖𝑡subscript𝛼𝑡subscript𝑥01subscript𝛼𝑡italic-ϵx_{init}=\\sqrt{\\alpha_{t}}x_{0}+\\sqrt{1-\\alpha_{t}}\\epsilon</td>\n<td></td>\n<td>(7)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`x_{i\\hspace{0pt}n\\hspace{0pt}i\\hspace{0pt}t}`$ is the initial state, and $`t`$ is a parameter that is smaller than the total diffusion step. $`\\alpha_{t}`$ is the same thing as in Eq. [4](#S2.E4 \"In 2.2 Training the diffusion backbone network ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\"). This modification preserves the general characteristics of the initial cells, allowing the model to generate a series of new cell states for each given initial state. These generated cells can constitute a continuous trajectory of cell states.\n\n### 2.4 Evaluation metrics\n\nTo compare similarity between generated and real cells, we evaluated the generated data with various metrics. The statistical indicators consist of Spearman Correlation Coefficient (SCC), Pearson Correlation Coefficient (PCC), Maximum Mean Discrepancy (MMD) \\[[25](#bib.bib25)\\], local inverse Simpson’s index (LISI) \\[[26](#bib.bib26)\\]. and quantile-quantile plot (QQ-plot). We normalized the gene expression data of generated and real cells and calculated SCC and PCC between them. The LISI score was calculated on the data-integrated KNN graph by using the Python package scib \\[[27](#bib.bib27)\\]. We used the top 50 principle components (PCs) of real and generated cells to calculate MMD. The QQ-plot was drawn using both real and generated expression data of a specific gene.\n\nThe non-satistical metrics include Uniform Manifold Approximation and Projection (UMAP) visualization \\[[28](#bib.bib28)\\], marker gene expression, CellTypist classification \\[[29](#bib.bib29)\\], and random forest evaluation. The UMAP plot was used to visualize the generated and real expression data on a two-dimensional plane to provide a subjective judgment for the generated data. Similar to scGAN, we projected the generated cells on the first 50 PCs that were computed from the real cells, and then drew UMAP based on these features. The cell type in unconditionally generated data is classified by CellTypist, a classifier trained with real cell data by using the cell type as the label \\[[29](#bib.bib29)\\]. CellTypist is also used to judge whether the conditionally generated data can be classified into the right type. The random forest evaluation shares the same idea with scGAN, which uses a random forest model with 1000 trees and 5 maximum depths to distinguish cells from real and generated, and the more similar these two cells are, the closer the area under the receiver operating characteristic (ROC) curve (AUC) metric of random forests is close to 0.5.\n\n## 3 Results\n\nWe conducted four experiments to demonstrate the capability of scDiffusion. First, we investigated the unconditional data generation ability of scDiffusion and compared it with scGAN. We then assessed scDiffusion on a conditional generation task to generate specific cell types. Furthermore, we applied scDiffusion in a multi-conditional generation case with both cell types and organs as conditions and used it to generate new cells under an unseen condition which is out of the distribution of the training data. Lastly, we employed the Gradient Interpolation strategy to generate intermediate states in cell reprogramming.\n\nWe employed three single cell transcriptomic datasets in these experiments. The PBMC68k dataset \\[[30](#bib.bib30)\\] is a classical scRNA-seq dataset that contains 11 different cell types of human peripheral blood mononuclear cells (PBMCs). As the CD4+ T helper 2 cells had an extremely low number and could not be classified by CellTypist, we removed them for downstream analysis. Tabular Muris \\[[31](#bib.bib31)\\] is a large-scale single cell transcriptomic database of mice across 12 organs. The Waddington-OT dataset \\[[32](#bib.bib32)\\] is a cell reprogramming dataset of mouse embryonic fibroblasts (MEFs), containing cells with different timestamps during an 18-day reprogramming process. For all three datasets, we filtered out cells with less than 10 expression counts and genes that expressed in less than 3 cells.\n\nIn all experiments, we set the diffusion step to 2000. The parameter $`\\gamma`$ in Eq. [5](#S2.E5 \"In 2.3 Conditional generation and the Gradient Interpolation strategy ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") was set to 8 for the conditional generation of PBMC68k data and the intermediate state generation, and it was set to 2 for all other experiments. The parameter $`w`$ in Eq. [3](#S2.E3 \"In 2.2 Training the diffusion backbone network ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") and Eq. [5](#S2.E5 \"In 2.3 Conditional generation and the Gradient Interpolation strategy ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") was set to 0.5. The parameter $`t`$ in Eq. [7](#S2.E7 \"In 2.3 Conditional generation and the Gradient Interpolation strategy ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") was set to 1800.\n\n### 3.1 Realistic scRNA-seq data generation\n\nWe applied scDiffusion on the Tabular Muris dataset to generate new cells (Fig. [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")a). For comparison, we also generated cells with scGAN using its default parameter setting. We evaluated the performance of scDiffusion and scGAN with various metrics, and the results indicated that scDiffusion can generate more realistic scRNA-seq data, achieving a SCC of 0.932, a MMD of 0.059, and a LISI score of 0.88, while scGAN’s results were 0.867, 0.065, and 0.66, respectively. The Random Forest AUC score for scDiffusion in the test set was 0.61, also outperforming scGAN’s 0.78 (Fig. [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")b). We also trained scGAN and scDiffusion on the PBMC68k dataset, which showed a very similar result to the Tabular Muris dataset (Fig. [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")c). The SCC, MMD and LISI between the scDiffusion-generated cells and real cells were 0.84, 0.013, and 0.91, respectively, while the results of scGAN were 0.89, 0.019, 0.88. The Random Forest result shown in Fig. [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")d also demonstrated the outperformance of scDiffusion.\n\nFor the Tabular Muris dataset, we selected five transcription factors (Klf13, Ybx1, Hnrnpk, Cnbp, Hmgb2) that have the highest mean Gini importance when making cell type classification in the original paper \\[[31](#bib.bib31)\\]. We calculated the mean expression level of every marker in different cell types for real and scDiffusion-generated data, respectively, and then calculated PCCs between them. For the PBMC68k dataset, we selected several marker genes, including CD3D, CD8A, NKG7 \\[[33](#bib.bib33)\\], CD79A \\[[34](#bib.bib34)\\], CCR10 \\[[35](#bib.bib35)\\], and S100A8 \\[[36](#bib.bib36)\\] and did the same calculation. PCCs of all these genes are larger than 0.98. The distributions of expression of these genes in real and generated data also showed to be similar (Fig. [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")e, [2](#S3.F2 \"Figure 2 ‣ 3.1 Realistic scRNA-seq data generation ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")f, S2, and S3). We also draw the QQ-plots of these genes in different cell types between real and generated data (Fig. S4). All these results show that scDiffusion can generate realistic data.\n\n[Refer to caption](/html/2401.03968/assets/figs/result_noncondi.png)\n\nFigure 2: scDiffusion can generate realistic cell data. (a) UMAP of scDiffusion-generated Tabular Muris data and real Tabular Muris data. (b) Random Forest ROC curve of different methods in the Tabular Muris dataset. (c) UMAP of scDiffusion-generated PBMC68k data and real PBMC68k data. (d) Random Forest ROC curve of different methods in the PBMC68k dataset. (e) The expression of marker gene Klf13 in different cell types in the real and generated Tabular Muris data. (f) The expression of marker gene CD3D in different cell types in the real and generated PBMC68k data.\n\n### 3.2 Conditionally generating specific cell types\n\nWe trained a cell type classifier according to the annotation provided by the Tabular Muris dataset to to guide the conditional generation of scDiffusion. For each cell type, we conditionally generated the same number of cells as the real data. As shown in Fig. [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")a and S5, the conditionally generated cells visually overlapped with the real cells on the UMAP plot. We used Celltypist to classify these conditionally generated cells. As shown in Fig. [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")b, the classification accuracies of the generated cells are close to those of the real cells in the test set. We also performed a similar procedure on the PBMC68k dataset, and the results are similar (Fig. [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")c and [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")d). It’s worth mentioning that rare cell types, such as the Thymus cell in the Tabular Muris dataset (2.5% in the whole dataset) and the CD34+ cell in the PBMC68k dataset (0.4% in the whole dataset), can also be well generated.\n\nWe then investigated how different parameter settings for $`\\gamma`$ and $`w`$ in Eq. [5](#S2.E5 \"In 2.3 Conditional generation and the Gradient Interpolation strategy ‣ 2 Methods ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\") affect the generation results, taking CD19+ B cells in the PBMC68k dataset as an example. We found that as the weight of classifier guidance $`\\gamma`$ increases and the noise of the reverse process $`e\\hspace{0pt}x\\hspace{0pt}p\\hspace{0pt}{({\\sigma_{i}\\hspace{0pt}\\mathbf{w}})}\\hspace{0pt}\\varepsilon`$ decreases, the generated cells of the specific condition exhibit a denser distribution, and thus the boundaries between this cell type and other cell types become more distinct, which leads to higher accuracies of CellTypist to classify cell types (Fig. [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")e). At the meantime, as real cells often exhibit a dispersed distribution, the densely generated cells may not fill all possible regions of the real distribution. These unfilled regions can be easily distinguished by a Random Forest classifier, leading to a worse Random Forest AUC performance (Fig. [3](#S3.F3 \"Figure 3 ‣ 3.2 Conditionally generating specific cell types ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")f).\n\n[Refer to caption](/html/2401.03968/assets/figs/result_condi.png)\n\nFigure 3: (a) UMAP of different cell types in the Tabular Muris dataset generated by conditional diffusion. The Thymus cell is a rare cell type. (b) The accuracy of CellTypist in different cell types in the Tabular Muris dataset. (c) UMAP of different cell types in the PPBMC68k dataset generated by conditional diffusion. The CD34+ cell is a rare cell type. (d) The accuracy of CellTypist in different cell types in the PBMC68k dataset. (e) The CellTypist accuracy of the CD19+ B cell with different parameter settings. (f) The Random Forest score of the CD19+ B cell with different parameter settings.\n\n### 3.3 Generating out-of-distribution cell data with multiple conditions\n\nWe then tried to generate cells with multiple conditions baes on the Tabular Muris dataset. We trained two classifiers to separately control different conditions, one for organ type and the other for cell type. We selected three cell groups, mammary gland T cell, spleen T cell, and spleen B cell, from the dataset for training. We would like to generate cells with a new combination of conditions (mammary B cell) which was not seen in the training data, or in other words, out of the distribution of the training data.\n\nAs shown in Fig. [4](#S3.F4 \"Figure 4 ‣ 3.3 Generating out-of-distribution cell data with multiple conditions ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\"), the generated cells with all combinations of conditions, including the out-of-distribution cells, visually overlapped with or near the real cells on the UMAP plot. The MMD score of the out-of-distribution cells was relatively higher than other generated cells (Fig. [4](#S3.F4 \"Figure 4 ‣ 3.3 Generating out-of-distribution cell data with multiple conditions ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")). We further trained a CellTypist model with all kinds of cells in mammary gland, and used it to classified the real and generated mammary gland B cells. The result showed that 85% of the generated cells and 91% of the real cells were categorized into the B cell. which showed that the scDiffusion can generate mammary B cells comparable to the real one. All the results suggested that scDiffusion can effectively generate realistic out-of-distribution mammary gland B cells by learning the expression patterns of both mammary gland cells and B cells.\n\n[Refer to caption](/html/2401.03968/assets/figs/ood.png)\n\nFigure 4: UMAP of real cells and cells generated with two conditions. The mammary gland B cells are unseen in the training data.\n\n### 3.4 Generating intermediate cell states during cell reprogramming\n\nWe used the Gradient Interpolation strategy to generate the intermediate cell states during cell reprogramming in the Waddington-OT dataset. We trained scDiffusion on the Waddington-OT dataset, which contains MEFs with the induction of reprogramming to induced pluripotent stem cells (iPSCs). The data were across 18 days since induction with a half-day interval, and a part of the cells were induced to redifferentiate at day 8. We first chose all samples from day 0 to day 8 with the exception of day 0.5 and day 1 to train scDiffusion, and trained the classifier with the same dataset using the timestamp as the label. We then sent two conditions, day 0 and day 1.5, to the classifier and used Gradient Interpolation to generate a series of cell states between day 0 and day 1.5 in the development trajectory (Fig. [5](#S3.F5 \"Figure 5 ‣ 3.4 Generating intermediate cell states during cell reprogramming ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")a). The initial noise was set to be the noised latent space embeddings of day-0 cells.\n\n[Refer to caption](/html/2401.03968/assets/figs/interpolate.png)\n\nFigure 5: (a) UMAP of real cells. (b) UMAP of cells generated by Gradient Interpolation. (b) the MMD score of different methods at different timestamps.\n\nWe generated 20 states between day 0 and day 1.5 (Fig. [5](#S3.F5 \"Figure 5 ‣ 3.4 Generating intermediate cell states during cell reprogramming ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")b). We calculated the diffusion pseudotime \\[[37](#bib.bib37)\\] of different states (Fig. S6) and found that state 8 and state 18 are the closest to real cells of day 0.5 and day 1, respectively. These cells were previously stripped out from the training data. We compared these two states with the direct interpolation of day 0 and day 1.5 whose weight was the same as Gradient Interpolation. The MMD scores of state 8 and state 18 are 0.31 and 0.90, while the direct interpolation’s scores are 0.35 and 1.55. These results showed that scDiffusion can generate cells that are closer to the real intermediate state.\n\nWe also tried to train scDiffusion with all integer days and generate cells in the middle of two integer days. We compared the results with direct interpolation. As some of the cells were induced to redifferentiate at day 8, we interpolated cells within each treatment group separately. The interpolation weights of both methods were set to 1:1. As shown in Fig. [5](#S3.F5 \"Figure 5 ‣ 3.4 Generating intermediate cell states during cell reprogramming ‣ 3 Results ‣ scDiffusion: conditional generation of high-quality single-cell data using diffusion model\")c, scDiffusion exhibited better performance than direct interpolation in MMD metrics. The mean MMD of scDiffusion is 0.36, while the result of direct interpolation is 0.51, respectively. It is worth noting that scDiffusion was not trained with the information of different treatments, but its performance was still better than direct interpolation according to the treatment information, suggesting that the diffusion model can well capture the miscellaneous distribution of cells and well fit their intermediate states.\n\n## 4 Discussion\n\nIn this paper, we present a deep generative neural network scDiffusion based on the denoising diffusion probability model. We use an autoencoder and a MLP-based backbone to enable diffusion models to be suitable for gene expression data, and generate realistic single cell data. By utilizing the classifier guidance method, scDiffusion can conditionally generate specific cell expression data based on user-defined conditions, including rare cell types. The flexibility of the classifier guidance also offers the potential to generate cells that are not seen in the training dataset. Furthermore, the Gradient Interpolation strategy enables the generation of a continuous cell trajectory between two known cell states to fill in intermediate states. These abilities can be used to augment available scRNA-seq data and hold the potential for analyzing cell states that are not sequenced.\n\nWith the powerful generative ability, scDiffusion has the prospect to carry on many other tasks. A very natural thing is multi-omics data generation. Theoretically, scDiffusion can generate any kind of single cell data. Besides, scDiffusion can also be used in the quality improvement of single cell data. For instance, by learning the overall expression paradigm in clean data, scDiffusion is able to perform denoising operations for contaminated data. In the future, we will try to replace the classifier with more powerful tools such as CLIP \\[[38](#bib.bib38)\\] in the stable diffusion \\[[39](#bib.bib39)\\]. In this way we may use more complex conditions to control the generating process and enable more complex tasks such as in silico cell perturbation, providing important help for drug selection and the control of cell state transition. The code of scDiffusion is available at https://github.com/EperLuo/scDiffusion.\n\n## 5 Acknowledgements\n\nThe work is supported in part by National Key R&D Program of China (grant 2021YFF1200900), and National Natural Science Foundation of China (grants 62250005, 61721003, 62373210).\n\n## References\n\n- \\[1\\] Jovic, D. *et al.* Single-cell rna sequencing technologies and applications: A brief overview.\n\n  *Clinical and Translational Medicine* 12, e694 (2022).\n\n- \\[2\\] Gohil, S. H., Iorgulescu, J. B., Braun, D. A., Keskin, D. B. & Livak, K. J. Applying high-dimensional single-cell technologies to the analysis of cancer immunotherapy.\n\n  *Nature Reviews Clinical Oncology* 18, 244–256 (2021).\n\n- \\[3\\] Jiang, P. *et al.* Big data in basic and translational cancer research.\n\n  *Nature Reviews Cancer* 22, 625–639 (2022).\n\n- \\[4\\] Ke, M., Elshenawy, B., Sheldon, H., Arora, A. & Buffa, F. M. Single cell rna-sequencing: A powerful yet still challenging technology to study cellular heterogeneity.\n\n  *BioEssays* 44, 2200084 (2022).\n\n- \\[5\\] Suvà, M. L. & Tirosh, I. Single-cell rna sequencing in cancer: lessons learned and emerging challenges.\n\n  *Molecular cell* 75, 7–12 (2019).\n\n- \\[6\\] Greene, W. H. Accounting for excess zeros and sample selection in poisson and negative binomial regression models (1994).\n\n- \\[7\\] Lindenbaum, O., Stanley, J., Wolf, G. & Krishnaswamy, S. Geometry based data generation.\n\n  *Advances in Neural Information Processing Systems* 31 (2018).\n\n- \\[8\\] Dibaeinia, P. & Sinha, S. Sergio: a single-cell expression simulator guided by gene regulatory networks.\n\n  *Cell systems* 11, 252–271 (2020).\n\n- \\[9\\] Li, W. V. & Li, J. J. A statistical simulator scdesign for rational scrna-seq experimental design.\n\n  *Bioinformatics* 35, i41–i50 (2019).\n\n- \\[10\\] Zappia, L., Phipson, B. & Oshlack, A. Splatter: simulation of single-cell rna sequencing data.\n\n  *Genome biology* 18, 174 (2017).\n\n- \\[11\\] Lopez, R., Gayoso, A. & Yosef, N. Enhancing scientific discoveries in molecular biology with deep generative models.\n\n  *Molecular systems biology* 16, e9198 (2020).\n\n- \\[12\\] Kingma, D. P. & Welling, M. Auto-encoding variational bayes. *arXiv preprint arXiv:1312.6114* (2013).\n\n- \\[13\\] Lopez, R., Regier, J., Cole, M. B., Jordan, M. I. & Yosef, N. Deep generative modeling for single-cell transcriptomics.\n\n  *Nature methods* 15, 1053–1058 (2018).\n\n- \\[14\\] Goodfellow, I. *et al.* Generative adversarial nets.\n\n  *Advances in neural information processing systems* 27 (2014).\n\n- \\[15\\] Marouf, M. *et al.* Realistic in silico generation and augmentation of single-cell rna-seq data using generative adversarial networks.\n\n  *Nature communications* 11, 166 (2020).\n\n- \\[16\\] Lall, S., Ray, S. & Bandyopadhyay, S. Lsh-gan enables in-silico generation of cells for small sample high dimensional scrna-seq data.\n\n  *Communications Biology* 5, 577 (2022).\n\n- \\[17\\] Xu, Y. *et al.* scigans: single-cell rna-seq imputation using generative adversarial networks.\n\n  *Nucleic acids research* 48, e85–e85 (2020).\n\n- \\[18\\] Brock, A., Donahue, J. & Simonyan, K. Large scale gan training for high fidelity natural image synthesis. *arXiv preprint arXiv:1809.11096* (2018).\n\n- \\[19\\] Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models.\n\n  *Advances in neural information processing systems* 33, 6840–6851 (2020).\n\n- \\[20\\] Yang, L. *et al.* Diffusion models: A comprehensive survey of methods and applications. *ACM Computing Surveys* (2022).\n\n- \\[21\\] Cao, H. *et al.* A survey on generative diffusion model. *arXiv preprint arXiv:2209.02646* (2022).\n\n- \\[22\\] Dhariwal, P. & Nichol, A. Diffusion models beat gans on image synthesis.\n\n  *Advances in neural information processing systems* 34, 8780–8794 (2021).\n\n- \\[23\\] Song, J., Meng, C. & Ermon, S. Denoising diffusion implicit models. *arXiv preprint arXiv:2010.02502* (2020).\n\n- \\[24\\] Nichol, A. *et al.* Glide: Towards photorealistic image generation and editing with text-guided diffusion models. *arXiv preprint arXiv:2112.10741* (2021).\n\n- \\[25\\] Gretton, A., Borgwardt, K. M., Rasch, M. J., Schölkopf, B. & Smola, A. A kernel two-sample test.\n\n  *The Journal of Machine Learning Research* 13, 723–773 (2012).\n\n- \\[26\\] Haghverdi, L., Lun, A. T., Morgan, M. D. & Marioni, J. C. Batch effects in single-cell rna-sequencing data are corrected by matching mutual nearest neighbors.\n\n  *Nature biotechnology* 36, 421–427 (2018).\n\n- \\[27\\] Luecken, M. D. *et al.* Benchmarking atlas-level data integration in single-cell genomics.\n\n  *Nature methods* 19, 41–50 (2022).\n\n- \\[28\\] McInnes, L., Healy, J. & Melville, J. Umap: Uniform manifold approximation and projection for dimension reduction. *arXiv preprint arXiv:1802.03426* (2018).\n\n- \\[29\\] Domínguez Conde, C. *et al.* Cross-tissue immune cell analysis reveals tissue-specific features in humans.\n\n  *Science* 376, eabl5197 (2022).\n\n- \\[30\\] Zheng, G. X. *et al.* Massively parallel digital transcriptional profiling of single cells.\n\n  *Nature communications* 8, 14049 (2017).\n\n- \\[31\\] Schaum, N. *et al.* Single-cell transcriptomics of 20 mouse organs creates a tabula muris: The tabula muris consortium.\n\n  *Nature* 562, 367 (2018).\n\n- \\[32\\] Schiebinger, G. *et al.* Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming.\n\n  *Cell* 176, 928–943 (2019).\n\n- \\[33\\] Borrego, F., Masilamani, M., Marusina, A. I., Tang, X. & Coligan, J. E. The cd94/nkg2 family of receptors: from molecules and cells to clinical relevance.\n\n  *Immunologic research* 35, 263–277 (2006).\n\n- \\[34\\] Chu, P. G. & Arber, D. A. Cd79: a review.\n\n  *Applied Immunohistochemistry & Molecular Morphology* 9, 97–106 (2001).\n\n- \\[35\\] Lubberts, E. The il-23–il-17 axis in inflammatory arthritis.\n\n  *Nature Reviews Rheumatology* 11, 415–429 (2015).\n\n- \\[36\\] Schiopu, A., Cotoi, O. S. *et al.* S100a8 and s100a9: Damps at the crossroads between innate immunity, traditional risk factors, and cardiovascular disease.\n\n  *Mediators of inflammation* 2013 (2013).\n\n- \\[37\\] Haghverdi, L., Büttner, M., Wolf, F. A., Buettner, F. & Theis, F. J. Diffusion pseudotime robustly reconstructs lineage branching.\n\n  *Nature methods* 13, 845–848 (2016).\n\n- \\[38\\] Radford, A. *et al.* Learning transferable visual models from natural language supervision. In *International conference on machine learning*, 8748–8763 (PMLR, 2021).\n\n- \\[39\\] Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. In *Proceedings of the IEEE/CVF conference on computer vision and pattern recognition*, 10684–10695 (2022).\n\nscDiffusion: conditional generation of high-quality single-cell data using diffusion model\n\nErpai Luo$`^{1,^{\\#}}`$, Minsheng Hao$`^{1,^{\\#}}`$, Lei Wei<sup>1</sup>, Xuegong Zhang$`^{1,2,^{\\ast}}`$\n\n<sup>1</sup>MOE Key Lab of Bioinformatics and Bioinformatics Division of BNRIST,\nDepartment of Automation, Tsinghua University, Beijing 100084, China\n<sup>2</sup>School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China\n\n<sup>†</sup>\n\n<sup>†</sup>footnotetext: <sup>\\#</sup> These authors contributed equally to this work.\n\n<sup>†</sup>\n\n<sup>†</sup>footnotetext: <sup>∗</sup> Corresponding Author. Email: zhangxg@tsinghua.edu.cn\n\n## Supplementary Materials\n\n[Refer to caption](/html/2401.03968/assets/figs/distribution_transfer.png)\n\nFigure S1: Distribution of original gene expression and latent embeddings derived by the autoencoder.\n\n[Refer to caption](/html/2401.03968/assets/figs/feature_gene_exp_a.png)\n\nFigure S2: Expression of feature genes in the real and generated Tabular Muris data.\n\n[Refer to caption](/html/2401.03968/assets/figs/feature_gene_exp_a.png)\n\nFigure S3: Expression of feature genes in the real and generated PBMC68k data.\n\n[Refer to caption](/html/2401.03968/assets/figs/qq_plot.png)\n\nFigure S4: QQ-plots of expression of feature genes in the real and generated data. (a) The Tabular Muris dataset. (b) The PBMC68k dataset.\n\n[Refer to caption](/html/2401.03968/assets/figs/condi_all.png)\n\nFigure S5: UMAP of conditionally generated cells. (a) The Tabular Muris dataset. (b) The PBMC68k dataset.\n\n[Refer to caption](/html/2401.03968/assets/figs/pseudotime.png)\n\nFigure S6: Pseudotime distance of generated states with different interpolation weights. Orange lines are the pseudotime of days 0, 0.5, 1, and 1.5 in the real data, from bottom to up.<|endoftext|>"
    },
    "test": {
      "total_tokens": 63641169,
      "example": "# Quantitative Biology \\> Other Quantitative Biology\n\n**arXiv:2412.17005** (q-bio)\n\n\\[Submitted on 22 Dec 2024\\]\n\n# Title:Investigation of phytochemicals, spectral properties, anticancer, antidiabetic, and antimicrobial activities of chosen Ayurvedic remedies\n\nAuthors:<a href=\"https://arxiv.org/search/q-bio?searchtype=author&amp;query=Shareef,+T+H+M+A\" rel=\"nofollow\">T. H. Mohamed Ahadu Shareef</a>, <a href=\"https://arxiv.org/search/q-bio?searchtype=author&amp;query=Navabshan,+I\" rel=\"nofollow\">Irfan Navabshan</a>, <a href=\"https://arxiv.org/search/q-bio?searchtype=author&amp;query=Masood,+M+M+D\" rel=\"nofollow\">M Mohamed Divan Masood</a>, <a href=\"https://arxiv.org/search/q-bio?searchtype=author&amp;query=Yuvaraj,+T+E\" rel=\"nofollow\">T. Eswara Yuvaraj</a>, <a href=\"https://arxiv.org/search/q-bio?searchtype=author&amp;query=Sherif,+A\" rel=\"nofollow\">A. Sherif</a>\n\nView a PDF of the paper titled Investigation of phytochemicals, spectral properties, anticancer, antidiabetic, and antimicrobial activities of chosen Ayurvedic remedies, by T. H. Mohamed Ahadu Shareef and 4 other authors\n\n[View PDF](/pdf/2412.17005)\n\n> Abstract:This study examines the phytochemical characteristics of Ayurvedic products. An analysis was performed on Kottakkal Ayurveda Triphala (T), Kottakkal Ayurveda Hinguvachadi Churnam (H), and Kottakkal Ayurveda Jirakadyarishtam (J) using GC-MS and LC-MS techniques to determine their bioactive constituents, while also assessing their antimicrobial, docking, anticancer, and anti-diabetic activities. The GC-MS analysis identified 30, 45, and 8 chemical components in Kottakkal Ayurveda Triphala (T), Kottakkal Ayurveda Hinguvachadi Churnam (H), and Kottakkal Ayurveda Jirakadyarishtam (J), respectively. The LC-MS analysis produced 15, 20, and 16 peaks for Kottakkal Ayurveda Triphala (T), Kottakkal Ayurveda Hinguvachadi Churnam (H), and Kottakkal Ayurveda Jirakadyarishtam (J), with m/z values of 982, 981, 972, and 933; 987, 985, 974, and 945; and 969, 965, 951, and 941, respectively, confirming their precision. Moreover, characterization of the Ayurvedic products was carried out using FT-IR, UV-vis, and 1H-NMR spectroscopy to identify significant functional groups and chemical substances. Kottakkal Ayurveda Triphala (T) was evaluated for antibacterial activity against Gram-positive bacteria (Streptococcus pneumoniae and Staphylococcus aureus) along with Gram-negative bacteria (Escherichia coli and Klebsiella pneumoniae), yielding a P value of 0.0650 (P \\< 0.0001). Both Kottakkal Ayurveda Hinguvachadi Churnam (H) and Kottakkal Ayurveda Jirakadyarishtam (J) were subjected to analysis for their effectiveness against Aspergillus niger and Aspergillus fumigatus, also revealing a P value within the acceptable range of 0.0650 (P \\< 0.0001). The anti-diabetic properties of Kottakkal Ayurveda Triphala (T) were assessed using the {\\alpha}-glucosidase inhibitory method, which exhibited a significant inhibitory effect on {\\alpha}-glucosidase, resulting in an average P value of 0.001 (P \\< 0.0001).\n\n<table>\n<tbody>\n<tr>\n<td>Comments:</td>\n<td>for associated mpeg file, see this http URL</td>\n</tr>\n<tr>\n<td>Subjects:</td>\n<td>Other Quantitative Biology (q-bio.OT)</td>\n</tr>\n<tr>\n<td>MSC classes:</td>\n<td>14J60 (Primary) 14F05, 14J26 (Secondary)</td>\n</tr>\n<tr>\n<td>ACM classes:</td>\n<td>F.2.2; I.2.7</td>\n</tr>\n<tr>\n<td>Report number:</td>\n<td>Report-no: EFI-94-11</td>\n</tr>\n<tr>\n<td>Cite as:</td>\n<td>arXiv:2412.17005 [q-bio.OT]</td>\n</tr>\n<tr>\n<td> </td>\n<td>(or arXiv:2412.17005v1 [q-bio.OT] for this version)</td>\n</tr>\n<tr>\n<td> </td>\n<td>https://doi.org/10.48550/arXiv.2412.17005 img.svg Focus to learn more arXiv-issued DOI via DataCite</td>\n</tr>\n<tr>\n<td>Journal reference:</td>\n<td>J.Hasty Results 1 (2008) 1-9; Erratum: J.Hasty Results 2 (2008) 1-2</td>\n</tr>\n<tr>\n<td>Related DOI:</td>\n<td>https://doi.org/10.1016/S0550-3213%2801%2900405-9 img.svg Focus to learn more DOI(s) linking to related resources</td>\n</tr>\n</tbody>\n</table>\n\n## Submission history\n\nFrom: Mohamed Hyder Thivan \\[<a href=\"/show-email/d1df0355/2412.17005\" rel=\"nofollow\">view email</a>\\]\n**\\[v1\\]** Sun, 22 Dec 2024 12:56:06 UTC (606 KB)\n\nFull-text links:\n\n## Access Paper:\n\n- View a PDF of the paper titled Investigation of phytochemicals, spectral properties, anticancer, antidiabetic, and antimicrobial activities of chosen Ayurvedic remedies, by T. H. Mohamed Ahadu Shareef and 4 other authors\n\n- <a href=\"/pdf/2412.17005\" accesskey=\"f\" aria-describedby=\"download-button-info\">View PDF</a>\n\n[view license](http://arxiv.org/licenses/nonexclusive-distrib/1.0/ \"Rights to this article\")\n\nCurrent browse context:\n\nq-bio.OT\n\n<a href=\"/prevnext?id=2412.17005&amp;function=prev&amp;context=q-bio.OT\" accesskey=\"p\" rel=\"nofollow\" title=\"previous in q-bio.OT (accesskey p)\">&lt; prev</a>   \\|   <a href=\"/prevnext?id=2412.17005&amp;function=next&amp;context=q-bio.OT\" accesskey=\"n\" rel=\"nofollow\" title=\"next in q-bio.OT (accesskey n)\">next &gt;</a>\n\n<a href=\"/list/q-bio.OT/new\" rel=\"nofollow\">new</a> \\| <a href=\"/list/q-bio.OT/recent\" rel=\"nofollow\">recent</a> \\| <a href=\"/list/q-bio.OT/2024-12\" rel=\"nofollow\">2024-12</a>\n\nChange to browse by:\n\n<a href=\"/abs/2412.17005?context=q-bio\" rel=\"nofollow\">q-bio</a>\n\n### References & Citations\n\n- [NASA ADS](https://ui.adsabs.harvard.edu/abs/arXiv:2412.17005)\n- <a href=\"https://scholar.google.com/scholar_lookup?arxiv_id=2412.17005\" rel=\"noopener\" target=\"_blank\">Google Scholar</a>\n- <a href=\"https://api.semanticscholar.org/arXiv:2412.17005\" rel=\"noopener\" target=\"_blank\">Semantic Scholar</a>\n\nexport BibTeX citation Loading...\n\n## BibTeX formatted citation\n\n×\n\nData provided by:\n\n### Bookmark\n\n[](http://www.bibsonomy.org/BibtexHandler?requTask=upload&url=https://arxiv.org/abs/2412.17005&description=Investigation%20of%20phytochemicals,%20spectral%20properties,%20anticancer,%20antidiabetic,%20and%20antimicrobial%20activities%20of%20chosen%20Ayurvedic%20remedies \"Bookmark on BibSonomy\") [BibSonomy logo](/static/browse/0.3.4/images/icons/social/bibsonomy.png) [](https://reddit.com/submit?url=https://arxiv.org/abs/2412.17005&title=Investigation%20of%20phytochemicals,%20spectral%20properties,%20anticancer,%20antidiabetic,%20and%20antimicrobial%20activities%20of%20chosen%20Ayurvedic%20remedies \"Bookmark on Reddit\") [Reddit logo](/static/browse/0.3.4/images/icons/social/reddit.png)\n\nBibliographic Tools\n\n# Bibliographic and Citation Tools\n\nBibliographic Explorer Toggle\n\nBibliographic Explorer *([What is the Explorer?](https://info.arxiv.org/labs/showcase.html#arxiv-bibliographic-explorer))*\n\nConnected Papers Toggle\n\nConnected Papers *(<a href=\"https://www.connectedpapers.com/about\" target=\"_blank\">What is Connected Papers?</a>)*\n\nLitmaps Toggle\n\nLitmaps *(<a href=\"https://www.litmaps.co/\" target=\"_blank\">What is Litmaps?</a>)*\n\nscite.ai Toggle\n\nscite Smart Citations *(<a href=\"https://www.scite.ai/\" target=\"_blank\">What are Smart Citations?</a>)*\n\nCode, Data, Media\n\n# Code, Data and Media Associated with this Article\n\nalphaXiv Toggle\n\nalphaXiv *(<a href=\"https://alphaxiv.org/\" target=\"_blank\">What is alphaXiv?</a>)*\n\nLinks to Code Toggle\n\nCatalyzeX Code Finder for Papers *(<a href=\"https://www.catalyzex.com\" target=\"_blank\">What is CatalyzeX?</a>)*\n\nDagsHub Toggle\n\nDagsHub *(<a href=\"https://dagshub.com/\" target=\"_blank\">What is DagsHub?</a>)*\n\nGotitPub Toggle\n\nGotit.pub *(<a href=\"http://gotit.pub/faq\" target=\"_blank\">What is GotitPub?</a>)*\n\nHuggingface Toggle\n\nHugging Face *(<a href=\"https://huggingface.co/huggingface\" target=\"_blank\">What is Huggingface?</a>)*\n\nLinks to Code Toggle\n\nPapers with Code *(<a href=\"https://paperswithcode.com/\" target=\"_blank\">What is Papers with Code?</a>)*\n\nScienceCast Toggle\n\nScienceCast *(<a href=\"https://sciencecast.org/welcome\" target=\"_blank\">What is ScienceCast?</a>)*\n\nDemos\n\n# Demos\n\nReplicate Toggle\n\nReplicate *(<a href=\"https://replicate.com/docs/arxiv/about\" target=\"_blank\">What is Replicate?</a>)*\n\nSpaces Toggle\n\nHugging Face Spaces *(<a href=\"https://huggingface.co/docs/hub/spaces\" target=\"_blank\">What is Spaces?</a>)*\n\nSpaces Toggle\n\nTXYZ.AI *(<a href=\"https://txyz.ai\" target=\"_blank\">What is TXYZ.AI?</a>)*\n\nRelated Papers\n\n# Recommenders and Search Tools\n\nLink to Influence Flower\n\nInfluence Flower *(<a href=\"https://influencemap.cmlab.dev/\" target=\"_blank\">What are Influence Flowers?</a>)*\n\nCore recommender toggle\n\nCORE Recommender *([What is CORE?](https://core.ac.uk/services/recommender))*\n\n- Author\n- Venue\n- Institution\n- Topic\n\n[img.svg](data:image...)\n\n[img.svg](data:image...)\n\n[img.svg](data:image...)\n\n[img.svg](data:image...)\n\nAbout arXivLabs\n\n# arXivLabs: experimental projects with community collaborators\n\narXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.\n\nBoth individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.\n\nHave an idea for a project that will add value for arXiv's community? [**Learn more about arXivLabs**](https://info.arxiv.org/labs/index.html).\n\n[img.svg](data:image...)\n\n<a href=\"/auth/show-endorsers/2412.17005\" rel=\"nofollow\">Which authors of this paper are endorsers?</a> \\| <a href=\"javascript:setMathjaxCookie()\" id=\"mathjax_toggle\">Disable MathJax</a> ([What is MathJax?](https://info.arxiv.org/help/mathjax.html))<|endoftext|>"
    }
  },
  "cyber": {
    "train": {
      "total_tokens": 842959874,
      "example": "# Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\n\nMahdi Zaman, Md Saifuddin, Mahdi Razzaghpour, Yaser P. Fallah\nDept. of Electrical & Computer Engineering, Univ. of Central Florida, Orlando, FL\n{mahdizaman, md.saif, razzaghpour.mahdi}@knights.ucf.edu, yaser.fallah@ucf.edu\n\n###### Abstract\n\nCellular-V2X (C-V2X) enables communication between vehicles and other transportation entities over the 5.9GHz spectrum. C-V2X utilizes direct communication mode for safety packet broadcasts (through the usage of periodic basic safety messages) while leaving sufficient room in the resource pool for advanced service applications. While many such ITS applications are under development, it is crucial to identify and optimize the relevant network parameters. In this paper, we envision an infrastructure-assisted transaction procedure entirely carried out by C-V2X, and we optimize it in terms of the service parameters. To achieve the service utility of a transaction class, two C-V2X entities require a successive exchange of multiple messages. With this notion, our proposed application prototype can be generalized for any vehicular service to establish connections on-the-fly. We identify suitable activation zones for vehicles and assess their impact on service efficiency. The results show a variety of potential service and parameter settings that can be appropriate for different use-cases, laying the foundation for subsequent studies.\n\n###### Index Terms:\n\nCellular-V2X, 5G-NR, V2I, Infrastructure, Transaction Service, Intelligent Transportation\n\n## I Introduction\n\nOur current transportation ecosystem consists of diverse categories of vehicles with infrastructural entities providing complimentary assistance in mobility. The evolution trajectory of the transport system is gradually morphing into an increasingly autonomous one. As a massive quantity of resources is spent on developing infrastructure support every year\\[[1](#bib.bib1)\\], efficient infrastructure design is equally crucial. When equipped with radio technology, infrastructures are capable of contributing in efficient transportation, which enables a wide range of service leveraging the connectivity. For users to enjoy quality of service, reliable and scalable communication along with efficient application protocol designs are required. C-V2X has been developed under stringent QoS requirement to ensure such communication. It enables cooperative applications that expand the service utilities beyond the safety-critical services achievable by Basic Safety Message (BSM). The use cases, relevant requirements, and the major key performance indicators (KPI) are elaborated in 3GPP Services and Systems Aspects (SA) and 5G Automotive Association (5GAA) \\[[2](#bib.bib2)\\].\n\nApplications like Advanced and Cooperative Driving leverage Vehicle-to-Vehicle (V2V) and Vehicle-to-Infrastructure (V2I) communications to deploy on-the-fly group formation, transaction etc. In theory, the scope of these applications spans over the whole listening range of C-V2X. However, the applications are by definition utility-oriented and context-aware. This makes the activation zone a critical factor in ensuring the quality of service. For infrastructure-assisted services, the activation zone refers to a virtual trigger line, where vehicles crossing the line can be considered eligible for the service usage. We aim to identify the impact of the trigger line on the efficiency and scalability of a V2I-based service. In this motive, we first design a prototype service which can operate on any C-V2X unit; a Road-Side Unit (RSU) and the vehicle’s C-V2X equipped On Board Unit (OBU) to operate as the service provider and the users, respectively, in a one-to-many fashion. The service utilizes a sequence of messages transmitted over LTE air interface to carry out the service procedure. From an application-centric perspective, we observe how different settings of activation zone affects the service efficiency.\n\nLeveraging communication systems for traffic management has been of decade long research interest. Prior works by Schulz* et al.* \\[[3](#bib.bib3)\\] discuss the features, benefits, and drawbacks of the concurrent communication technologies for this purpose. It sheds light on the Global System for Mobile Commmunication (GSM), short messaging service (SMS), and general packet radio service (GPRS) for traffic management. Although these protocols proved the potential for providing autonomous in-car navigation, they did not scale. As communication protocols evolved and dedicated frequency bandwidth was claimed for transportation services, the vision of efficient traffic management through radio technology accelerated as well.\n\nTraffic management encompasses many different types of applications; prime examples can be intersection management via smart traffic lights, smart parking in urban cities, etc. Djahel* et al.* discusses a protocol-agnostic adaptive traffic management architecture for emergency vehicles \\[[4](#bib.bib4)\\] with the ability to adjust per driving policy and behavior \\[[5](#bib.bib5)\\]. Evolution towards 6G also enables usage of larger bandwidth with low latency, which promises data utilization in platooning \\[[6](#bib.bib6)\\], predictive QoS \\[[7](#bib.bib7)\\] and cooperative perception by leveraging remote vision via V2X \\[[8](#bib.bib8)\\]. In this paper, we explore:\n\n- •\n\n  a general prototype of a V2I-assisted transaction service,\n\n- •\n\n  identifying the major parameters with potential impact on the service performance,\n\n- •\n\n  optimizing the service based on the V2I zone activation.\n\n## II System Model\n\nInfrastructures can play two distinct roles in the traffic system: one as a base station with network assistance capability, and the other as an RSU. The RSU can communicate with surrounding C-V2X entities (OBU and other RSU) and provide enhanced V2X (eV2X) context under both mode-3 (in-coverage) and mode-4 (out of coverage) operations \\[[9](#bib.bib9)\\]. On freeways, this can lead to safer highway entrance and lane merge events. For urban streets, this can introduce no-stop intersections to reduce fleet time. In this work, we assume mode-4 sidelink operation for all communications.\n\nThe prototype under discussion assumes service-specific message transmissions beside BSM. The transmission and reception procedures do not differ for BSM and V2I packets. However, V2I packets need to propagate the context over successive packet exchanges, which is difficult with BSM as it is designed for broadcasting periodic updates. Applications like cooperative driving and collective perception require arbitration-specific message exchange (V2V or V2I), whereas fee collection involves transaction-specific message exchange (V2I) \\[[10](#bib.bib10)\\]. This calls for a (1) modern message set dictionary encompassing BSM, and (2) a handshake algorithm to address the objective by utilizing the context. Because safety applications generate BSM periodically, the lower level procedures including sensing-based semipersistent (SB-SPS) resource selection (under mode-4) are tailored to optimize primarily for periodic reservation of resources. On the contrary, the advanced use cases are opportunist and mission-critical, thus they mostly resort to aperiodic transmission. To the best of authors’ knowledge, no standardized resource allocation procedure currently exists in C-V2X to facilitate such aperiodic transmissions. In \\[[11](#bib.bib11)\\], the authors show the limitations of the current C-V2X protocol in handling aperiodic packets, especially with high traffic arrival rate or large packetsize. Authors in \\[[12](#bib.bib12)\\] suggest amendments in the mode-4 application layer for the resource selection scheme and emphasizes the need for application-oriented evaluation to maintain efficient cooperative awareness. In presence of aperiodic packets, channel efficiency can suffer for similar reasons. However due to lack of directives, we assume that all transmissions in the simulated scenarios employ standardized lower layer procedures including medium access \\[[13](#bib.bib13)\\]. During this stage of transmission, BSMs are subject to congestion control \\[[14](#bib.bib14)\\] and one-shot semipersistent scheduling \\[[15](#bib.bib15)\\]. The one-shot transmissions are proposed to increase aperiodic packet reception chances so one might argue for using one-shot for all V2I transmissions. However one-shot adds more randomness to minimize inter-packet delay \\[[15](#bib.bib15)\\] without selecting a better resource, hence does not particularly help aperiodic transmission. In addition. V2I packets also have a lower ProSe Per Packet Priority and larger packetsize than BSM. These parameters were adopted from \\[[10](#bib.bib10)\\] and summarized in table [I](#S2.T1 \"TABLE I ‣ II-A Service Design & Relevant Assumptions ‣ II System Model ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\")\n\n###\n\nII-A\n\nService Design & Relevant Assumptions\n\n[Refer to caption](/html/2212.13984/assets/x1.png)\n\nFigure 1: Service Procedure in Timeline\n\nFigure [1](#S2.F1 \"Figure 1 ‣ II-A Service Design & Relevant Assumptions ‣ II System Model ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\") serves as a frame of reference to demonstrate the service as a handshake mechanism. The core operational entity is a static RSU that caters to transactions within a preset geo-zone as dictated by the protocol parameters. Vehicles with appropriate subscription and authentication keys can use the service by communicating with the RSU during passing through its geo-zone. The RSU initiates the service by broadcasting a Service Advertisement Message (SAM) periodically at $`\\lbrack T_{1},{T_{1} + {n \\ast 1000}}\\rbrack`$ms $`({n \\in N})`$. SAM consists of necessary information for subscribed users to respond upon their eligibility. A subscribed vehicle moving towards the RSU can store the relevant contents from a SAM. On the vehicle side, the service API prompts a usage request called Service Usage Message (SUM) at time $`T_{2}`$ when it crosses a virtual trigger line ($`d_{t}`$). Our analyses and results is centered around $`d_{t}`$, which is predetermined at a particular distance from the RSU for each scenario. SUM consists of information about service usage intent, timestamp, vehicle ID, etc. Following reception of a SUM, internal procedures at the RSU checks the user’s eligibility for usage and subscription status, following with authentication or discarding of the request based on the assessment. For qualifying users, an acknowledgment packet (ACK) is transmitted at time $`T_{3}`$ with the necessary information to declare the usage as complete for the targetted vehicle. The transaction is marked complete for a particular user when it receives an ACK with its unique ID.\n\nThe actions occur sequentially from $`T_{1}`$ to $`T_{2}`$ and then to $`T_{3}`$. All vehicles within the RSU vicinity can receive SAM at $`T_{1}`$ and store it until reacting accordingly when it crosses trigger line at $`T_{2}`$. So the interval between $`T_{1}`$ and $`T_{2}`$ effectively depends on the time of crossing the trigger. At $`T_{2}`$, the usage intention is conveyed and the actual service usage starts, until it ends with an ACK reception at $`T_{3}`$. To minimize service latency is to minimize interval between $`T_{2}`$ and $`T_{3}`$. In ideal communication where all transmitted packets are received, this interval equals the net intra-layer delay on the OBU and RSU, which can be $`\\lbrack 8\\ 200\\rbrack`$ms. Instead, we set up realistic communication with a stochastic channel model \\[[16](#bib.bib16)\\] where packets can be lost due to interference. If a transmitted SUM is not met with an ACK within a predefined interval ($`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$), the respective UE retransmits SUM. Multiple retransmissions can occur until an ACK is received. Additionally, the RSU does not differentiate between the first ACK from a user and the retransmissions, so it addresses all SUM receptions with an ACK transmission. Notably, the RSU can send ACKs to individual users, or to a group of users ($`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$), or it can wait for $`T_{A\\hspace{0pt}C\\hspace{0pt}K}`$ duration and reply all the SUM received within this duration with a single ACK consisting multiple recipients. We configured this range of control at the RSU with a combination of $`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$ and $`T_{A\\hspace{0pt}C\\hspace{0pt}K}`$, i.e. an ACK is transmitted when either the size of the group of unresponded SUMs reaches $`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$, or when $`T_{A\\hspace{0pt}C\\hspace{0pt}K}`$ time has passed since its previous ACK transmission. Each of these schemes provides a trade-off between promptness and channel occupancy.\nThe service procedures include operations with user information. The exact approaches on maintaining authenticity, availability, and confidentiality \\[[17](#bib.bib17)\\] throughout the procedure are subjects of ongoing discussions under \\[[18](#bib.bib18)\\] and \\[[19](#bib.bib19)\\]. These protocols are also dependent on the message content for SAM, SUM, and ACK, which presumably are different from that of BSM. For the sake of a system-level analysis, we assume these processing overheads are negligible compared to the propagation delays. From the design prototype, the major factors that can affect the Service Completion Time ($`\\Delta\\hspace{0pt}{({T_{3} - T_{2}})}`$) for each vehicle are:\nTrigger Line Distance ($`d_{t}`$) : a vehicle transmits SUM immediately after crossing $`d_{t}`$. Smaller $`d_{t}`$ implies smaller distance between UE, hence the higher reception chances.\nACK Batchsize ($`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$): It determines the number of recipients within a single ACK. While a small $`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$ can crowd the channel with V2I packets, a large $`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$ can inversely affect a subset of users in each batch by adding delay induced by the time to fill a batch.\nSUM Repeat Interval ($`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$): Each vehicle initiates a timer immediately after a SUM transmission. Once the timer reaches $`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$, the user repeats a SUM transmission. Hence a longer $`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$ implies longer wait time for users, while short $`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$ will force more SUM transmission, thereby increasing resource consumption.\nACK Transmission Interval ($`T_{A\\hspace{0pt}C\\hspace{0pt}K}`$): Similar to $`T_{S\\hspace{0pt}U\\hspace{0pt}M}`$ at UE, $`T_{A\\hspace{0pt}C\\hspace{0pt}K}`$ is another count-up timer that runs at RSU. This count starts after every ACK transmission and determines the time for the next ACK transmission. This works simultaneously with $`B_{A\\hspace{0pt}C\\hspace{0pt}K}`$ to generate ACK transmission requests from the upper layers at RSU.\n\nTABLE I: Simulation Parameters & Configurations\n<table>\n<tbody>\n<tr>\n<th>BSM periodicity</th>\n<td>[100 600] ms</td>\n</tr>\n<tr>\n<th>BSM PPPP</th>\n<td>5</td>\n</tr>\n<tr>\n<th>BSM Payload Size</th>\n<td>300 byte</td>\n</tr>\n<tr>\n<th>BSM MCS</th>\n<td>11</td>\n</tr>\n<tr>\n<th>SAM periodicity</th>\n<td>1s</td>\n</tr>\n<tr>\n<th>SAM,SUM,SCM PPPP</th>\n<td>6</td>\n</tr>\n<tr>\n<th>SAM,SUM,SCM Payload Size</th>\n<td>700,450,300 bytes</td>\n</tr>\n<tr>\n<th>SAM,SUM,SCM MCS</th>\n<td>7,11,6</td>\n</tr>\n<tr>\n<th>Trigger Distance (dtsubscript𝑑𝑡d_{t})</th>\n<td>300m, 0m, -100m</td>\n</tr>\n<tr>\n<th>ACK Batchsize (BA​C​Ksubscript𝐵𝐴𝐶𝐾B_{ACK})</th>\n<td>16</td>\n</tr>\n<tr>\n<th>SUM Repeat Interval (TS​U​Msubscript𝑇𝑆𝑈𝑀T_{SUM})</th>\n<td>600ms</td>\n</tr>\n<tr>\n<th>ACK Transmission Interval (TA​C​Ksubscript𝑇𝐴𝐶𝐾T_{ACK})</th>\n<td>400ms</td>\n</tr>\n<tr>\n<th>Traffic Crossing Rate</th>\n<td>10, 20 & 30 veh/sec</td>\n</tr>\n<tr>\n<th>Propagation Loss Model</th>\n<td>I-405 Model [16]</td>\n</tr>\n</tbody>\n</table>\n\nFor different use case and traffic flow, the suitable choice for these parameters can vary. With respect to the RSU, the range of $`d_{t}`$ suggests three options: (1) a positive trigger line implies users approaching the RSU will transmit SUM before reaching the RSU, (2) a negative trigger implies users cross the RSU first and transmit SUM afterward, and (3) a $`0\\hspace{0pt}m`$ trigger line implies users transmit SUM right while crossing the RSU. In the following sections, we present analysis of these settings for $`d_{t}`$ before presenting the results in section [IV](#S4 \"IV Simulation Results ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\"). Analyses on the other three parameters are also under investigation and will be published in future works.\n\n###\n\nII-B\n\nAnalysis and Optimization\n\nIn this section, a system-level analysis of the service protocol is described. To achieve service completion during one runtime, both entities need to convey the context to each other through handshake (figure [1](#S2.F1 \"Figure 1 ‣ II-A Service Design & Relevant Assumptions ‣ II System Model ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\")), starting with a SUM transmission and ending with an ACK reception. Reception probabilities in wireless radio networks are measured by Packet Error Rate (PER); the percentage of lost packets over all transmitted packets in a network. PER can be expressed as a function of vehicle density and distance between communicating pair. In case of the service under experimentation, each V2I link is formed once a user crosses the trigger. Hence, the success of the service can be impacted by $`d_{t}`$ can directly impact the PER of the V2I packets.\n\nFor the basis of steady-state analysis, we assume no vehicle is entering or leaving the RSU coverage during $`1\\hspace{0pt}m\\hspace{0pt}s`$ time, which is the resource granularity for C-V2X physical layer. We assume that the vehicles arrive at $`d_{t}`$ following a Poisson process. Hence the SUM packets which are generated by each vehicle follows the same distribution. A constant $`{30\\hspace{0pt}m}/s`$ velocity profile is used for mobility. Recall that a successful service cycle requires consecutive SUM and ACK reception. In terms of PER, the success probability of such case with $`d_{t}`$ is a product of individual reception probability for the two packets (SUM and ACK) as $`P{(d_{t})} \\times P\\left( d_{t} - \\left. (\\tau \\times \\overline{v} \\right) \\right.`$, where $`P\\hspace{0pt}{(d_{t})}`$ is the probability of success at distance $`d_{t}`$, $`\\overline{v}`$ is the average speed of the UEs and $`\\tau`$ is the small duration between a SUM reception at RSU and the corresponding ACK reception at UE. $`\\tau`$ depends on the lower layer operations of C-V2X for priority management and physical resource allocation.\n\nOnce a successful cycle occurs, no further exchange is required between that RSU-OBU pair for this service. Thus for efficiency, the service needs to be completed in as early as possible, with the least possible count of attempts. Hence, maximizing the expected service utility implies maximizing the probability of success at the first attempt by UE given a specific $`d_{t}`$. In case of failed first attempt, the second (and all following) transmission carries equal significance. Probability of first success at $`n^{t\\hspace{0pt}h}`$ trial is therefore given by\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>P(1s​t success after n trial)=P(success at nt​h trial)×\\displaystyle P(\\text{$1^{st}$ success after n trial})=P(\\text{success at $n^{th}$ trial})\\times</td>\n<td></td>\n<td rowspan=\"2\">(1)</td>\n</tr>\n<tr>\n<td></td>\n<td>∏m=1n−1(1−P​(success at mt​h trial))superscriptsubscriptproduct𝑚1𝑛11𝑃success at mth trial\\displaystyle\\prod_{m=1}^{n-1}(1-P(\\text{success at $m^{th}$ trial}))</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nEach of the attempts made by individual UEs are separated in time by $`T_{S\\hspace{0pt}U\\hspace{0pt}M} = {0.6\\hspace{0pt}s}`$. The joint reception probability at the $`n^{t\\hspace{0pt}h}`$ trial therefore can be expressed as a product of success probabilities at the distances in each SUM attempts:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>P(dt−0.6nv¯)×P(dt−(0.6n+τ)v¯)×∏m=1n−1[1−P(dt−0.6mv¯)−P(dt−(0.6m+τ)v¯)+P(dt−0.6mv¯)×P(dt−(0.6m+τ)v¯)]𝑃subscript𝑑𝑡0.6𝑛¯𝑣𝑃subscript𝑑𝑡0.6𝑛𝜏¯𝑣superscriptsubscriptproduct𝑚1𝑛1delimited-[]1𝑃subscript𝑑𝑡0.6𝑚¯𝑣𝑃subscript𝑑𝑡0.6𝑚𝜏¯𝑣𝑃subscript𝑑𝑡0.6𝑚¯𝑣𝑃subscript𝑑𝑡0.6𝑚𝜏¯𝑣P(d_{t}-0.6n\\bar{v})\\times P\\big{(}d_{t}-(0.6n+\\tau)\\bar{v}\\big{)}\\times\\\\ \\prod_{m=1}^{n-1}\\bigg{[}1-P(d_{t}-0.6m\\bar{v})-P\\big{(}d_{t}-(0.6m+\\tau)\\bar{v}\\big{)}\\\\ +P(d_{t}-0.6m\\bar{v})\\times P\\big{(}d_{t}-(0.6m+\\tau)\\bar{v}\\big{)}\\bigg{]}</td>\n<td></td>\n<td>(2)</td>\n</tr>\n</tbody>\n</table>\n\nWe noted the theoretical implication of this algorithm in terms of the average number of UE attempts made before achieving first success. Figure [2](#S2.F2 \"Figure 2 ‣ II-B Analysis and Optimization ‣ II System Model ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\") shows the mean attempts required for completion across a range of trigger distances, where $`d_{t} = {0\\hspace{0pt}m}`$ outperforms the others. This implies the best service utility can be achieved with a trigger being located right below the RSU. Since PER gradually increases with traffic density, we have higher average required attempts for higher densities. In the case of positive trigger distances, the probability of success for consecutive attempts increases while vehicles approach RSU. For negative trigger distance, since vehicles move away from the RSU, this change in success probability is the opposite. In Figure [2](#S2.F2 \"Figure 2 ‣ II-B Analysis and Optimization ‣ II System Model ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\"), the model depicts this fact by producing a skewed left-half with a greater slope than its positive counterpart on the right.\n\n[Refer to caption](/html/2212.13984/assets/x2.png)\n\nFigure 2: Mean number of attempts for successful completion (analytical)\n\n## III Experiment Setup\n\nIn the this section, we discuss the implementation of the network and the scenario in the simulator.\n\n###\n\nIII-A\n\nDeployment in Simulator\n\nWe deployed the service prototype in a link-level network simulator equipped with C-V2X protocol layers. The high-fidelity simulator has been developed over several years to simulate the communication protocol as specified in 3GPP and SAE standards for release 14 with incremental upgrades as they were released\\[[20](#bib.bib20)\\].\n\n###\n\nIII-B\n\nScenario Description\n\n[Refer to caption](/html/2212.13984/assets/x3.png)\n\nFigure 3: Freeway Service Zone with RSU and Trigger Lines under test\n\nWe modeled a 3km, 16 lanes bidirectional freeway. An RSU situated mid-stretch (Figure [3](#S3.F3 \"Figure 3 ‣ III-B Scenario Description ‣ III Experiment Setup ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\")) which provides V2X services to traffic along both directions. The simulator mimics realistic communication between different entities with BSM and V2I service-specific messages. All vehicles are capable of exchanging BSM (among themselves) and service packets (with the RSU) through C-V2X sidelink channel. The traffic density is characterized in terms of traffic flow rate.\n\n[Refer to caption](/html/2212.13984/assets/x4.png)\n\nFigure 4: Service Completion Time for 10,20,30 veh/s\n\n## IV Simulation Results\n\nWe present Service Completion Time (Figure [4](#S3.F4 \"Figure 4 ‣ III-B Scenario Description ‣ III Experiment Setup ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\")) and the average number of SUM attempts (Figure [5](#S4.F5 \"Figure 5 ‣ IV Simulation Results ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\")) for performance assessment. Service Completion Time (SCT) refers to the time a UE is occupied with the transaction procedure. Specifically, for a particular UE, the tolling process effectively starts with the transmission of the first SUM and ends with the reception of an ACK. This time gap is gauged for all participating UEs and presented as SCT. In Figure [4](#S3.F4 \"Figure 4 ‣ III-B Scenario Description ‣ III Experiment Setup ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\"), we show SCT for three trigger distances under observation. The figure demonstrates a range of traffic flow rates spanning from medium-low to high. Within each flow rate group, SCT with 0m trigger outperforms the other trigger settings, followed by -100m trigger.\n\nSince the reception probability can be the highest right below the RSU, the 0m setting shows the smallest 90th%tile SCT. While differentiating between 300m and -100m, it should be noted that packet reception can be comparable at equal absolute Transmitter-Receiver distance, so reception probability at $`d_{t} = {- {100\\hspace{0pt}m}}`$ is equivalent to $`d_{t} = {100\\hspace{0pt}m}`$, which is higher than that at $`d_{t} = {300\\hspace{0pt}m}`$. This translates to lower SCT for $`d_{t} = {- {100\\hspace{0pt}m}}`$. However, the relative direction of UE motion with respect to RSU have the potential to impact the service performance. Let’s consider a group of vehicles moving towards an RSU (V1), and another group (V2) moving away. V1 will gradually get a higher success probability in the spatial sense, while V2 spatially moves to a lower success probability. Since there is a time gap between each UE’s SUM and it’s corresponding ACK, V1 gets a systematic advantage. The same event can occur for both SUM and ACK. If repetition is required for either of them, V1 would enjoy higher success probability for those repeated trials in comparison to V2.\n\nIn order to capture the number of attempts made by the UEs to complete the procedure, the mean number of attempts across the same range of traffic flow rates are plotted in Figure [5](#S4.F5 \"Figure 5 ‣ IV Simulation Results ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\"). While 0m trigger results shows reliable completion with the expense of 1 attempt by UEs, -100m trigger results in a trend of increasing attempts in case of higher flow rates. This increment is more rapid for 300m trigger line. Overall, 0m outperforms the others with the least number of average attempts. Throughout different configurations, BSM reception shows no advert impact in terms of PER, as shown in Figure [6](#S5.F6 \"Figure 6 ‣ V Concluding Remarks ‣ Performance Analysis of V2I Zone Activation and Scalability for C-V2X Transactional Services\").\n\n[Refer to caption](/html/2212.13984/assets/x5.png)\n\nFigure 5: Mean number of attempts for successful completion (empirical)\n\n## V Concluding Remarks\n\nInfrastructure-based services are one of the fundamental blocks for the next-generation intelligent transportation systems. In this paper, we explored various strategies that can be used in the communication between infrastructure and vehicles, especially where a transaction occurs. We demonstrated that a trigger line adjacent to the RSU outperforms trigger lines before or after the RSU. While this suggests the superiority of a particular setting, it may not be a one-stop solution for all traffic scenarios or applications because of different sets of case-specific requirements. While in this paper, we focus on the impact of trigger-line location choice, the findings can provide further performance optimizations. We are currently conducting further research on the impact of other parameters of transactional protocols; the findings from this paper will allow a smaller search space for optimal solutions.\n\n[Refer to caption](/html/2212.13984/assets/x6.png)\n\nFigure 6: Packet Error Rate for BSM Reception\n\n## References\n\n- \\[1\\] “Usdot outlines its vision for the future of the country’s streets - national association of city transportation officials.” <a href=\"https://nacto.org/2021/12/16/fhwa-vision-for-infrastructure-spending/\" target=\"_blank\">https://nacto.org/2021/12/16/fhwa-vision-for-infrastructure-spending/</a>.\n\n- \\[2\\]\n\n  M. B. et al, “Connected roads of the future: Use cases, requirements, and design considerations for vehicle-to-everything communications,” IEEE Vehicular Technology Magazine, vol. 13, no. 3, pp. 110–123, 2018.\n\n- \\[3\\]\n\n  W. Schulz, “Traffic management improvement by integrating modern communication systems,” IEEE Communications Magazine, vol. 34, no. 10, pp. 56–60, 1996.\n\n- \\[4\\]\n\n  S. Djahel, M. Salehie, I. Tal, and P. Jamshidi, “Adaptive traffic management for secure and efficient emergency services in smart cities,” in 2013 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops), IEEE, 2013.\n\n- \\[5\\] A. Jami, M. Razzaghpour, H. Alnuweiri, and Y. P. Fallah, “Augmented driver behavior models for high-fidelity simulation study of crash detection algorithms,” 2022.\n\n- \\[6\\]\n\n  M. Razzaghpour, A. Datar, D. Schneider, M. Zaman, H. Werner, H. Frey, J. M. Velni, and Y. P. Fallah, “Finite state markov modeling of c-v2x erasure links for performance and stability analysis of platooning applications,” in 2022 IEEE International Systems Conference (SysCon), pp. 1–8, 2022.\n\n- \\[7\\]\n\n  M. Boban, M. Giordani, and M. Zorzi, “Predictive quality of service (pqos): The next frontier for fully autonomous systems,” arXiv preprint arXiv:2109.09376, 2021.\n\n- \\[8\\]\n\n  R. Xu, H. Xiang, Z. Tu, X. Xia, M.-H. Yang, and J. Ma, “V2x-vit: Vehicle-to-everything cooperative perception with vision transformer,” arXiv preprint arXiv:2203.10638, 2022.\n\n- \\[9\\]\n\n  C.-P. et al, “Prototyping and evaluation of infrastructure-assisted transition of control for cooperative automated vehicles,” IEEE Transactions on Intelligent Transportation Systems, pp. 1–17, 2021.\n\n- \\[10\\] S. International, “Profiles for v2x-based fee collection,” Standard Doc J3217, Society of Automotive Engineers, 11 2019. Issued 2019-11, WIP.\n\n- \\[11\\]\n\n  L. Lusvarghi and M. L. Merani, “On the coexistence of aperiodic and periodic traffic in cellular vehicle-to-everything,” IEEE Access, vol. 8, pp. 207076–207088, 2020.\n\n- \\[12\\]\n\n  P. Wendland, G. Schaefer, and R. Thomä, “An application-oriented evaluation of lte-v’s mode 4 for v2v communication,” in Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, 2019.\n\n- \\[13\\] 3gpp, “Evolved universal terrestrial radio access (e-utra); medium access control (mac) protocol specification (v14.7.0),” tech. rep., July 2018.\n\n- \\[14\\] S. International, “Lte vehicle-to-everything (lte-v2x) deployment profiles and radio parameters for single radio channel multi-service coexistence,” Standard Doc J3161, Society of Automotive Engineers, 04 2022.\n\n- \\[15\\]\n\n  A. Bazzi, C. Campolo, A. Molinaro, A. O. Berthet, B. M. Masini, and A. Zanella, “On wireless blind spots in the c-v2x sidelink,” IEEE Transactions on Vehicular Technology, vol. 69, no. 8, 2020.\n\n- \\[16\\]\n\n  E. E. Marvasti, S. M. O. Gani, and Y. P. Fallah, “A statistical approach toward channel modeling with application to large-scale censored data,” IEEE Transactions on Intelligent Transportation Systems, pp. 1–12, 2020.\n\n- \\[17\\]\n\n  A. Ghosal and M. Conti, “Security issues and challenges in v2x: A survey,” Computer Networks, vol. 169, p. 107093, 2020.\n\n- \\[18\\] 3gpp, “Security aspect for lte support of vehicle-to-everything (v2x) services specification \\# 33.185,” tech. rep., July 2020.\n\n- \\[19\\] 3gpp, “Architecture enhancements for v2x services (release 15), specification \\# 23.285,” tech. rep., Mar 2018.\n\n- \\[20\\]\n\n  T. et al, “Multiple access in cellular v2x: Performance analysis in highly congested vehicular networks,” in 2018 IEEE Vehicular Networking Conference (VNC), pp. 1–8, 2018.<|endoftext|>"
    },
    "test": {
      "total_tokens": 93883449,
      "example": "# Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\n\nChuang Zhang, Geng Sun1,  Jiahui Li, Qingqing Wu,  Jiacheng Wang, Dusit Niyato,  and Yuanwei Liu This study is supported in part by the National Natural Science Foundation of China (62172186, 62272194), and in part by the Science and Technology Development Plan Project of Jilin Province (20230201087GX). (Corresponding author: Geng Sun.) Chuang Zhang and Jiahui Li are with the College of Computer Science and Technology, Jilin University, Changchun 130012, China, and also with the Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China. E-mail: chuangzhang1999@gmail.com, lijiahui0803@foxmail.com. Geng Sun is with the College of Computer Science and Technology, Jilin University, Changchun 130012, China, and also with the College of Computing and Data Science, Nanyang Technological University, Singapore 639798. E-mail: sungeng@jlu.edu.cn). Qingqing Wu is with the Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China. E-mail: qingqingwu@sjtu.edu.cn. Jiacheng Wang and Dusit Niyato are with the College of Computing and Data Science, Nanyang Technological University, Singapore 639798. E-mail: jiacheng.wang@ntu.edu.sg, dniyato@ntu.edu.sg. Yuanwei Liu is with the School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, U.K. E-mail: yuanwei.liu@qmul.ac.uk.\n\n###### Abstract\n\nDue to flexibility and low-cost, unmanned aerial vehicles (UAVs) are increasingly crucial for enhancing coverage and functionality of wireless networks. However, incorporating UAVs into next-generation wireless communication systems poses significant challenges, particularly in sustaining high-rate and long-range secure communications against eavesdropping attacks. In this work, we consider a UAV swarm-enabled secure surveillance network system, where a UAV swarm forms a virtual antenna array to transmit sensitive surveillance data to a remote base station (RBS) via collaborative beamforming (CB) so as to resist mobile eavesdroppers. Specifically, we formulate an aerial secure communication and energy efficiency multi-objective optimization problem (ASCEE-MOP) to maximize the secrecy rate of the system and to minimize the flight energy consumption of the UAV swarm. To address the non-convex, NP-hard and dynamic ASCEE-MOP, we propose a generative diffusion model-enabled twin delayed deep deterministic policy gradient (GDMTD3) method. Specifically, GDMTD3 leverages an innovative application of diffusion models to determine optimal excitation current weights and position decisions of UAVs. The diffusion models can better capture the complex dynamics and the trade-off of the ASCEE-MOP, thereby yielding promising solutions. Simulation results highlight the superior performance of the proposed approach compared with traditional deployment strategies and some other deep reinforcement learning (DRL) benchmarks. Moreover, performance analysis under various parameter settings of GDMTD3 and different numbers of UAVs verifies the robustness of the proposed approach.\n\n###### Index Terms:\n\nSecure communications, collaborative beamforming, unmanned aerial vehicle, deep reinforcement learning, generative diffusion models.\n\n## 1 Introduction\n\nUnmanned aerial vehicles (UAVs), noted for their flexibility and low-cost, have become increasingly pivotal in various sectors, including military surveillance \\[[1](#bib.bib1)\\], environmental monitoring \\[[2](#bib.bib2)\\], and emergency response \\[[3](#bib.bib3)\\], etc. With the widespread deployment of the sixth generation (6G) wireless networks, UAVs are foreseen to play a crucial role in wireless networks as well as key enablers of innovative wireless applications \\[[4](#bib.bib4)\\]. For instance, UAVs can serve as the mobile aerial base stations \\[[5](#bib.bib5)\\] to support temporary and instant network coverage, which is especially valuable when the ground infrastructure is disrupted or the network capacity is insufficient to meet the demands. Moreover, UAVs can function as the aerial relays \\[[6](#bib.bib6)\\] for connecting the ground users to the distant base stations and extending the coverage, particularly in rural and remote areas. Furthermore, UAVs can also access the wireless network by acting as the mobile users \\[[7](#bib.bib7)\\], enabling them to obtain real-time data and support various applications such as precision agriculture, aerial goods delivery, and environmental monitoring.\n\nAlthough the UAVs offer significant advantages in enhancing the coverage and functionality of wireless networks, integrating them into the next-generation wireless communication and network systems also raises some crucial challenges. Specifically, maintaining high-rate and long-range communications simultaneously with a single UAV can be difficult due to the limited onboard power and potential interference \\[[8](#bib.bib8)\\]. Moreover, the broadcast nature of wireless channels makes sensitive information vulnerable to eavesdropping attacks, and this vulnerability is further exacerbated in UAV-involved communications due to the high line-of-sight (LoS) probability of links \\[[9](#bib.bib9)\\]. Although the traditional high-layer encryption and decryption techniques aim to protect data confidentiality, the advancing computing capabilities of eavesdroppers demand increasingly sophisticated algorithms, resulting in the higher computational overhead and intricate key management, which are unfeasible for UAV-involved communication systems \\[[10](#bib.bib10)\\].\n\nCollaborative beamforming (CB) has arisen as a potential solution to the above challenges \\[[11](#bib.bib11)\\], \\[[12](#bib.bib12)\\]. Specifically, multiple UAVs can work cooperatively to construct a UAV-enabled virtual antenna array (UVAA), thereby enhancing the signal strength and directivity, which not only extends the communication range but also improves the overall secrecy rate by effectively concentrating the radiated energy in the desired direction. However, there exists a fundamental trade-off between the secure communication performance and energy consumption in the UVAA system design. In particular, to achieve an optimal beam pattern and maximize the secure transmission rate, all participating UAVs need to relocate to more suitable positions and readjust their excitation current weights, causing the increasing of the energy. Moreover, the UAVs of UVAA need to continuously adjust their positions if mobile eavesdroppers exist, which further results in additional flight energy consumption. Thus, the UVAA system must be carefully designed to balance the objectives of improving the secrecy rate of the system and reducing the flight energy consumption of the UAV swarm.\n\nTraditional optimization methods, such as convex optimization \\[[13](#bib.bib13)\\] and evolutionary strategies \\[[12](#bib.bib12)\\], have been employed to deal with the optimization problems of UVAA. However, these methods may be impractical in dynamic environments due to the mobility of eavesdroppers and time-varying channel characteristics. Deep reinforcement learning (DRL) presents a compelling alternative, offering the capability to adapt to the changing conditions. It can learn optimal strategies through interactions with the environment, eliminating the need for prior knowledge and achieving near-optimal performance. Thus, DRL has been demonstrated to have great potential in wireless network optimizations \\[[14](#bib.bib14)\\]. Nevertheless, standard DRL techniques may encounter challenges in representing the complex and high-dimensional action space required for the joint optimization of excitation current weights and positions of UAVs in UVAA. Specifically, traditional DRL methods typically use stacked fully-connected layers in the actor network, which may struggle to capture deeper data features \\[[15](#bib.bib15)\\]. As a result, these algorithms usually exhibit high variance, leading to a learned policy distribution that deviates from the true data distribution.\n\nRecent developments in generative artificial intelligence, notably in generative diffusion models, have advanced the effective representation of complex data distributions \\[[16](#bib.bib16)\\]. Consequently, in this study, we delve into the combination of DRL and generative diffusion models to tackle the multi-objective optimization problem in UVAA system, aimed at countering the presence of mobile eavesdroppers. The main contributions of this paper are summarized as follows:\n\n- •\n\n  UAV Swarm-enabled Secure Surveillance Network System: We propose a novel UAV swarm-enabled secure surveillance network system under the threat of mobile eavesdroppers. In this system, a UAV swarm performs CB to enhance the signal strength and directivity, thereby ensuring the secure communications between the UAV swarm and the remote base station (RBS). To the best of our knowledge, this is the first work that focuses on mobile eavesdroppers in the context of UAV-enabled CB secure communications, which is directly applicable real-world scenarios.\n\n- •\n\n  Multi-objective Optimization Problem Formulation: We formulate an aerial secure communication and energy efficiency multi-objective optimization problem (ASCEE-MOP), with the objective of maximizing the secrecy rate between UAV swarm and RBS while minimizing the flight energy consumption of the UAV swarm by jointly optimizing the excitation current weights and positions of UAVs. Moreover, we show that the formulated ASCEE-MOP is a non-convex, NP-hard and dynamic optimization problem involving the complex trade-off, rendering it challenging to solve using traditional convex optimization techniques and evolutionary methods.\n\n- •\n\n  Generative Diffusion Model-enabled DRL Approach Design: To deal with the non-convexity and dynamic nature of the formulated ASCEE-MOP, we re-formulate it as a Markov decision process, and address it by the DRL framework. Specifically, we propose a generative diffusion model-enabled twin delayed deep deterministic policy gradient (GDMTD3) method, which integrates the generative diffusion models within twin delayed deep deterministic policy gradient (TD3) algorithm. By utilizing the generation and inference capabilities of diffusion model, the proposed GDMTD3 can capture the complex probabilistic distribution more effectively in the high-dimensional action spaces.\n\n- •\n\n  Simulation Validation: Simulation results are provided to demonstrate the effectiveness and robustness of the proposed approach. Specifically, compared with four deployment policies and five DRL benchmarks, the proposed approach exhibits superior performance. To further verify to the robustness, we conduct the performance analysis of the proposed GDMTD3 under various parameter settings and varying numbers of UAVs.\n\nThe remainder of this paper is structured as follows. An overview of related work is provided in Section [2](#S2 \"2 Related Work ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"). Section [3](#S3 \"3 System Model ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") outlines the system model. Next, the optimization problem is formulated and analyzed in Section [4](#S4 \"4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"). Section [5](#S5 \"5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") details the GDMTD3 for addressing the formulated optimization problem. Simulation results are listed and discussed in Section [6](#S6 \"6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), and the conclusion of the paper is presented in Section [7](#S7 \"7 Conclusion ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\").\n\nTABLE I: Major Notions\n<table>\n<thead>\n<tr>\n<th></th>\n<th>Symbols</th>\n<th>Definition</th>\n<th>Symbols</th>\n<th>Definition</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td rowspan=\"18\">System Model</td>\n<td>𝒦𝒦\\mathcal{K}</td>\n<td>Set of UAV indexes, |𝒦|=K𝒦𝐾|\\mathcal{K}|=K</td>\n<td>𝒘Bsubscript𝒘𝐵\\bm{w}_{B}</td>\n<td>Coordinate of BS</td>\n</tr>\n<tr>\n<td>N𝑁N</td>\n<td>Total number of time slots</td>\n<td>𝒒kUsuperscriptsubscript𝒒𝑘𝑈\\bm{q}_{k}^{U}</td>\n<td>Coordinate of UAV k𝑘k</td>\n</tr>\n<tr>\n<td>IkUsuperscriptsubscript𝐼𝑘𝑈I_{k}^{U}</td>\n<td>Excitation current weight of UAV k𝑘k</td>\n<td>𝒒csubscript𝒒𝑐\\bm{q}_{c}</td>\n<td>Coordinate of UVAA center</td>\n</tr>\n<tr>\n<td>θ,φ𝜃𝜑\\theta,\\varphi</td>\n<td>Elevation and azimuth angles</td>\n<td>𝒒Esubscript𝒒𝐸\\bm{q}_{E}</td>\n<td>Coordinate of mobile eavesdropper</td>\n</tr>\n<tr>\n<td>A​F𝐴𝐹AF</td>\n<td>Array factor of UVAA</td>\n<td>ΨksubscriptΨ𝑘\\Psi_{k}</td>\n<td>Initial phase of UAV k𝑘k</td>\n</tr>\n<tr>\n<td>c0,c1subscript𝑐0subscript𝑐1c_{0},c_{1}</td>\n<td>Two constants depending on wireless environment</td>\n<td>cpsubscript𝑐𝑝c_{p}</td>\n<td>Phase constant</td>\n</tr>\n<tr>\n<td>dc,𝒮,dc,Esubscript𝑑𝑐𝒮subscript𝑑𝑐𝐸d_{c,\\mathcal{S}},d_{c,E}</td>\n<td>Distances between UVAA and BS/eavesdropper</td>\n<td>λ𝜆\\lambda</td>\n<td>Wavelength</td>\n</tr>\n<tr>\n<td>Pc,𝒮LoS,Pc,ELoSsuperscriptsubscript𝑃𝑐𝒮LoSsuperscriptsubscript𝑃𝑐𝐸LoSP_{c,\\mathcal{S}}^{\\text{LoS}},P_{c,E}^{\\text{LoS}}</td>\n<td>LoS link probability between UVAA and BS/eavesdropper</td>\n<td>c,fc𝑐subscript𝑓𝑐c,f_{c}</td>\n<td>Light speed and Carrier frequency</td>\n</tr>\n<tr>\n<td>L¯c,𝒮subscript¯𝐿𝑐𝒮\\overline{L}_{c,\\mathcal{S}}</td>\n<td>Average pass loss between UVAA and BS</td>\n<td>ξ𝜉\\xi</td>\n<td>Elevation between UVAA and BS</td>\n</tr>\n<tr>\n<td>gc,𝒮,gc,ℰsubscript𝑔𝑐𝒮subscript𝑔𝑐ℰg_{c,\\mathcal{S}},g_{c,\\mathcal{E}}</td>\n<td>Channel gain between UVAA and BS/eavesdropper</td>\n<td>μ1,μ2subscript𝜇1subscript𝜇2\\mu_{1},\\mu_{2}</td>\n<td>Excessive path loss for LoS and NLoS links</td>\n</tr>\n<tr>\n<td>GU,𝒮,GU,Esubscript𝐺𝑈𝒮subscript𝐺𝑈𝐸G_{U,\\mathcal{S}},G_{U,E}</td>\n<td>Antenna gain of UVAA towards BS/eavesdropper</td>\n<td>α𝛼\\alpha</td>\n<td>Path loss exponent</td>\n</tr>\n<tr>\n<td>RU,𝒮,RU,Esubscript𝑅𝑈𝒮subscript𝑅𝑈𝐸R_{U,\\mathcal{S}},R_{U,E}</td>\n<td>Transmission rate from UVAA to BS/eavesdropper</td>\n<td>B𝐵B</td>\n<td>Transmission bandwidth</td>\n</tr>\n<tr>\n<td>σ2superscript𝜎2\\sigma^{2}</td>\n<td>Noise power of A2G channel</td>\n<td>RS​Esubscript𝑅𝑆𝐸R_{SE}</td>\n<td>Achievable secrecy rate of A2G link</td>\n</tr>\n<tr>\n<td>vkx,vky,vkzsuperscriptsubscript𝑣𝑘𝑥superscriptsubscript𝑣𝑘𝑦superscriptsubscript𝑣𝑘𝑧v_{k}^{x},v_{k}^{y},v_{k}^{z}</td>\n<td>x/y/z𝑥𝑦𝑧x/y/z-axis component speed of the UAV k𝑘k</td>\n<td>ρ𝜌\\rho</td>\n<td>Density of air</td>\n</tr>\n<tr>\n<td>W𝑊W</td>\n<td>Weight of UAV</td>\n<td>A𝐴A</td>\n<td>Total area of UAV rotor disks</td>\n</tr>\n<tr>\n<td>v0subscript𝑣0v_{0}</td>\n<td>Mean rotor induced velocity for hovering</td>\n<td>d0subscript𝑑0d_{0}</td>\n<td>Fuselage drag ratio</td>\n</tr>\n<tr>\n<td>s𝑠s</td>\n<td>Rotor solidity</td>\n<td>Plevelksuperscriptsubscript𝑃level𝑘P_{\\text{level}}^{k}</td>\n<td>Induced power of UAV k𝑘k for level flight</td>\n</tr>\n<tr>\n<td>Pverticalksuperscriptsubscript𝑃vertical𝑘P_{\\text{vertical}}^{k}</td>\n<td>Power of UAV k𝑘k for vertical flight</td>\n<td>E𝐸E</td>\n<td>Energy consumption of UAV swarm</td>\n</tr>\n<tr>\n<td rowspan=\"6\">Algorithm</td>\n<td>𝒮,𝒔𝒮𝒔\\mathcal{S},\\bm{s}</td>\n<td>State space and state vector of environment</td>\n<td>𝒜,𝒂𝒜𝒂\\mathcal{A},\\bm{a}</td>\n<td>Action space and action vector of agent</td>\n</tr>\n<tr>\n<td>𝒫𝒫\\mathcal{P}</td>\n<td>State transition probability of environment</td>\n<td>ℛ,rℛ𝑟\\mathcal{R},r</td>\n<td>Reward space and reward</td>\n</tr>\n<tr>\n<td>γ𝛾\\gamma</td>\n<td>Discount factor</td>\n<td>d𝑑d</td>\n<td>Frequency of policy update</td>\n</tr>\n<tr>\n<td>𝜽𝑸𝒊,𝜽𝑸𝒊′subscript𝜽subscript𝑸𝒊superscriptsubscript𝜽subscript𝑸𝒊bold-′\\bm{\\theta_{Q_{i}}},\\bm{\\theta_{Q_{i}}^{\\prime}}</td>\n<td>Parameters of the i𝑖ith critic network and target critic network</td>\n<td>𝑸​(𝒔,𝒂)𝑸𝒔𝒂\\bm{Q}(\\bm{s},\\bm{a})</td>\n<td>State-action value function</td>\n</tr>\n<tr>\n<td>𝜽𝒅,𝜽𝒅′subscript𝜽𝒅superscriptsubscript𝜽𝒅bold-′\\bm{\\theta_{d}},\\bm{\\theta_{d}^{\\prime}}</td>\n<td>Parameters of actor network and target actor network</td>\n<td>𝜿𝜽𝒅​(𝒙t,t,𝒈)subscript𝜿subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈\\bm{\\kappa_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g})</td>\n<td>Mean function of diffusion reverse process</td>\n</tr>\n<tr>\n<td>𝒙𝒕subscript𝒙𝒕\\bm{x_{t}}</td>\n<td>Noisy sample at the t𝑡tth denoising step</td>\n<td>β~tsubscript~𝛽𝑡\\tilde{\\beta}_{t}</td>\n<td>Predetermined variance factor</td>\n</tr>\n</tbody>\n</table>\n\nNotations: We use plain symbols to stand for scalars (e.g., $`a,b`$), bold symbols for vectors or functions (e.g., $`{\\mathbf{a}},{\\mathbf{b}}`$), and calligraphic symbols for sets (e.g., $`\\mathcal{A},\\mathcal{B}`$). $`\\parallel \\cdot \\parallel`$ represents Euclidean norm, and $`\\left\\{ \\cdot \\right\\}^{+}`$ refers to $`\\max{\\{ 0, \\cdot \\}}`$. Accordingly, Table [I](#S1.T1 \"TABLE I ‣ 1 Introduction ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") outlines the major notions adopted in the following sections.\n\n## 2 Related Work\n\nIn this section, we discuss related works on UAV-enabled secure communications, optimization objectives in aerial secure communications, and optimization methods for aerial secure communications.\n\n### 2.1 UAV-enabled Aerial Secure Communications\n\nA number of prior works have concentrated on utilizing UAVs to enhance the security performance of wireless communications. In terms of the number of UAVs, the existing works can primarily be categorized into the single UAV-enabled secure communications and multiple UAVs-enabled secure communications.\n\nFor the single UAV-enabled secure communications, Zhang et al. \\[[17](#bib.bib17)\\] investigated the security of both UAV-to-ground and ground-to-UAV communications to mitigate the risk posed by an stationary eavesdropper. Cheng et al. \\[[18](#bib.bib18)\\] introduced a secure scheme to maximize the secrecy rate of the UAV-enabled wireless relay networks with caching, where a UAV is employed to relay the data from the base station to the users, leveraging its mobility. In \\[[19](#bib.bib19)\\], the authors considered a secure UAV mobile edge computing system, where a legitimate UAV assists in processing large computing tasks offloaded from multiple ground users in the presence of multiple eavesdropping UAVs. Moreover, Sun et al. \\[[20](#bib.bib20)\\] explored UAV-enabled downlink mmWave simultaneous wireless information and power transfer (SWIPT) networks, involving two types of authorized users with different communication needs and multiple passive eavesdroppers modeled by independent homogeneous Poisson point processes. In \\[[21](#bib.bib21)\\], the authors studied a UAV-enabled mobile jamming strategy to enhance the secrecy rate of ground wiretap channels.\n\nFor multiple UAVs-enabled secure communications, Cai et al. \\[[22](#bib.bib22)\\] explored a joint optimization strategy for the trajectory and resource allocation of the UAV communication systems. In their approach, one UAV acts as an information transmitter while another one serves as an assisting jammer to enhance the energy efficiency and security. In \\[[23](#bib.bib23)\\], the authors presented a dynamic role-switching strategy, where the UAVs act as data collectors or jammers based on their locations to serve multiple ground users. Hanna et al. \\[[24](#bib.bib24)\\] achieved the reliable beamforming by considering estimation errors and employing a Kalman filter for frequency tracking, with validation through simulations and experiments on software-defined radios and UAVs.\n\nHowever, these aforementioned works focus on non-remote communication settings due to the limited energy of UAVs. Moreover, they primarily consider secure communication scenarios involving static eavesdroppers.\n\n### 2.2 Optimization Objectives in Aerial Secure Communications\n\nOptimization objectives have a significant role in enhancing the performance and security of UAV-enabled secure communications. Previous research has predominantly concentrated on two aspects that are the secrecy rate and flight energy consumption of UAVs.\n\nThe secrecy rate is a key metric for measuring communication security, representing the maximum achievable confidential transmission rate in the existence of potential eavesdroppers. Several studies are dedicated to maximizing the secrecy rate in UAV-enabled secure communication systems. For example, in \\[[25](#bib.bib25)\\], the authors studied a secure short-packet communication system by using a UAV as the mobile relay. Specifically, they jointly optimized the coding blocklengths, transmit powers, and UAV trajectory to enhance the secrecy throughput. Fan et al. \\[[26](#bib.bib26)\\] proposed an iterative algorithm to optimize the UAV trajectory, transmit power, and user scheduling for achieving secure communications, addressing eavesdropper position estimation errors and ensuring user service fairness. In \\[[27](#bib.bib27)\\], the authors investigated an iterative suboptimal algorithm to maximize the worst average secrecy rate in the UAV-enabled networks by optimizing the UAV trajectory, transmit power, and user scheduling while considering energy constraints and security threats from external and internal eavesdroppers.\n\nSeveral studies take into account the flight energy consumption of UAVs due to the limited battery capacity. For example, Gao et al. \\[[28](#bib.bib28)\\] aimed to minimize the energy consumption of a fixed-wing UAV under security constraints, where they jointly optimized user scheduling and UAV trajectory in a scenario with multiple colluding eavesdroppers. In \\[[29](#bib.bib29)\\], the authors formulated an energy consumption minimization problem subject to constraints such as users service quality and information security requirements by jointly optimizing the offloading time, CPU frequency, artificial noise, beamforming vectors, and trajectory of UAV, along with the offloading time, CPU frequency, and transmit power of each user.\n\nHowever, there exists a clear trade-off between maximizing the secrecy rate and minimizing flight energy consumption, especially in UAV-enabled CB communication systems. In such systems, each individual in the UAV swarm must continuously adjust its position to enhance the directivity of UVAA. Dong et al. \\[[30](#bib.bib30)\\] considered a UVAA-enabled relay system, where they focused on maximizing achievable secrecy rate of downlink by jointly optimizing the beamforming vector of UVAA and bandwidth allocation. Although this process improves the security performance compared to a single UAV-enabled secure communications, it also results in the increased flight energy consumption. To deal with this trade-off, we formulate a multi-objective optimization problem that seeks to maximize the secrecy rate of system and minimize the flight energy consumption of the UAV swarm by jointly optimizing the excitation current weights and positions of UAVs.\n\n### 2.3 Optimization Methods for Aerial Secure Communications\n\nTo address the optimization problems for the UAV-enabled secure communication systems, researchers are devoted to effective algorithm design by employing methodologies such as convex optimization, swarm intelligent and DRL methods. For example, Zhou et al. \\[[31](#bib.bib31)\\] utilized the successive convex approximation to solve the joint optimization problem of the transmit powers and trajectories of UAV jammer and aerial base station. Furthermore, Li et al. \\[[11](#bib.bib11)\\] proposed an improved multi-objective dragonfly algorithm with chaotic solution initialization and (IMODACH) to deal with the trade-off among the secrecy rate and maximum sidelobe level and energy consumption in UAV-enabled secure communications. Moreover, Xiao et al. \\[[32](#bib.bib32)\\] developed a hierarchical DRL algorithm to enhance the anti-eavesdropping performance, with regard to the outage probability, intercept probability, energy consumption and latency. Moreover, in \\[[33](#bib.bib33)\\], the authors utilized a modified proximal policy optimization method to minimize the secrecy outage duration and the weighted sum of flight period by jointly optimizing the UAV trajectory, the user scheduling and the beamforming vector.\n\nHowever, both convex optimization and swarm intelligence methods have certain limitations in their applicability to dynamic environments. Therefore, we explore DRL method to deal with the formulated optimization problem. Despite the potential advantages of many DRL-based methods in dynamic environments, they still face limitations in handling the complexities and uncertainties of dynamic environments. To address this issue, our work integrates the generative diffusion model with DRL, thereby improving the ability of the algorithm to model more complex probabilistic distribution in high-dimensional action spaces.\n\n## 3 System Model\n\nIn this section, we first present a comprehensive system description. Subsequently, we delve into the details of the considered models, including the array factor, channel gain, secrecy rate, and UAV energy consumption models.\n\n### 3.1 System Description\n\n[Refer to caption](/html/2407.08914/assets/x1.png)\n\nFigure 1: A UAV swarm-enabled secure surveillance network system, where a UAV swarm is deployed for surveillance tasks, transmitting sensitive data to a RBS. The security of system is challenged by a mobile eavesdropper, depicted by red dashed lines, attempting to intercept the data via wiretap links over various time slots.\n\nAs shown in Fig. [1](#S3.F1 \"Figure 1 ‣ 3.1 System Description ‣ 3 System Model ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), we consider a UAV swarm-enabled secure surveillance network system, which consists of $`K`$ UAVs denoted by $`\\mathcal{K} \\triangleq {\\{ 1,2,\\cdots,K\\}}`$ and one RBS denoted by $`\\mathcal{S}`$. Specifically, the UAVs have collected some sensitive surveillance data and need to transmit the data back to the RBS $`\\mathcal{S}`$ by wireless links over a given time period $`T`$. For ease of exposition, the total time $`T`$ is further divided into $`N`$ time slots with equal duration $`\\delta_{t}`$, i.e., $`T \\triangleq {N\\hspace{0pt}\\delta_{t}}`$. However, due to blockage of obstacles and signal attenuation for long distance communication, a single power-constrained UAV is not able to send data to RBS $`\\mathcal{S}`$ directly. Moreover, there exists a mobile eavesdropper on the ground trying to intercept the sensitive information. To enhance the transmission efficiency and resist eavesdropping attacks from the mobile eavesdropper, these UAVs will form a UVAA to perform CB and transmit data back to RBS $`\\mathcal{S}`$ on the air-to-ground (A2G) link.\n\nMathematically, all entities are defined within a three-dimensional Cartesian coordinate system. Specifically, the RBS $`\\mathcal{S}`$ is situated at a fixed point denoted by $`{\\mathbf{w}}_{B} = \\left( x_{\\mathcal{S}},y_{\\mathcal{S}},H_{\\mathcal{S}} \\right)`$. Moreover, it is worth noting that the position change of UAVs and eavesdropper within a time slot can be negligible since the duration $`\\delta_{t}`$ is chosen to be sufficiently small. Thus, the $`3`$D coordinates of UAV $`k`$ and mobile eavesdropper at time slot $`n`$ are denoted by $`{{\\mathbf{q}}_{k}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\left( {x_{k}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}},{y_{k}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}},{z_{k}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}} \\right)`$ and $`{{\\mathbf{q}}_{E}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\left( {x_{E}\\hspace{0pt}{\\lbrack n\\rbrack}},{y_{E}\\hspace{0pt}{\\lbrack n\\rbrack}},0 \\right)`$, respectively.\n\n### 3.2 Array Factor Model\n\nThe virtual antenna array formed by UAV swarm can significantly improve the antenna directivity by optimizing its beam pattern. Specifically, at time slot $`n`$, the excitation current weight of UAV $`k`$ is denoted as $`I_{k}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}`$, the coordinate of UVAA center $`{{\\mathbf{q}}_{c}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\left( {x_{c}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}},{y_{c}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}},{z_{c}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}} \\right)`$, and the component distances in the $`x`$-axis, $`y`$-axis and $`z`$-axis between UAV $`k`$ and UVAA center are represented by $`{d_{c,k}^{x}\\hspace{0pt}{\\lbrack n\\rbrack}},{d_{c,k}^{y}\\hspace{0pt}{\\lbrack n\\rbrack}}`$ and $`d_{c,k}^{z}\\hspace{0pt}{\\lbrack n\\rbrack}`$, respectively. According to electromagnetic wave superposition principle, the array factor (AF) of UVAA at time slot $`n`$ can be described as follows \\[[34](#bib.bib34)\\]:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>A​F𝐴𝐹\\displaystyle AF</td>\n<td>(θ,φ|θ𝒮[n],φ𝒮[n])=∑k=1K(IkU[n]eΨk​(θ𝒮​[n],φ𝒮​[n])\\displaystyle\\left(\\theta,\\varphi~{}|~{}\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n]\\right)=\\sum_{k=1}^{K}\\Big{(}I^{U}_{k}[n]e^{\\Psi_{k}\\left(\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n]\\right)}</td>\n<td></td>\n<td rowspan=\"2\">(1)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>⋅ej​[cp​(dc,kx​[n]​sin⁡θ​cos⁡φ+dc,ky​[n]​sin⁡θ​sin⁡φ+dc,kz​[n]​cos⁡θ)]),\\displaystyle\\cdot e^{j\\left[c_{p}\\left(d_{c,k}^{x}[n]\\operatorname{sin}\\theta\\operatorname{cos}\\varphi+d_{c,k}^{y}[n]\\operatorname{sin}\\theta\\operatorname{sin}\\varphi+d_{c,k}^{z}[n]\\operatorname{cos}\\theta\\right)\\right]}\\Big{)},</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\lambda`$ is the wavelength, and $`c_{p} = {{2\\hspace{0pt}\\pi}/\\lambda}`$ is the phase constant. Moreover, $`\\theta \\in {\\lbrack 0,\\pi\\rbrack}`$ and $`\\varphi \\in {\\lbrack{- \\pi},\\pi\\rbrack}`$ are the elevation and azimuth angles, respectively. In addition, the direction of RBS $`\\mathcal{S}`$ with respect to UVAA $`{\\mathbf{q}}_{c}\\hspace{0pt}{\\lbrack n\\rbrack}`$ is denoted as $`({\\theta_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}},{\\varphi_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}})`$ at time slot $`n`$, and $`\\Psi_{k}\\hspace{0pt}\\left( {\\theta_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}},{\\varphi_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}} \\right)`$ is the initial phase of UAV $`k`$ in UVAA at time slot $`n`$.\n\nIn this work, we adopt an open-loop phase synchronization scheme \\[[35](#bib.bib35)\\], which can be easily implemented through UAV swarm intra-cluster communication protocols \\[[36](#bib.bib36)\\]. For this case, the initial phase synchronization is accomplished by offsetting the distance between the UAV and UVAA center. As a result, the initial phase of UAV $`k`$ in UVAA can be calculated as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Ψk(θ𝒮[n],φ𝒮[n])=−cp(\\displaystyle\\Psi_{k}\\left(\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n]\\right)=-c_{p}\\Big{(}</td>\n<td>dc,kx​[n]​sin⁡θ𝒮​[n]​cos⁡φ𝒮​[n]superscriptsubscript𝑑𝑐𝑘𝑥delimited-[]𝑛sinsubscript𝜃𝒮delimited-[]𝑛cossubscript𝜑𝒮delimited-[]𝑛\\displaystyle d_{c,k}^{x}[n]\\operatorname{sin}\\theta_{\\mathcal{S}}[n]\\operatorname{cos}\\varphi_{\\mathcal{S}}[n]</td>\n<td></td>\n<td rowspan=\"3\">(2)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>+dc,ky​[n]​sin⁡θ𝒮​[n]​sin⁡φ𝒮​[n]superscriptsubscript𝑑𝑐𝑘𝑦delimited-[]𝑛sinsubscript𝜃𝒮delimited-[]𝑛sinsubscript𝜑𝒮delimited-[]𝑛\\displaystyle+d_{c,k}^{y}[n]\\operatorname{sin}\\theta_{\\mathcal{S}}[n]\\operatorname{sin}\\varphi_{\\mathcal{S}}[n]</td>\n<td></td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>+dc,kz[n]cosθ𝒮[n]).\\displaystyle+d_{c,k}^{z}[n]\\operatorname{cos}\\theta_{\\mathcal{S}}[n]\\Big{)}.</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\n### 3.3 Channel Gain Model\n\nTo precisely model the A2G wireless communications, we utilize the elevation angle-dependent probabilistic LoS model \\[[37](#bib.bib37)\\] to characterize the A2G communication between UVAA and RBS $`\\mathcal{S}`$. Specifically, the LoS link probability between UVAA and RBS $`\\mathcal{S}`$ at time slot $`n`$ can be given by\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pc,𝒮LoS​[n]=11+c0​exp⁡(−c1​(ξ​[n]−c0)),superscriptsubscript𝑃𝑐𝒮LoSdelimited-[]𝑛11subscript𝑐0expsubscript𝑐1𝜉delimited-[]𝑛subscript𝑐0P_{c,\\mathcal{S}}^{\\text{LoS}}[n]=\\frac{1}{1+c_{0}\\operatorname{exp}\\left(-c_{1}\\left(\\xi[n]-c_{0}\\right)\\right)},</td>\n<td></td>\n<td>(3)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`c_{0}`$ and $`c_{1}`$ are two constants depending on the carrier frequency and environment. As depicted in Fig. [1](#S3.F1 \"Figure 1 ‣ 3.1 System Description ‣ 3 System Model ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), $`\\xi\\hspace{0pt}{\\lbrack n\\rbrack}`$ is the elevation between UVAA center and RBS $`\\mathcal{S}`$ at time slot $`n`$ and can be calculated by $`\\frac{180}{\\pi}\\hspace{0pt}{\\arcsin\\left( \\frac{{z_{c}^{U}\\hspace{0pt}{\\lbrack n\\rbrack}} - H_{\\mathcal{S}}}{d_{c,\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}} \\right)}`$, wherein $`{d_{c,\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\sqrt{{\\|{{{\\mathbf{q}}_{c}\\hspace{0pt}{\\lbrack n\\rbrack}} - {\\mathbf{w}}_{B}}\\|}^{2}}`$ is the distance between UVAA center and RBS $`\\mathcal{S}`$ at time slot $`n`$. Accordingly, the NLoS link probability at time slot $`n`$ can be expressed as $`{P_{c,\\mathcal{S}}^{\\text{NLoS}}\\hspace{0pt}{\\lbrack n\\rbrack}} = {1 - {P_{c,\\mathcal{S}}^{\\text{LoS}}\\hspace{0pt}{\\lbrack n\\rbrack}}}`$.\n\nThus, the path loss for LoS and NLoS links between UVAA and RBS $`\\mathcal{S}`$ at time slot $`n`$ can be given by \\[[38](#bib.bib38)\\]\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Lc,𝒮[n]={μ1​(4​π​fc​dc,𝒮​[n]c)α,LoS linkμ2​(4​π​fc​dc,𝒮​[n]c)α,NLoS link,L_{c,\\mathcal{S}}[n]=\\left\\{\\begin{aligned} \\mu_{1}\\left(\\frac{4\\pi f_{c}d_{c,\\mathcal{S}}[n]}{c}\\right)^{\\alpha},&\\quad\\text{LoS~{}link}\\\\ \\mu_{2}\\left(\\frac{4\\pi f_{c}d_{c,\\mathcal{S}}[n]}{c}\\right)^{\\alpha},&\\quad\\text{NLoS~{}link}\\\\ \\end{aligned}\\right.,</td>\n<td></td>\n<td>(4)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\mu_{1}`$ and $`\\mu_{2}`$ $`\\left( {\\mu_{2} > \\mu_{1} > 1} \\right)`$ represent the excessive path loss for LoS and NLoS links, respectively. Moreover, $`c`$ is the light speed, $`\\alpha`$ is the path loss exponent, and $`f_{c}`$ is the carrier frequency.\n\nTypically, considering both LoS and NLoS links, the average pass loss between UVAA and RBS $`\\mathcal{S}`$ at time slot $`n`$ can be express as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>L¯c,𝒮​[n]=[Pc,𝒮LoS​[n]​μ1+Pc,𝒮NLoS​[n]​μ2]​(Ko​dc,𝒮​[n])α,subscript¯𝐿𝑐𝒮delimited-[]𝑛delimited-[]superscriptsubscript𝑃𝑐𝒮LoSdelimited-[]𝑛subscript𝜇1superscriptsubscript𝑃𝑐𝒮NLoSdelimited-[]𝑛subscript𝜇2superscriptsubscript𝐾𝑜subscript𝑑𝑐𝒮delimited-[]𝑛𝛼\\overline{L}_{c,\\mathcal{S}}[n]=\\left[P_{c,\\mathcal{S}}^{\\text{LoS}}[n]\\mu_{1}+P_{c,\\mathcal{S}}^{\\text{NLoS}}[n]\\mu_{2}\\right]\\left(K_{o}d_{c,\\mathcal{S}}[n]\\right)^{\\alpha},</td>\n<td></td>\n<td>(5)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`K_{o} = \\frac{4\\hspace{0pt}\\pi\\hspace{0pt}f_{c}}{c}`$ represents the free-space path loss factor. Furthermore, the channel gain between UVAA center and RBS $`\\mathcal{S}`$ at time slot $`n`$ can be calculated as $`{g_{c,\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\frac{1}{{\\overline{L}}_{c,\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}}`$.\n\nSimilarly, the channel gain between UVAA and mobile eavesdropper at time slot $`n`$ is described as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>gc,E​[n]=1[Pc,ELoS​[n]​μ1+Pc,ENLoS​[n]​μ2]​(Ko​dc,E​[n])α,subscript𝑔𝑐𝐸delimited-[]𝑛1delimited-[]superscriptsubscript𝑃𝑐𝐸LoSdelimited-[]𝑛subscript𝜇1superscriptsubscript𝑃𝑐𝐸NLoSdelimited-[]𝑛subscript𝜇2superscriptsubscript𝐾𝑜subscript𝑑𝑐𝐸delimited-[]𝑛𝛼{g}_{c,E}[n]=\\frac{1}{\\left[P_{c,E}^{\\text{LoS}}[n]\\mu_{1}+P_{c,E}^{\\text{NLoS}}[n]\\mu_{2}\\right]\\left(K_{o}d_{c,E}[n]\\right)^{\\alpha}},</td>\n<td></td>\n<td>(6)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`P_{c,E}^{\\text{LoS}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ and $`P_{c,E}^{\\text{NLoS}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ represent the probabilities of LoS and NLoS links between UVAA and mobile eavesdropper at time slot $`n`$, respectively. Moreover, $`d_{c,E}\\hspace{0pt}{\\lbrack n\\rbrack}`$ is the distance between UVAA center and mobile eavesdropper at time slot $`n`$, which can be calculated by $`{d_{c,E}\\hspace{0pt}{\\lbrack n\\rbrack}} = \\sqrt{{\\|{{{\\mathbf{q}}_{c}\\hspace{0pt}{\\lbrack n\\rbrack}} - {{\\mathbf{q}}_{E}\\hspace{0pt}{\\lbrack n\\rbrack}}}\\|}^{2}}`$.\n\n### 3.4 Secrecy Rate Model\n\nBy exploiting the previously mentioned array factor and channel model, the transmission rate from UVAA to RBS at time slot $`n`$ can be expressed as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>RU,𝒮​[n]=log2⁡(1+PU​[n]​gc,𝒮​[n]​GU,𝒮​(θ𝒮​[n],φ𝒮​[n])σ2),subscript𝑅𝑈𝒮delimited-[]𝑛subscriptlog21subscript𝑃𝑈delimited-[]𝑛subscript𝑔𝑐𝒮delimited-[]𝑛subscript𝐺𝑈𝒮subscript𝜃𝒮delimited-[]𝑛subscript𝜑𝒮delimited-[]𝑛superscript𝜎2R_{U,\\mathcal{S}}[n]=\\operatorname{log}_{2}\\left(1+\\frac{P_{U}[n]g_{c,\\mathcal{S}}[n]G_{U,\\mathcal{S}}(\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n])}{\\sigma^{2}}\\right),</td>\n<td></td>\n<td>(7)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`P_{U}\\hspace{0pt}{\\lbrack n\\rbrack}`$ represents the transmit power of UVAA, and $`\\sigma^{2}`$ is the noise power of the A2G channel. Moreover, $`G_{U,\\mathcal{S}}\\hspace{0pt}{({\\theta_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}},{\\varphi_{\\mathcal{S}}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$ is the antenna gain\n\n<sup>1</sup>\n\n<sup>1</sup>1In this work, we assume that the magnitude of the far-field beam pattern of each UAV element is $`0`$ dB since each UAV is equipped with a single isotropic antenna under the same power constraints. Moreover, the antenna efficiency is approximated as to be $`1`$.\n\nof UVAA towards RBS $`\\mathcal{S}`$ at time slot $`n`$, which can be defined as follows:\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>GU,𝒮subscript𝐺𝑈𝒮\\displaystyle G_{U,\\mathcal{S}}</td>\n<td>(θ𝒮​[n],φ𝒮​[n])=subscript𝜃𝒮delimited-[]𝑛subscript𝜑𝒮delimited-[]𝑛absent\\displaystyle(\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n])=</td>\n<td></td>\n<td rowspan=\"2\">(8)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>4π|AF(θ𝒮[n],φ𝒮[n]|θ𝒮[n],φ𝒮[n])|2∫02​π∫0π|AF(θ,φ|θ𝒮[n],φ𝒮[n])|2sinθdθdφ.\\displaystyle\\frac{4\\pi\\left|AF\\left(\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n]|\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n]\\right)\\right|^{2}}{\\int_{0}^{2\\pi}\\int_{0}^{\\pi}|AF(\\theta,\\varphi|\\theta_{\\mathcal{S}}[n],\\varphi_{\\mathcal{S}}[n])|^{2}\\sin\\theta\\mathrm{d}\\theta\\mathrm{d}\\varphi}.</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nSimilarly, the antenna gain of UVAA towards the mobile eavesdropper at time slot $`n`$ can be written as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>GU,Esubscript𝐺𝑈𝐸\\displaystyle G_{U,E}</td>\n<td>(θE​[n],φE​[n])=subscript𝜃𝐸delimited-[]𝑛subscript𝜑𝐸delimited-[]𝑛absent\\displaystyle(\\theta_{E}[n],\\varphi_{E}[n])=</td>\n<td></td>\n<td rowspan=\"2\">(9)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>4π|AF(θE[n],φE[n]|θ0[n],φ0[n])|2∫02​π∫0π|AF(θ,φ|θ0[n],φ0[n])|2sinθdθdφ,\\displaystyle\\frac{4\\pi\\left|AF\\left(\\theta_{E}[n],\\varphi_{E}[n]|\\theta_{0}[n],\\varphi_{0}[n]\\right)\\right|^{2}}{\\int_{0}^{2\\pi}\\int_{0}^{\\pi}|AF(\\theta,\\varphi|\\theta_{0}[n],\\varphi_{0}[n])|^{2}\\sin\\theta\\mathrm{d}\\theta\\mathrm{d}\\varphi},</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\left( {\\theta_{E}\\hspace{0pt}{\\lbrack n\\rbrack}},{\\varphi_{E}\\hspace{0pt}{\\lbrack n\\rbrack}} \\right)`$ is the direction of the mobile eavesdropper with respect to the UVAA center at time slot $`n`$. Accordingly, the transmission rate from UVAA to the mobile eavesdropper can be expressed as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>RU,E​[n]=log2⁡(1+PU​[n]​gc,E​[n]​GU,E​(θ0​[n],φ0​[n])σ2).subscript𝑅𝑈𝐸delimited-[]𝑛subscriptlog21subscript𝑃𝑈delimited-[]𝑛subscript𝑔𝑐𝐸delimited-[]𝑛subscript𝐺𝑈𝐸subscript𝜃0delimited-[]𝑛subscript𝜑0delimited-[]𝑛superscript𝜎2R_{U,E}[n]=\\operatorname{log}_{2}\\left(1+\\frac{P_{U}[n]g_{c,E}[n]G_{U,E}(\\theta_{0}[n],\\varphi_{0}[n])}{\\sigma^{2}}\\right).</td>\n<td></td>\n<td>(10)</td>\n</tr>\n</tbody>\n</table>\n\nFurthermore, the achievable secrecy rate of A2G wireless link at time slot $`n`$ is given by\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>RS​E​[n]subscript𝑅𝑆𝐸delimited-[]𝑛\\displaystyle R_{SE}[n]</td>\n<td>={RU,𝒮​[n]−RU,E​[n]}+,absentsuperscriptsubscript𝑅𝑈𝒮delimited-[]𝑛subscript𝑅𝑈𝐸delimited-[]𝑛\\displaystyle=\\left\\{R_{U,\\mathcal{S}}[n]-R_{U,E}[n]\\right\\}^{+},</td>\n<td></td>\n<td>(11)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\left\\{ x \\right\\}^{+}`$ is defined as $`\\max{\\{ x,0\\}}`$.\n\n### 3.5 UAV Energy Consumption Model\n\nAccording to the aircraft dynamics of rotary-wing UAVs, the power consumption can be expressed as the sum of the power for level flight and the power for vertical flight \\[[39](#bib.bib39)\\]. Specifically, the power of UAV $`k`$ for level flight at time slot $`n`$ can be calculated as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Plevelk​[n]=superscriptsubscript𝑃level𝑘delimited-[]𝑛absent\\displaystyle P_{\\text{level}}^{k}[n]=</td>\n<td>Pi​1+∥vkx[n],vky[n]∥44​v04−∥vkx[n],vky[n]∥22​v02\\displaystyle P_{i}\\sqrt{\\sqrt{1+\\frac{\\|v_{k}^{x}[n],v_{k}^{y}[n]\\|^{4}}{4v_{0}^{4}}}-\\frac{\\|v_{k}^{x}[n],v_{k}^{y}[n]\\|^{2}}{2v_{0}^{2}}}</td>\n<td></td>\n<td rowspan=\"3\">(12)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>+P0​(1+3∥vkx[n],vky[n]∥2ut​i​p2)\\displaystyle+P_{0}\\left(1+\\frac{3\\|v_{k}^{x}[n],v_{k}^{y}[n]\\|^{2}}{u_{tip}^{2}}\\right)</td>\n<td></td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>+12d0ρsA∥vkx[n],vky[n]∥3,\\displaystyle+\\frac{1}{2}d_{0}\\rho sA\\|v_{k}^{x}[n],v_{k}^{y}[n]\\|^{3},</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nwhere $`v_{k}^{x}`$ and $`v_{k}^{y}`$ are the $`x`$-axis component speed and $`y`$-axis component speed of UAV $`k`$ at time slot $`n`$, respectively. $`v_{0}`$ is the mean rotor induced velocity for hovering, $`U_{t\\hspace{0pt}i\\hspace{0pt}p}`$ is the tip speed of the rotor blade, $`d_{0}`$ is the fuselage drag ratio, $`\\rho`$ is the density of air, $`s`$ is the rotor solidity and $`A`$ is the rotor disk area. Moreover, $`P_{i}`$ and $`P_{0}`$ denote the induced power and the blade profile power in hovering status, which can be calculated as follows \\[[40](#bib.bib40)\\]:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pi=(1+M)​W3/22​ρ​A,P0=κ8​ρ​s​A​Ω3​Λ3,formulae-sequencesubscript𝑃𝑖1𝑀superscript𝑊322𝜌𝐴subscript𝑃0𝜅8𝜌𝑠𝐴superscriptΩ3superscriptΛ3P_{i}=(1+M)\\frac{W^{3/2}}{\\sqrt{2\\rho A}},P_{0}=\\frac{\\kappa}{8}\\rho sA\\Omega^{3}\\Lambda^{3},</td>\n<td></td>\n<td>(13)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\Omega`$ is the blade angular velocity, $`M`$ is the incremental correction factor to induced power, $`\\Lambda`$ is the rotor radius, and $`\\kappa`$ is the profile drag coefficient. Moreover, $`W = {m\\hspace{0pt}g}`$ is the weight of UAV, wherein $`g`$ is gravitational acceleration and $`m`$ is the mass of UAV.\n\nIn addition, the power of UAV $`k`$ for vertical flight at time slot $`n`$ can be modeled as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pverticalk[n]={W​vkz​[n],vkz​[n]>00,vkz​[n]≤0,P_{\\text{vertical}}^{k}[n]=\\left\\{\\begin{aligned} &Wv_{k}^{z}[n],&&v_{k}^{z}[n]>0\\\\ &0,&&v_{k}^{z}[n]\\leq 0\\end{aligned}\\right.,</td>\n<td></td>\n<td>(14)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`v_{k}^{z}`$ is the $`z`$-axis component speed of UAV $`k`$ at time slot $`n`$. Moreover, $`{P_{\\text{vertical}}^{k}\\hspace{0pt}{\\lbrack n\\rbrack}} = 0`$ as the UAVs operate in auto-rotation and are unpowered during the vertical descent \\[[39](#bib.bib39)\\].\n\nAccordingly, the flight energy consumption of UAV swarm at time slot $`n`$ can be modeled as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>E​[n]=∑k=1Kδt​(Plevelk​[n]+Pverticalk​[n]).𝐸delimited-[]𝑛superscriptsubscript𝑘1𝐾subscript𝛿𝑡superscriptsubscript𝑃level𝑘delimited-[]𝑛superscriptsubscript𝑃vertical𝑘delimited-[]𝑛E[n]=\\sum_{k=1}^{K}\\delta_{t}(P_{\\text{level}}^{k}[n]+P_{\\text{vertical}}^{k}[n]).</td>\n<td></td>\n<td>(15)</td>\n</tr>\n</tbody>\n</table>\n\n## 4 Problem Formulation and Analysis\n\nIn this work, we aim to maximize the secrecy rate of the system while minimizing the flight energy consumption of the UAV swarm by determining the excitation current weights and positions of UAVs during a period of $`N`$ time slots. Thus, the ASCEE-MOP is formulated as follows:\n\n<table>\n<tbody>\n<tr>\n<td colspan=\"5\"></td>\n</tr>\n<tr>\n<td></td>\n<td>P1:</td>\n<td>max𝑰,𝒒​(∑n=1NRS​E​[n],−∑n=1NE​[n]),𝑰𝒒maxsuperscriptsubscript𝑛1𝑁subscript𝑅𝑆𝐸delimited-[]𝑛superscriptsubscript𝑛1𝑁𝐸delimited-[]𝑛\\displaystyle\\underset{\\bm{I},\\bm{q}}{\\text{max}}\\ (\\sum_{n=1}^{N}R_{SE}[n],-\\sum_{n=1}^{N}E[n]),</td>\n<td></td>\n<td>(16a)</td>\n</tr>\n<tr>\n<td></td>\n<td>s.t.</td>\n<td>0≤IkU​[n]≤1,∀k∈{1,…,K},formulae-sequence0superscriptsubscript𝐼𝑘𝑈delimited-[]𝑛1for-all𝑘1…𝐾\\displaystyle 0\\leq I_{k}^{U}[n]\\leq 1,\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16b)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>Xm​i​n≤xkU​[n]≤Xm​a​x,∀k∈{1,…,K},formulae-sequencesubscript𝑋𝑚𝑖𝑛superscriptsubscript𝑥𝑘𝑈delimited-[]𝑛subscript𝑋𝑚𝑎𝑥for-all𝑘1…𝐾\\displaystyle X_{min}\\leq x_{k}^{U}[n]\\leq X_{max},\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16c)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>Ym​i​n≤ykU​[n]≤Ym​a​x,∀k∈{1,…,K},formulae-sequencesubscript𝑌𝑚𝑖𝑛superscriptsubscript𝑦𝑘𝑈delimited-[]𝑛subscript𝑌𝑚𝑎𝑥for-all𝑘1…𝐾\\displaystyle Y_{min}\\leq y_{k}^{U}[n]\\leq Y_{max},\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16d)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>Zm​i​n≤zkU​[n]≤Zm​a​x,∀k∈{1,…,K},formulae-sequencesubscript𝑍𝑚𝑖𝑛superscriptsubscript𝑧𝑘𝑈delimited-[]𝑛subscript𝑍𝑚𝑎𝑥for-all𝑘1…𝐾\\displaystyle Z_{min}\\leq z_{k}^{U}[n]\\leq Z_{max},\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16e)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>0≤vkU​[n]≤Vm​a​x,∀k∈{1,…,K},formulae-sequence0superscriptsubscript𝑣𝑘𝑈delimited-[]𝑛subscript𝑉𝑚𝑎𝑥for-all𝑘1…𝐾\\displaystyle 0\\leq v_{k}^{U}[n]\\leq V_{max},\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16f)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>∥𝒒k1[n],𝒒k2[n]∥≥Dm​i​nU,∀k1,k2∈{1,…,K},\\displaystyle\\|\\bm{q}_{k_{1}}[n],\\bm{q}_{k_{2}}[n]\\|\\geq D_{min}^{U},\\forall k_{1},k_{2}\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(16g)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\mathbf{I}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ and $`{\\mathbf{q}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ are the excitation current weights and positions of UAVs at time slot $`n`$, respectively. Constraint ([16b](#S4.E16.2 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) expresses the range constraint of the excitation current weight. Moreover, Constraints ([16c](#S4.E16.3 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")), ([16d](#S4.E16.4 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) and ([16e](#S4.E16.5 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) restrict the flight area of the UAV which may be imposed by surveillance area and government regulations. In addition, Constraint ([16f](#S4.E16.6 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) is the speed constrain of the UAV, and Constraint ([16g](#S4.E16.7 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) is imposed to guarantee the minimum distance between two UAVs.\n\nNon-convexity: The ASCEE-MOP is inherently non-convex, stemming from both its imposed safety constraints and objective function. Specifically, the safety constraint, as delineated in Constraint ([16g](#S4.E16.7 \"In 16 ‣ 4 Problem Formulation and Analysis ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")), necessitates a minimum separation distance between UAVs, thereby resulting in a non-convex solution space defined by regions external to spherical boundaries.\n\nNP-hard: The formulated ASCEE-MOP can be proven to be NP-hard. Specifically, we assume that the optimization problem is simplified by only considering to maximize the secrecy rate of system at a given time slot with fixing the positions of UAVs. Moreover, the excitation current weights are further simplified as the discrete values, i.e., $`I_{k}^{U} \\in {\\mathbf{S}} = {\\{ 0,1\\}}`$. Accordingly, the simplified problem is given as follows:\n\n<table>\n<tbody>\n<tr>\n<td colspan=\"5\"></td>\n</tr>\n<tr>\n<td></td>\n<td>P2:</td>\n<td>max𝑰RS​E,𝑰maxsubscript𝑅𝑆𝐸\\displaystyle\\underset{\\bm{I}}{\\text{max}}\\quad R_{SE},</td>\n<td></td>\n<td>(17a)</td>\n</tr>\n<tr>\n<td></td>\n<td>s.t.</td>\n<td>IkU∈𝑺,∀k∈{1,…,K},formulae-sequencesuperscriptsubscript𝐼𝑘𝑈𝑺for-all𝑘1…𝐾\\displaystyle I_{k}^{U}\\in\\bm{S},\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(17b)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>∑k=1KIkU≤K,∀k∈{1,…,K},formulae-sequencesuperscriptsubscript𝑘1𝐾superscriptsubscript𝐼𝑘𝑈𝐾for-all𝑘1…𝐾\\displaystyle\\sum_{k=1}^{K}{I_{k}^{U}}\\leq K,\\forall k\\in\\{1,...,K\\},</td>\n<td></td>\n<td>(17c)</td>\n</tr>\n</tbody>\n</table>\n\nAs such, the P2 is structured as a nonlinear multi-dimensional knapsack problem, which is NP-hard \\[[41](#bib.bib41)\\]. Therefore, the ASCEE-MOP is an NP-hard optimization problem since it is much more complex than P2.\n\nTrade-off: Furthermore, the objective function of ASCEE-MOP seeks to concurrently maximize the secrecy rate of the system while minimizing the flight energy consumption of the UAV swarm. Specifically, it is essential for UAVs to fly to suitable positions to improve the antenna directivity of the UVAA system, thereby maximizing the total secrecy rate during task execution. However, constantly adjusting the positions of UAVs to maintain optimal antenna directivity leads to significant energy consumption. Thus, there is an inherent trade-off between maximizing the secrecy rate of the system and minimizing flight energy consumption of the UAV swarm within the formulated ASCEE-MOP, and striking the right balance between these two conflicting objectives poses a challenging task.\n\nTo deal with such non-convex optimization problems, most works subdivide them into several convex subproblems which can be solved by an iterative manner. However, the accuracy is impacted as a result of the decomposition. Moreover, the dynamics of environment, e.g., the changed position of mobile eavesdropper and the time-varying channel, brings some challenges. In this case, existing optimization-based methods and heuristic algorithms needs to re-run once the environment changes. Fortunately, DRL provides a feasible and efficient way for the sequential decision making and optimal control in dynamic environments. Thus, this motives us to utilize DRL-based methods to address the formulated ASCEE-MOP.\n\n## 5 The Proposed GDMTD3\n\nIn this section, the formulated non-convex multi-objective optimization problem is solved by the DRL-based method. Specifically, we first adopt a Markov decision process to reformulate the ASCEE-MOP, and then propose the GDMTD3 method to solve the problem.\n\n### 5.1 Markov Decision Process for ASCEE-MOP\n\nThe formulated ASCEE-MOP of the UAV swarm-enabled surveillance network system can be modeled as a Markov decision process to facilitate the application of DRL. In general, a Markov decision process is represented as a tuple $`< \\mathcal{S},\\mathcal{A},\\mathcal{P},\\mathcal{R},\\gamma >`$, where $`\\mathcal{S}`$ is the state space of environment, $`\\mathcal{A}`$ is the action space of agent, $`\\mathcal{P}`$ denotes the state transition probability of environment, $`\\mathcal{R}`$ is the reward space, and $`\\gamma \\in {\\lbrack 0,1\\rbrack}`$ denotes the reward discount factor. Specifically, the UVAA is treated as a decision-making agent in the Markov decision process. With the framework of the Markov decision process, the environment state at any given time slot $`n`$ is signified by $`{\\mathbf{s}}\\hspace{0pt}{\\lbrack{\\mathbf{n}}\\rbrack}`$, wherein $`{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}} \\in \\mathcal{S}`$. Subsequently, the agent selects an action $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ according to the policy $`{\\mathbf{π}}\\hspace{0pt}{({{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$. After that, the environment dispenses the agent a reward $`r\\hspace{0pt}{\\lbrack n\\rbrack}`$ and transitions to the next state $`{\\mathbf{s}}\\hspace{0pt}{\\lbrack{n + 1}\\rbrack}`$ based on the transition probability function $`\\mathcal{P}\\hspace{0pt}{(\\left. {{\\mathbf{s}}\\hspace{0pt}{\\lbrack{n + 1}\\rbrack}} \\middle| {{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}},{{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}}} \\right.)}`$. Accordingly, the crucial elements in our model are described below in detail.\n\n#### 5.1.1 State Space\n\nThe state of the system at time slot $`n`$ can be defined by $`{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}} = {({{\\mathbf{q}}\\hspace{0pt}{\\lbrack n\\rbrack}},{{\\mathbf{q}}_{E}^{x\\hspace{0pt}y}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$. Specifically, $`{\\mathbf{q}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ represents the positions of all UAVs at time slot $`n`$, and $`{\\mathbf{q}}_{E}^{x\\hspace{0pt}y}\\hspace{0pt}{\\lbrack n\\rbrack}`$ is the coordinates of the eavesdroppers within the $`x`$-$`y`$ plane at time slot $`n`$.\n\n#### 5.1.2 Action Space\n\nAt a certain time slot $`n`$, each UAV needs to choose its own proper excitation current weight and position. Accordingly, the action set of UAV swarm can be represented by $`{{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}} = {({{\\mathbf{I}}\\hspace{0pt}{\\lbrack n\\rbrack}},{{\\mathbf{q}}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$, where $`{\\mathbf{I}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ and $`{\\mathbf{q}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ represent the excitation current weights and positions of all UAVs at time slot $`n`$, respectively.\n\n#### 5.1.3 Reward Function\n\nIn DRL, the reward garnered from the agent-environment interchange provides a quantifiable measure of action efficiency in a given state. Therefore, the formulated ASCEE-MOP can be transformed into maximizing the accumulative reward. Accordingly, the reward function can be constructed as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>r​[n]=ω1​rS​E​[n]+ω2​rE​[n]−rP​[n],𝑟delimited-[]𝑛subscript𝜔1subscript𝑟𝑆𝐸delimited-[]𝑛subscript𝜔2subscript𝑟𝐸delimited-[]𝑛subscript𝑟𝑃delimited-[]𝑛r[n]=\\omega_{1}r_{SE}[n]+\\omega_{2}r_{E}[n]-r_{P}[n],</td>\n<td></td>\n<td>(18)</td>\n</tr>\n</tbody>\n</table>\n\nwhere the first term, i.e., $`{r_{S\\hspace{0pt}E}\\hspace{0pt}{\\lbrack n\\rbrack}} = {R_{S\\hspace{0pt}E}\\hspace{0pt}{\\lbrack n\\rbrack}}`$ represents the secrecy rate that the system achieves at time slot $`n`$. Moreover, the second term $`{r_{E}\\hspace{0pt}{\\lbrack n\\rbrack}} = {- {E\\hspace{0pt}{\\lbrack n\\rbrack}}}`$ quantifies the total flight energy consumption of all UAVs at time slot $`n`$. Furthermore, $`\\omega_{1}`$ and $`\\omega_{2}`$ denote the weight factors for the two objectives, which can be determined based on their respective value ranges. In addition, the penalty $`r_{P}\\hspace{0pt}{\\lbrack n\\rbrack}`$ is applied if the UAVs violate the constraint of speed or collide with each other.\n\n#### 5.1.4 Transition Probability\n\nIn our work, the transition probability of the state, which is denoted as $`\\mathcal{P}\\hspace{0pt}{(\\left. {{\\mathbf{s}}\\hspace{0pt}{\\lbrack{n + 1}\\rbrack}} \\middle| {{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}},{{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}}} \\right.)}`$, specifies the probability distribution of the subsequent state after the UAVs execute their respective actions in the current state.\n\n### 5.2 Basic Principles of Conventional TD3\n\nTD3\\[[42](#bib.bib42)\\] is an advanced reinforcement learning algorithm that extends from the foundations of deep deterministic policy gradient (DDPG)\\[[43](#bib.bib43)\\] method. Specifically, TD3 addresses the key limitations in DDPG by incorporating several novel techniques including twin critic networks, delayed policy updates, and target policy smoothing, which collectively contribute to its superior performance in continuous control tasks.\n\n#### 5.2.1 Actor-Critic Framework\n\nSimilar to DDPG, TD3 employs an actor-critic structure, where the actor network $`{\\mathbf{μ}}\\hspace{0pt}{(\\left. {\\mathbf{s}} \\middle| {\\mathbf{θ}}_{\\mathbf{μ}} \\right.)}`$ outputs deterministic actions, and the critic networks $`{\\mathbf{Q}}\\hspace{0pt}{({\\mathbf{s}},\\left. {\\mathbf{a}} \\middle| {\\mathbf{θ}}_{\\mathbf{Q}} \\right.)}`$ evaluate the action-state value function. The objective is to find the optimal policy $`\\mathbf{π}`$ that maximizes the expected accumulated return.\n\nThe Bellman equation provides a recursive decomposition to update the action-value function $`{\\mathbf{Q}}\\hspace{0pt}{({\\mathbf{s}},{\\mathbf{a}})}`$, which can be described mathematically as follows \\[[44](#bib.bib44)\\]:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝑸(𝒔[n],𝒂[n])=r[n]+γ𝔼𝒔​[n+1]∼𝒑𝝅[\\displaystyle\\bm{Q}(\\bm{s}[n],\\bm{a}[n])=r[n]+\\gamma\\mathbb{E}_{\\bm{s}[n+1]\\sim\\bm{p_{\\pi}}}[</td>\n<td>𝑸(𝒔[n+1],\\displaystyle\\bm{Q}(\\bm{s}[n+1],</td>\n<td></td>\n<td rowspan=\"2\">(19)</td>\n</tr>\n<tr>\n<td></td>\n<td></td>\n<td>𝝁(𝒔[n+1]))],\\displaystyle\\bm{\\mu}(\\bm{s}[n+1]))],</td>\n<td></td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\mathbf{p}}_{\\mathbf{π}}`$ represents the transition probability distribution under policy $`\\mathbf{π}`$.\n\n#### 5.2.2 Twin Critic Networks\n\nOne of the significant improvements in TD3 is the use of twin critic networks to address overestimation bias. Specifically, overestimation usually occurs when the action-value estimates are consistently higher than the true values, leading to the suboptimal policy updates. While in TD3, two independent critic networks, i.e., $`{\\mathbf{Q}}_{\\mathbf{1}}\\hspace{0pt}{({\\mathbf{s}},\\left. {\\mathbf{a}} \\middle| {\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}} \\right.)}`$ and $`{\\mathbf{Q}}_{\\mathbf{2}}\\hspace{0pt}{({\\mathbf{s}},\\left. {\\mathbf{a}} \\middle| {\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{2}}} \\right.)}`$, are used to estimate the value of state-action pairs. The target Q-value is computed as the minimum of the two estimates, which is represented as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>y​[n]=r​[n]+γ​mini=1,2⁡𝑸𝒊′​(𝒔​[n+1],𝝁′​(𝒔​[n+1]|𝜽𝝁′)),𝑦delimited-[]𝑛𝑟delimited-[]𝑛𝛾subscript𝑖12subscriptsuperscript𝑸bold-′𝒊𝒔delimited-[]𝑛1superscript𝝁bold-′conditional𝒔delimited-[]𝑛1superscriptsubscript𝜽𝝁bold-′y[n]=r[n]+\\gamma\\min_{i=1,2}\\bm{Q^{\\prime}_{i}}(\\bm{s}[n+1],\\bm{\\mu^{\\prime}}(\\bm{s}[n+1]|\\bm{\\theta_{\\mu}^{\\prime}})),</td>\n<td></td>\n<td>(20)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\mathbf{Q}}_{\\mathbf{i}}'`$ is the target critic networks corresponding to $`{\\mathbf{Q}}_{\\mathbf{i}}`$, and $`{\\mathbf{μ}}'`$ is the target actor network.\n\n#### 5.2.3 Delayed Policy Update\n\nTD3 incorporates the delayed policy update to prevent the policy network from overfitting to noisy value estimates. While the critic networks are updated at each time step, the actor network is updated less frequently. Specifically, the policy is updated every $`d`$ iterations of the critic networks, and this delay allows the value estimates to stabilize, leading to more reliable policy updates.\n\n#### 5.2.4 Target Policy Smoothing\n\nTo further enhance the stability, TD3 introduces target policy smoothing, which adds extra noise to the target action during the critic update process. This process involves sampling noise from a Gaussian distribution $`\\mathbf{\\epsilon} \\sim {\\mathcal{N}\\hspace{0pt}{(0,\\sigma^{2})}}`$ and clipping it to a certain range to maintain the target action within the permissible action space. Specifically, the process above can be represented as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒔​𝒂​[n+1]=𝝁′​(𝒔​[n+1]|𝜽𝝁′)+ϵ,ϵ∼clip​(𝒩​(0,σ2),−c,c),formulae-sequence𝒔𝒂delimited-[]𝑛1superscript𝝁bold-′conditional𝒔delimited-[]𝑛1superscriptsubscript𝜽𝝁bold-′bold-italic-ϵsimilar-tobold-italic-ϵclip𝒩0superscript𝜎2𝑐𝑐\\bm{sa}[n+1]=\\bm{\\mu^{\\prime}}(\\bm{s}[n+1]|\\bm{\\theta_{\\mu}^{\\prime}})+\\bm{\\epsilon},\\bm{\\epsilon}\\sim\\text{clip}(\\mathcal{N}(0,\\sigma^{2}),-c,c),</td>\n<td></td>\n<td>(21)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\text{clip}\\hspace{0pt}{(x,a,b)}`$ is a clipping operator, which is defined as $`{\\text{clip}\\hspace{0pt}{(x,a,b)}} = x`$ if $`a < x < b`$, $`{\\text{clip}\\hspace{0pt}{(x,a,b)}} = a`$ if $`x \\leq a`$ and $`{\\text{clip}\\hspace{0pt}{(x,a,b)}} = b`$ if $`x \\geq b`$. This smoothed target action $`{\\mathbf{s}}\\hspace{0pt}{\\mathbf{a}}\\hspace{0pt}{\\lbrack{n + 1}\\rbrack}`$ is used in the Bellman update to replace the target action $`{\\mathbf{μ}}'\\hspace{0pt}{(\\left. {{\\mathbf{s}}\\hspace{0pt}{\\lbrack{n + 1}\\rbrack}} \\middle| {\\mathbf{θ}}_{\\mathbf{μ}}' \\right.)}`$ in Eq. ([20](#S5.E20 \"In 5.2.2 Twin Critic Networks ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")), which reduces the variance of the value estimates and preventing sharp changes in the policy.\n\n#### 5.2.5 Network Training\n\nThe training process of TD3 involves updating the actor and critic networks based on specific loss functions, which is designed to improve the learning stability and performance. The update of critic network is through minimizing the temporal difference (TD) error loss function, which is defined as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>L​(𝜽𝑸𝒊)=𝔼​[(𝑸𝒊​(𝒔​[n],𝒂​[n]|𝜽𝑸𝒊)−y​[n])2],i=1,2.formulae-sequence𝐿subscript𝜽subscript𝑸𝒊𝔼delimited-[]superscriptsubscript𝑸𝒊𝒔delimited-[]𝑛conditional𝒂delimited-[]𝑛subscript𝜽subscript𝑸𝒊𝑦delimited-[]𝑛2𝑖12L(\\bm{\\theta_{Q_{i}}})=\\mathbb{E}\\left[\\left(\\bm{Q_{i}}(\\bm{s}[n],\\bm{a}[n]|\\bm{\\theta_{Q_{i}}})-y[n]\\right)^{2}\\right],i=1,2.</td>\n<td></td>\n<td>(22)</td>\n</tr>\n</tbody>\n</table>\n\nWith a batch of randomly sampled $`B`$ transitions from experience replay buffer $`\\mathcal{D}`$, the loss function for the critic network can be approximated as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>L​(𝜽𝑸𝒊)≈1B​∑b=1B(𝑸𝒊​(𝒔𝒃,𝒂𝒃|𝜽𝑸𝒊)−yb)2,i=1,2,formulae-sequence𝐿subscript𝜽subscript𝑸𝒊1𝐵superscriptsubscript𝑏1𝐵superscriptsubscript𝑸𝒊subscript𝒔𝒃conditionalsubscript𝒂𝒃subscript𝜽subscript𝑸𝒊subscript𝑦𝑏2𝑖12L(\\bm{\\theta_{Q_{i}}})\\approx\\frac{1}{B}\\sum_{b=1}^{B}\\left(\\bm{Q_{i}}(\\bm{s_{b}},\\bm{a_{b}}|\\bm{\\theta_{Q_{i}}})-y_{b}\\right)^{2},i=1,2,</td>\n<td></td>\n<td>(23)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`y_{b} = {r_{b} + {\\gamma\\hspace{0pt}{\\min_{i = {1,2}}{\\mathbf{Q}}_{\\mathbf{i}}'}\\hspace{0pt}{({{\\mathbf{s}}\\hspace{0pt}\\mathbf{\\_}_{\\mathbf{b}}},{{{\\mathbf{μ}}'\\hspace{0pt}{(\\left. {{\\mathbf{s}}\\hspace{0pt}\\mathbf{\\_}_{\\mathbf{b}}} \\middle| {\\mathbf{θ}}_{\\mathbf{μ}}' \\right.)}} + \\mathbf{\\epsilon}})}}}`$.\n\nThe actor network $`{\\mathbf{μ}}\\hspace{0pt}{(\\left. {\\mathbf{s}} \\middle| {\\mathbf{θ}}_{\\mathbf{μ}} \\right.)}`$ is updated less frequently than the critic networks to ensure stable learning. The objective of actor network is to maximize the expected Q-value as evaluated by the first critic network. The loss function for the actor network is represented as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>L​(𝜽𝝁)=−𝔼​[𝑸𝟏​(𝒔,𝝁​(𝒔|𝜽𝝁)|𝜽𝑸𝟏)].𝐿subscript𝜽𝝁𝔼delimited-[]subscript𝑸1𝒔conditional𝝁conditional𝒔subscript𝜽𝝁subscript𝜽subscript𝑸1L(\\bm{\\theta_{\\mu}})=-\\mathbb{E}\\left[\\bm{Q_{1}}(\\bm{s},\\bm{\\mu}(\\bm{s}|\\bm{\\theta_{\\mu}})|\\bm{\\theta_{Q_{1}}})\\right].</td>\n<td></td>\n<td>(24)</td>\n</tr>\n</tbody>\n</table>\n\nWith a batch of randomly sampled $`B`$ transitions from experience replay buffer $`\\mathcal{D}`$, the loss function for the actor network can be approximated as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>L​(𝜽𝝁)≈−1B​∑b=1B𝑸𝟏​(𝒔𝒃,𝝁​(𝒔𝒃|𝜽𝝁)|𝜽𝑸𝟏).𝐿subscript𝜽𝝁1𝐵superscriptsubscript𝑏1𝐵subscript𝑸1subscript𝒔𝒃conditional𝝁conditionalsubscript𝒔𝒃subscript𝜽𝝁subscript𝜽subscript𝑸1L(\\bm{\\theta_{\\mu}})\\approx-\\frac{1}{B}\\sum_{b=1}^{B}\\bm{Q_{1}}(\\bm{s_{b}},\\bm{\\mu}(\\bm{s_{b}}|\\bm{\\theta_{\\mu}})|\\bm{\\theta_{Q_{1}}}).</td>\n<td></td>\n<td>(25)</td>\n</tr>\n</tbody>\n</table>\n\nThe target networks are updated using a soft update mechanism, which blends the parameters of the main networks with those of the target networks using a weight factor. The updates are defined as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝜽𝑸𝒊′←τ​𝜽𝑸𝒊+(1−τ)​𝜽𝑸𝒊′,i=1,2,formulae-sequence←superscriptsubscript𝜽subscript𝑸𝒊bold-′𝜏subscript𝜽subscript𝑸𝒊1𝜏superscriptsubscript𝜽subscript𝑸𝒊bold-′𝑖12\\bm{\\theta_{Q_{i}}^{\\prime}}\\leftarrow\\tau\\bm{\\theta_{Q_{i}}}+(1-\\tau)\\bm{\\theta_{Q_{i}}^{\\prime}},i=1,2,</td>\n<td></td>\n<td>(26)</td>\n</tr>\n</tbody>\n</table>\n\nand\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝜽𝝁′←τ​𝜽𝝁+(1−τ)​𝜽𝝁′,←superscriptsubscript𝜽𝝁bold-′𝜏subscript𝜽𝝁1𝜏superscriptsubscript𝜽𝝁bold-′\\bm{\\theta_{\\mu}^{\\prime}}\\leftarrow\\tau\\bm{\\theta_{\\mu}}+(1-\\tau)\\bm{\\theta_{\\mu}^{\\prime}},</td>\n<td></td>\n<td>(27)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\tau`$ is a small soft weight factor. It can be observed that the updated parameters of a target network are a weighted combination of its original parameters and the corresponding network parameters.\n\n### 5.3 Generative Diffusion Model for Actor Network\n\nIn this section, we first elaborate the motivation behind employing diffusion models within the actor network of TD3 algorithm. Then, we explore the customization of the diffusion model for generating optimal decisions regarding the formulated ASCEE-MOP.\n\n#### 5.3.1 Motivation of Employing Diffusion Model\n\nDeep reinforcement learning (DRL) has become an effective method for dealing with various network optimization problems in dynamic environments. Generally, DRL employs deep neural networks (DNNs) to provide optimal actions according to the current environment state. Multi-layer perceptrons (MLPs), a prevalent fully-connected DNN architecture in DRL, consist of hidden layers with nonlinear activation functions. However, the ASCEE-MOP faces unique challenges, such as the mobility of eavesdroppers, which introduces uncertainty and results in a highly dynamic and complex state space. Moreover, ASCEE-MOP involves intricate trade-offs between various optimization objectives, making it challenging to identify optimal solutions in this constantly changing environment. Thus, traditional MLP approaches may struggle to fully capture and balance these interconnected objectives.\n\nIn contrast, generative diffusion models \\[[45](#bib.bib45)\\], \\[[46](#bib.bib46)\\], with their superior feature learning capabilities, can better comprehend environmental states and the relationships between different objectives. This understanding allows DRL agents to make more balanced and optimized decisions in the highly uncertain and dynamic environment of ASCEE-MOP. Consequently, the use of diffusion models can be highly advantageous for addressing the complex issues inherent in ASCEE-MOP.\n\n#### 5.3.2 Diffusion Model\n\nDiffusion model, such as the denoising diffusion probabilistic model (DDPM)\\[[47](#bib.bib47)\\], operate through a dual-phase process that are the forward process and reverse process. Specifically, the forward phase incrementally adds Gaussian noise to the data, converting it progressively into a pure noise distribution. Conversely, the reverse phase reconstructs the original data by systematically removing this noise.\n\nForward Process: Given a original data $`{\\mathbf{x}}_{0}`$, the forward process produces a series of noisy samples $`{\\{{\\mathbf{x}}_{t}\\}}_{t = 0}^{T}`$ by gradually adding the Gaussian noise. Specifically, at each step $`t`$, the noisy sample $`{\\mathbf{x}}_{t}`$ is sampled from the distribution $`{\\mathbf{p}}\\hspace{0pt}{(\\left. {\\mathbf{x}}_{t} \\middle| {\\mathbf{x}}_{t - 1} \\right.)}`$, which is generated from the previous sample $`{\\mathbf{x}}_{t - 1}`$ by using the method as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒑​(𝒙t|𝒙t−1)=𝒩​(𝒙t;1−βt​𝒙t−1,βt​𝑰),𝒑conditionalsubscript𝒙𝑡subscript𝒙𝑡1𝒩subscript𝒙𝑡1subscript𝛽𝑡subscript𝒙𝑡1subscript𝛽𝑡𝑰\\bm{p}(\\bm{x}_{t}|\\bm{x}_{t-1})=\\mathcal{N}(\\bm{x}_{t};\\sqrt{1-\\beta_{t}}\\bm{x}_{t-1},\\beta_{t}\\bm{I}),</td>\n<td></td>\n<td>(28)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\mathbf{I}`$ represents the identity matrix, and $`\\beta_{t}`$ is a variance schedule that is controlled by the variance preserving (VP) schedule. Moreover, $`\\beta_{t}`$ is the variance function of VP stochastic differential equations, which is as follows \\[[48](#bib.bib48)\\]:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>βt=1−e−βminT−2​t−12​T2​(βmax−βmin),subscript𝛽𝑡1superscript𝑒subscript𝛽min𝑇2𝑡12superscript𝑇2subscript𝛽maxsubscript𝛽min\\beta_{t}=1-e^{-\\frac{\\beta_{\\text{min}}}{T}-\\frac{2t-1}{2T^{2}}(\\beta_{\\text{max}}-\\beta_{\\text{min}})},</td>\n<td></td>\n<td>(29)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\beta_{m\\hspace{0pt}i\\hspace{0pt}n}`$ and $`\\beta_{m\\hspace{0pt}a\\hspace{0pt}x}`$ are the two constants that define the minimum and maximum variance.\n\nThe entire forward process from $`{\\mathbf{x}}_{0}`$ to $`{\\mathbf{x}}_{T}`$ can be expressed as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒑​(𝒙T|𝒙0)=∏t=1T𝒑​(𝒙t|𝒙t−1).𝒑conditionalsubscript𝒙𝑇subscript𝒙0superscriptsubscriptproduct𝑡1𝑇𝒑conditionalsubscript𝒙𝑡subscript𝒙𝑡1\\bm{p}(\\bm{x}_{T}|\\bm{x}_{0})=\\prod_{t=1}^{T}\\bm{p}(\\bm{x}_{t}|\\bm{x}_{t-1}).</td>\n<td></td>\n<td>(30)</td>\n</tr>\n</tbody>\n</table>\n\nMoreover, the forward process that delineates the mathematical relation between $`{\\mathbf{x}}_{0}`$ and any $`{\\mathbf{x}}_{t}`$ is described as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒙t=α¯t​𝒙0+1−α¯t​ϵ,subscript𝒙𝑡subscript¯𝛼𝑡subscript𝒙01subscript¯𝛼𝑡bold-italic-ϵ\\bm{x}_{t}=\\sqrt{\\bar{\\alpha}_{t}}\\bm{x}_{0}+\\sqrt{1-\\bar{\\alpha}_{t}}\\bm{\\epsilon},</td>\n<td></td>\n<td>(31)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\overline{\\alpha}}_{t} = {\\prod_{k = 1}^{t}\\alpha_{k}}`$ represents the cumulative product of $`\\alpha_{k}`$ for all steps $`k \\leq t`$, wherein $`\\alpha_{t} = {1 - \\beta_{t}}`$, and $`\\mathbf{\\epsilon} \\sim {\\mathcal{N}\\hspace{0pt}(\\mathbf{0},\\mathbf{I})}`$ is a standard Gaussian noise. With an increase in $`t`$, $`{\\mathbf{x}}_{T}`$ gradually transitions into purely noise, adhering to an isotropic Gaussian distribution $`\\mathcal{N}\\hspace{0pt}{(0,{\\mathbf{I}})}`$. However, note that due to the absence of an optimal decision solution dataset (i.e., $`{\\mathbf{x}}_{0}`$ in the forward process) for the formulated optimization problem, the forward process is not integrated into the proposed GDMTD3.\n\nReverse Process: In the reverse process, the goal is to recover the original data $`{\\mathbf{x}}_{0}`$ from a noisy sample $`{\\mathbf{x}}_{T}`$ that follows a standard Gaussian distribution $`\\mathcal{N}\\hspace{0pt}{(\\mathbf{0},{\\mathbf{I}})}`$ by iteratively removing the noise. However, the statistical distribution $`q\\hspace{0pt}{(\\left. {\\mathbf{x}}_{t - 1} \\middle| {\\mathbf{x}}_{t} \\right.)}`$ necessitate computations that involve the data distribution, which is typically intractable in practice. Instead, our strategy is to approximate the conditional distribution $`q\\hspace{0pt}{(\\left. {\\mathbf{x}}_{t - 1} \\middle| {\\mathbf{x}}_{t} \\right.)}`$ by using a parameterized model $`p_{\\theta_{d}}`$, which can be expressed as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒑𝜽𝒅​(𝒙t−1|𝒙t)=𝒩​(𝒙t−1;𝜿𝜽𝒅​(𝒙t,t,𝒈),β~t​𝑰),subscript𝒑subscript𝜽𝒅conditionalsubscript𝒙𝑡1subscript𝒙𝑡𝒩subscript𝒙𝑡1subscript𝜿subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈subscript~𝛽𝑡𝑰\\bm{p_{\\theta_{d}}}(\\bm{x}_{t-1}|\\bm{x}_{t})=\\mathcal{N}(\\bm{x}_{t-1};\\bm{\\kappa_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g}),\\tilde{\\beta}_{t}\\bm{I}),</td>\n<td></td>\n<td>(32)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\mathbf{κ}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{t},t,{\\mathbf{g}})}`$ is the mean, wherein $`\\mathbf{g}`$ is the condition information, and $`{\\overset{\\sim}{\\beta}}_{t}`$ represents a predetermined variance factor, which is represented as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>β~t=1−α¯t−11−α¯t​βt.subscript~𝛽𝑡1subscript¯𝛼𝑡11subscript¯𝛼𝑡subscript𝛽𝑡\\tilde{\\beta}_{t}=\\frac{1-\\bar{\\alpha}_{t-1}}{1-\\bar{\\alpha}_{t}}\\beta_{t}.</td>\n<td></td>\n<td>(33)</td>\n</tr>\n</tbody>\n</table>\n\nUtilizing Bayesian formulation, the reverse process is restructured as a Gaussian probability density function. The mean for the reverse process is computed as follows \\[[47](#bib.bib47)\\]:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝜿𝜽𝒅​(𝒙t,t,𝒈)=αt​(1−α¯t−1)1−α¯t​𝒙t+α¯t−1​βt1−α¯t​𝒙0.subscript𝜿subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈subscript𝛼𝑡1subscript¯𝛼𝑡11subscript¯𝛼𝑡subscript𝒙𝑡subscript¯𝛼𝑡1subscript𝛽𝑡1subscript¯𝛼𝑡subscript𝒙0\\bm{\\kappa_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g})=\\frac{\\sqrt{\\alpha_{t}}\\left(1-\\bar{\\alpha}_{t-1}\\right)}{1-\\bar{\\alpha}_{t}}\\bm{x}_{t}+\\frac{\\sqrt{\\bar{\\alpha}_{t-1}}\\beta_{t}}{1-\\bar{\\alpha}_{t}}\\bm{x}_{0}.</td>\n<td></td>\n<td>(34)</td>\n</tr>\n</tbody>\n</table>\n\nNonetheless, the parameterized model $`{\\mathbf{p}}_{{\\mathbf{θ}}_{\\mathbf{d}}}`$ does not have access to $`{\\mathbf{x}}_{0}`$ and therefore must estimate it as a substitute. According to Eq. ([31](#S5.E31 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")), $`{\\mathbf{x}}_{0}`$ can be calculated as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒙0=1α¯t​(𝒙t−1−α¯t⋅𝜺𝜽𝒅​(𝒙t,t,𝒈)),subscript𝒙01subscript¯𝛼𝑡subscript𝒙𝑡⋅1subscript¯𝛼𝑡subscript𝜺subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈\\bm{x}_{0}=\\frac{1}{\\sqrt{\\bar{\\alpha}_{t}}}\\left(\\bm{x}_{t}-\\sqrt{1-\\bar{\\alpha}_{t}}\\cdot\\bm{\\varepsilon_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g})\\right),</td>\n<td></td>\n<td>(35)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`{\\mathbf{ε}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{t},t,{\\mathbf{g}})}`$ is a deep neural network that generates the denoising noise based on the condition $`\\mathbf{g}`$, and then indirectly approximate the mean by\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝜿𝜽𝒅​(𝒙t,t,𝒈)=1αt​(𝒙t−βt⋅𝜺𝜽𝒅​(𝒙t,t,𝒈)1−α¯t).subscript𝜿subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈1subscript𝛼𝑡subscript𝒙𝑡⋅subscript𝛽𝑡subscript𝜺subscript𝜽𝒅subscript𝒙𝑡𝑡𝒈1subscript¯𝛼𝑡\\bm{\\kappa_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g})=\\frac{1}{\\sqrt{\\alpha_{t}}}\\left(\\bm{x}_{t}-\\frac{\\beta_{t}\\cdot\\bm{\\varepsilon_{\\theta_{d}}}(\\bm{x}_{t},t,\\bm{g})}{\\sqrt{1-\\bar{\\alpha}_{t}}}\\right).</td>\n<td></td>\n<td>(36)</td>\n</tr>\n</tbody>\n</table>\n\nTracing the reverse transitions from $`{\\mathbf{x}}_{T}`$ back to $`{\\mathbf{x}}_{1}`$, we can establish the generative distribution $`{\\mathbf{p}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{0})}`$ as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒑𝜽𝒅​(𝒙0)=𝒑​(𝒙T)​∏t=1T𝒑𝜽𝒅​(𝒙t−1|𝒙t),subscript𝒑subscript𝜽𝒅subscript𝒙0𝒑subscript𝒙𝑇superscriptsubscriptproduct𝑡1𝑇subscript𝒑subscript𝜽𝒅conditionalsubscript𝒙𝑡1subscript𝒙𝑡\\bm{p_{\\theta_{d}}}(\\bm{x}_{0})=\\bm{p}(\\bm{x}_{T})\\prod_{t=1}^{T}\\bm{p_{\\theta_{d}}}(\\bm{x}_{t-1}|\\bm{x}_{t}),</td>\n<td></td>\n<td>(37)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`p\\hspace{0pt}{({\\mathbf{x}}_{T})}`$ represents a standard normal distribution. Once the generative distribution $`{\\mathbf{p}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{0})}`$ is successfully trained, we can then proceed to sample $`{\\mathbf{x}}_{0}`$ from Eq. ([37](#S5.E37 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")).\n\nInput:\n\nThe state of current environment $`{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}`$\n\nOutput:\n\nThe action decision $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}`$\n\n1 Initialize a random Gaussian distribution $`{\\mathbf{x}}_{T} \\sim {\\mathcal{N}\\hspace{0pt}{(0,{\\mathbf{I}})}}`$;\n\n2 for *the denoising step $`t = T`$ to $`1`$* do\n\n3       Deduce a denoising distribution $`{\\mathbf{ε}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{t},t,{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$ by a deep neural network;\n\n4       Compute the mean $`{\\mathbf{κ}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{\\mathbf{t}},t,{{\\mathbf{s}}\\hspace{0pt}{\\lbrack n\\rbrack}})}`$ of $`{\\mathbf{p}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{(\\left. {\\mathbf{x}}_{t - 1} \\middle| {\\mathbf{x}}_{t} \\right.)}`$ according to Eq. ([36](#S5.E36 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"));\n\n5       Compute the distribution $`{\\mathbf{x}}_{t - 1}`$ using the reparameterization trick according to Eq. ([38](#S5.E38 \"In 5.3.3 Integration of Diffusion Model and Actor Network of TD3 ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"));\n\n6 end for\n\n7Compute the distribution of $`{\\mathbf{x}}_{0}`$ according to Eq. ([37](#S5.E37 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) and randomly select an action $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}`$ based on it;\n\nreturn $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack n\\rbrack}`$\n\nAlgorithm 1\n\nAction Sampling Based on Generative Diffusion Model\n\n#### 5.3.3 Integration of Diffusion Model and Actor Network of TD3\n\nIntegrating diffusion model into the actor network of conventional TD3 algorithm significantly enhances the decision-making by providing a more diverse set of potential actions. Specifically, the generative capabilities of diffusion model allow for the creation of complex action sets, which are refined through the learned reverse process, enabling direct sampling of actions from the generative distribution $`{\\mathbf{p}}_{{\\mathbf{θ}}_{\\mathbf{d}}}\\hspace{0pt}{({\\mathbf{x}}_{0})}`$.\n\nA significant challenge in integrating diffusion model is managing stochastic components, which complicates gradient descent methods typically used in training. To overcome this issue, a reparameterization process that facilitates differentiable sampling is employed, which can be represented as follows:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>𝒙t−1=𝜿𝜽𝒅​(𝒙t,t,𝒔)+(β~t/2)2⊙ϵ,subscript𝒙𝑡1subscript𝜿subscript𝜽𝒅subscript𝒙𝑡𝑡𝒔direct-productsuperscriptsubscript~𝛽𝑡22bold-italic-ϵ\\bm{x}_{t-1}=\\bm{\\kappa}_{\\bm{{\\theta_{d}}}}\\left(\\bm{x}_{t},t,\\bm{s}\\right)+\\left(\\tilde{\\beta}_{t}/2\\right)^{2}\\odot\\bm{\\epsilon},</td>\n<td></td>\n<td>(38)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\mathbf{s}`$ which represents the current state of the environment in DRL, is used as a conditional variable in the parameterization function $`{\\mathbf{κ}}_{{\\mathbf{θ}}_{\\mathbf{d}}}`$. Moreover, $`\\odot`$ is the operator of Hadamard product.\n\nThis adaptation allows the diffusion process to be contextually responsive and adjusting actions dynamically according to the state of the environment, which is crucial for DRL algorithms where the environmental state guides the necessary action responses. Accordingly, the main steps of the action sampling process based on generative diffusion model is detailed in Algorithm [1](#alg1 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\").\n\n### 5.4 Main Flow of Proposed Algorithm\n\nFig. [2](#S5.F2 \"Figure 2 ‣ 5.4 Main Flow of Proposed Algorithm ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") shows the framework and main flow of the proposed GDMTD3 for the formulated ASCEE-MOP. Specifically, the proposed method integrates the diffusion model within DRL, which enhances the capability of the actor network for navigating the complex decision spaces under high-dimensional and noisy input data. The detailed implementation of this process is elaborated in Algorithm [2](#alg2 \"In 5.4 Main Flow of Proposed Algorithm ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\").\n\n[Refer to caption](/html/2407.08914/assets/x2.png)\n\nFigure 2: Schematic of GDMTD3 framework, where the generative diffusion model is integrated into the actor network of TD3 algorithm to capture complex state features and generate optimal actions according to the current state of the environment.\n\n1 Initialize two online critic networks denoted as $`{\\mathbf{Q}}_{\\mathbf{1}}`$ and $`{\\mathbf{Q}}_{\\mathbf{2}}`$ with parameters $`{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}`$ and $`{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{2}}}`$ and a generative diffusion-enabled online actor network denoted as $`\\mathbf{ε}`$ with parameters $`{\\mathbf{θ}}_{\\mathbf{d}}`$;\n\n2 Initialize the corresponding target networks: $`{{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}'\\leftarrow{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}},{{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{2}}}'\\leftarrow{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{2}}}}`$ and $`{\\mathbf{θ}}_{\\mathbf{μ}}'\\leftarrow{\\mathbf{θ}}_{\\mathbf{μ}}`$;\n\n3 for *the training episode = $`1`$ to $`M`$* do\n\n4       Reset the initial state $`{\\mathbf{s}}\\hspace{0pt}{\\lbrack 0\\rbrack}`$ of environment;\n\n5       repeat\n\n6             $`{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\leftarrow 0`$;\n\n7             Call Algorithm [1](#alg1 \"In 5.3.2 Diffusion Model ‣ 5.3 Generative Diffusion Model for Actor Network ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") to obtain the action $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}`$;\n\n8             Execute the action $`{\\mathbf{a}}\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}`$ in the environment and receive the reward $`r\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}`$ and the next state $`{\\mathbf{s}}\\hspace{0pt}{\\lbrack{{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p} + 1}\\rbrack}`$ from the environment;\n\n9             Store the experience $`({{\\mathbf{s}}\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}},{{\\mathbf{a}}\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}},{r\\hspace{0pt}{\\lbrack{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\rbrack}},{{\\mathbf{s}}\\hspace{0pt}{\\lbrack{{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p} + 1}\\rbrack}})`$ in the replay buffer $`\\mathcal{D}`$;\n\n10             Sample a random batch $`\\mathcal{B}`$ from the replay buffer $`\\mathcal{D}`$;\n\n11             Update the online critic network parameters according to Eq. ([23](#S5.E23 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"));\n\n12             if *$`{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\operatorname{mod}d`$* then\n\n13                   Update the actor network parameters according to Eq. ([25](#S5.E25 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"));\n\n14                   Soft-update the target networks according to Eqs. ([26](#S5.E26 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) and ([27](#S5.E27 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"));\n\n15             end if\n\n16            $`{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p}\\leftarrow{{s\\hspace{0pt}t\\hspace{0pt}e\\hspace{0pt}p} + 1}`$;\n\n17      until *environment is terminated*;\n\n18 end for\n\nAlgorithm 2\n\nGDMTD3\n\n#### 5.4.1 Training and Execution\n\nIn the considered UAV swarm-enabled surveillance network system, the RBS coordinates the training phase through an actor-critic network framework. In this phase, the interaction information between UAV swarm and the environment is regularly recorded and stored into a replay buffer. Note that the RBS possesses the sufficient capabilities to transmit the training parameters to UAV swarm \\[[49](#bib.bib49)\\]. Following a comprehensive training period, the actor network is then integrated with UAV swarm, steering their real-time operations to adaptively accomplish the secure communication mission throughout the execution phase.\n\n#### 5.4.2 Complexity Analysis\n\nIn this section, we analyze the computational and space complexity of GDMTD3 during training and execution phases.\n\nTraining Phase: The computational complexity of GDMTD3 is $`\\mathcal{O}\\hspace{0pt}{({{4\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}} + {2\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}} + {M\\hspace{0pt}N\\hspace{0pt}T\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}} + {M\\hspace{0pt}N\\hspace{0pt}V} + {M\\hspace{0pt}N\\hspace{0pt}{({2\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}})}} + {{{M\\hspace{0pt}N}/d}\\hspace{0pt}{({{2\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}} + {2\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}}})}}})}`$ in the training phase, which can be summarized as follows:\n\n- •\n\n  Network Initialize: This phase involves the initialization of network parameters. Specifically, the computational complexity is expressed as $`\\mathcal{O}\\hspace{0pt}{({{4\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}} + {2\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}}})}`$, where $`|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|`$ denotes the number of parameters in each of the twin online critic networks, and $`|{\\mathbf{θ}}_{\\mathbf{d}}|`$ represents the number of parameters in the diffusion-enabled online actor network.\n\n- •\n\n  Action Sampling: This phase entails generating actions according to the current state using the diffusion reverse process, and its complexity is $`\\mathcal{O}\\hspace{0pt}{({M\\hspace{0pt}N\\hspace{0pt}T\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}})}`$. Here, $`M`$ denotes the number of training episodes, $`N`$ is the number of steps per episode, and $`T`$ is the number of denoising steps required to sample an action in diffusion-enabled actor network.\n\n- •\n\n  Replay Buffer Collection: The complexity of collecting state transitions in the replay buffer is $`\\mathcal{O}\\hspace{0pt}{({M\\hspace{0pt}N\\hspace{0pt}V})}`$, where $`V`$ represents the complexity of interacting with environment.\n\n- •\n\n  Network Update: The updating phase is divided into three main parts that are the frequent updates of the critic networks and less frequent updates of the actor network along with their respective soft updates. Thus, the complexity for this phase is calculated as $`\\mathcal{O}\\hspace{0pt}{({{M\\hspace{0pt}N\\hspace{0pt}{({2\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}})}} + {{{M\\hspace{0pt}N}/d}\\hspace{0pt}{({{2\\hspace{0pt}{|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}|}} + {2\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}}})}}})}`$.\n\nIn the training phase, the space complexity of GDMTD3 is $`\\mathcal{O}{(4|{\\mathbf{θ}}_{{\\mathbf{Q}}_{\\mathbf{1}}}| + 2|{\\mathbf{θ}}_{\\mathbf{d}}|)} + D\\left( 2|{\\mathbf{s}}| + |{\\mathbf{a}}| + 1 \\right))`$, where $`D`$ represents the size of the replay buffer and $`|{\\mathbf{s}}|`$, $`|{\\mathbf{a}}|`$ denote the dimensions of the state and action spaces, respectively. This space complexity accounts for the storage of neural network parameters and the data structures required to maintain the replay buffer, which holds tuples of states, actions, rewards, and next states.\n\nExecution Phase: During the execution phase, the computational complexity of GDMTD3 is $`\\mathcal{O}\\hspace{0pt}{({M\\hspace{0pt}N\\hspace{0pt}T\\hspace{0pt}{|{\\mathbf{θ}}_{\\mathbf{d}}|}})}`$, which can be contributed by action selection according to the current state using the diffusion-enabled actor network. Moreover, the space complexity during the execution phase is $`\\mathcal{O}\\hspace{0pt}{({|{\\mathbf{θ}}_{\\mathbf{d}}|})}`$ since the diffusion-enabled actor network parameters need to be stored in memory for action selection.\n\n## 6 Simulation Results\n\nIn this section, we present the comprehensive evaluations of our proposed approach and verify the effectiveness and robustness of the proposed GDMTD3 in addressing ASCEE-MOP under various settings.\n\n### 6.1 Simulation Setup\n\nThis section provides an extensive description of the simulation setup, including the simulation platform, environmental details, model design, and benchmarks utilized to evaluate the performance of the proposed approach.\n\n#### 6.1.1 Simulation Platform\n\nOur experiments are conducted using a computing setup that included an NVIDIA GeForce RTX 3090 GPU with 24 GB of memory and a 13th Gen Intel(R) Core(TM) i9-13900K 32-core processor with 128 GB of RAM. The operating system on the workstation is Ubuntu 22.04.3 LTS. For our deep learning computations, we use PyTorch 2.2.2, along with the CUDA 11.8.\n\n#### 6.1.2 Environmental Details\n\nIn this study, we consider a UAV swarm consisting of 8 individual UAVs, each of which equipped with a transmit power of $`0.1`$ W. Moreover, the swarm is dispersed randomly within an area measuring $`40`$ m by $`40`$ m. To simulate potential security threats, we incorporate a mobile eavesdropper, which follows the Gauss-Markov mobility model \\[[50](#bib.bib50)\\]. This model is characterized by an average speed of $`5.0`$ m/s, a correlation coefficient of $`0.1`$, and a random variance of $`1.0`$, which together dictate the stochastic and dynamic aspects of the eavesdropper movement. In addition, Table [II](#S6.T2 \"TABLE II ‣ 6.1.2 Environmental Details ‣ 6.1 Simulation Setup ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") provides the details about the channel characteristics and the UAVs.\n\nTABLE II: Other Environmental Parameter Settings \\[[40](#bib.bib40)\\] \\[[51](#bib.bib51)\\]\n<table>\n<thead>\n<tr>\n<th>Parameter</th>\n<th>Value</th>\n<th>Parameter</th>\n<th>Value</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<th>fcsubscript𝑓𝑐f_{c}</th>\n<td>2.42.42.4 GHz</td>\n<td>μ1subscript𝜇1\\mu_{1}</td>\n<td>111 dB</td>\n</tr>\n<tr>\n<th>c0subscript𝑐0c_{0}</th>\n<td>9.619.619.61</td>\n<td>μ2subscript𝜇2\\mu_{2}</td>\n<td>202020 dB</td>\n</tr>\n<tr>\n<th>c1subscript𝑐1c_{1}</th>\n<td>0.160.160.16</td>\n<td>W𝑊W</td>\n<td>19.619.619.6 N</td>\n</tr>\n<tr>\n<th>v0subscript𝑣0v_{0}</th>\n<td>4.034.034.03</td>\n<td>utipssubscript𝑢tipsu_{\\text{tips}}</td>\n<td>120120120</td>\n</tr>\n<tr>\n<th>d0subscript𝑑0d_{0}</th>\n<td>0.60.60.6</td>\n<td>ρ𝜌\\rho</td>\n<td>1.2251.2251.225</td>\n</tr>\n<tr>\n<th>s𝑠s</th>\n<td>0.050.050.05</td>\n<td>A𝐴A</td>\n<td>0.5030.5030.503</td>\n</tr>\n<tr>\n<th>M𝑀M</th>\n<td>0.10.10.1</td>\n<td>κ𝜅\\kappa</td>\n<td>0.0120.0120.012</td>\n</tr>\n<tr>\n<th>ΩΩ\\Omega</th>\n<td>300300300</td>\n<td>ΛΛ\\Lambda</td>\n<td>0.40.40.4</td>\n</tr>\n</tbody>\n</table>\n\n#### 6.1.3 Model Design\n\nGDMTD3 utilizes a diffusion model at the core of its actor network, and it employs two structurally identical critic networks to address overestimation issues. Specifically, the critic networks consist of three-layer MLPs with ReLU activation function \\[[52](#bib.bib52)\\]. Moreover, Fig. [3](#S6.F3 \"Figure 3 ‣ 6.1.3 Model Design ‣ 6.1 Simulation Setup ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") shows the detailed configuration of actor network. Specifically, the actor network in GDMTD3 uses sinusoidal position embeddings to capture the temporal dynamics inside the diffusion process and predicts the denoised distribution according to the current state and a random Gaussian distribution. This enhancement enables the actor network to better understand the interdependencies among steps in the diffusion chain. In addition, the Adam optimizer \\[[53](#bib.bib53)\\] is used to train the actor and critic networks, with a learning rate of $`{l\\hspace{0pt}r} = {3 \\times 10^{- 4}}`$ for each network. The target networks, which replicate the structure of the online networks, can minimize the learning variance. We adopt a soft update rate of $`\\tau = 0.005`$ as specified in Eqs. ([26](#S5.E26 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")) and ([27](#S5.E27 \"In 5.2.5 Network Training ‣ 5.2 Basic Principles of Conventional TD3 ‣ 5 The Proposed GDMTD3 ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")). Additional training hyperparameters are outlined in Table [III](#S6.T3 \"TABLE III ‣ 6.1.3 Model Design ‣ 6.1 Simulation Setup ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\").\n\n[Refer to caption](/html/2407.08914/assets/x3.png)\n\nFigure 3: The diffusion-enabled actor network architecture, where Mish activation function \\[[54](#bib.bib54)\\] is adopted.\n\nTABLE III: Other Training Parameter Settings\n<table>\n<thead>\n<tr>\n<th>Parameter</th>\n<th>Description</th>\n<th>Value</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>B𝐵B</td>\n<td>Batch size</td>\n<td>128128128</td>\n</tr>\n<tr>\n<td>γ𝛾\\gamma</td>\n<td>Discount factor</td>\n<td>0.900.900.90</td>\n</tr>\n<tr>\n<td>D𝐷D</td>\n<td>Capacity of the experience replay buffer</td>\n<td>2×1062superscript1062\\times 10^{6}</td>\n</tr>\n<tr>\n<td>d𝑑d</td>\n<td>Frequency of policy updates</td>\n<td>222</td>\n</tr>\n<tr>\n<td>T𝑇T</td>\n<td>Denoising steps for the diffusion model</td>\n<td>444</td>\n</tr>\n<tr>\n<td>M𝑀M</td>\n<td>Number of training episodes</td>\n<td>800080008000</td>\n</tr>\n</tbody>\n</table>\n\n#### 6.1.4 Benchmarks\n\nTo validate the superiority of our proposed approach, we compare the following approaches:\n\n- •\n\n  Random Strategy: The random strategy arranges each UAV in a random position within the surveillance area at each time slot, without any specific formation. The excitation current weight for each UAV is also assigned random values within the allowable range. This approach serves as a baseline to evaluate the performance improvements achieved by more strategies.\n\n- •\n\n  Linear Antenna Array Strategy: The linear antenna array (LAA) strategy arranges UAVs in a linear alignment with an equal inter-UAV separation distance of 0.5 m. Moreover, the geometric center of the linear formation of UAVs coincides with the center of the designated monitoring region.\n\n- •\n\n  Planar Antenna Array Strategy: The planar antenna array (PAA) strategy arranges UAVs in a two-dimensional grid with an equal inter-UAV separation distance of 0.5 m. Similarly, the geometric center of grid formation of UAVs coincides with the center of the monitoring region.\n\n- •\n\n  Circular Antenna Array Strategy: The circular antenna array (CAA) strategy arranges UAVs in a circular pattern with a radius of 0.5 m and equal inter-UAV separation distance. Similarly to the LAA and PAA strategies, the center point of this circular UAV formation coincides with the center of the designated monitoring region.\n\n- •\n\n  The Proposed GDM-enabled DRL Approach: Our approach optimizes the secure rate of system and the flight energy consumption of the UAV swarm by formulating the ASCEE-MOP, and then solving it by using the proposed GDMTD3 algorithm.\n\nIn addition to comparing these approaches, we also compare the proposed GDMTD3 with four well-known DRL benchmarks: DDPG, TD3, SAC \\[[55](#bib.bib55)\\], and PPO \\[[56](#bib.bib56)\\]. Specifically, DDPG, TD3, and SAC are off-policy methods that are used for the continuous action spaces and utilize advanced strategies for stability and performance enhancement. In contrast, PPO is an on-policy method that offers robustness and simplicity in implementation, which is also suitable for the continuous action but focuses on effective policy updates through direct learning from the current policy. Moreover, we implement a transformer-based TD3 method as another point of comparison, which serves as a benchmark to evaluate the capability of the proposed diffusion model in extracting relevant features and representing complex state representations for DRL. Specifically, this method employs a transformer network \\[[57](#bib.bib57)\\] with two attention heads as the actor network, designed to handle sequential dependencies and complex state representations.\n\n### 6.2 Simulation Results\n\nThe detailed results of our simulation are provided in this section. We compare the effectiveness of the proposed GDM-enabled DRL approach with several above-mentioned benchmark deployment policies, and analyze the performance of the proposed GDMTD3 under various algorithm configurations and environmental settings.\n\n#### 6.2.1 Comparisons with Other Deployment Policies\n\n[Refer to caption](/html/2407.08914/assets/x4.png)\n\nFigure 4: Comparison results of the proposed GDM-enabled DRL approach and other four deployment policies. (a) Average secrecy rate per step. (b) Average flight energy consumption per step.\n\nIn this part, the proposed GDM-enabled DRL approach is compared to the four different deployment policies. Specifically, Figs. [4](#S6.F4 \"Figure 4 ‣ 6.2.1 Comparisons with Other Deployment Policies ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(a) and [4](#S6.F4 \"Figure 4 ‣ 6.2.1 Comparisons with Other Deployment Policies ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(b) show the average secrecy rate of the system and average flight energy consumption of the UAV swarm, respectively.\n\nAs shown in Fig. [4](#S6.F4 \"Figure 4 ‣ 6.2.1 Comparisons with Other Deployment Policies ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(a), the GDM-enabled DRL approach obtains a higher average secrecy rate. This result demonstrates the effectiveness of our proposed approach in ensuring secure communications by optimizing excitation current weights and positions of UAVs. Interestingly, the random strategy performs better than the structured LAA, PAA, and CAA strategies. The most likely reason is that the fixed formations in these three deployment strategies make it more difficult to handle the mobility of the eavesdropper.\n\nFrom Fig. [4](#S6.F4 \"Figure 4 ‣ 6.2.1 Comparisons with Other Deployment Policies ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(b), it is evident that the suggested GDM-enabled DRL strategy uses less energy on average than the other approaches. the proposed GDM-enabled DRL approach exhibits the lower average energy consumption compared to the other strategies. This highlights the efficiency of the proposed GDM-enabled DRL approach in optimizing the flight energy consumption of UAV swarm, which is crucial for the operation of resource-constrained UAVs. Moreover, the random policy shows the highest energy consumption, reflecting its inefficiency. In addition, the LAA, PAA, and CAA strategies demonstrate moderate energy consumption, but they do not achieve the same level of secrecy rate as the proposed GDM-enabled DRL approach, underscoring the advantage of the proposed GDM-enabled DRL approach in optimizing energy consumption while maintaining secure communications.\n\nIn conclusion, it is apparent that the proposed GDM-enabled DRL approach achieves a superior performance in terms of both the secrecy rate of the system and the flight energy consumption of the UAV swarm.\n\n#### 6.2.2 Comparisons with Other DRL Benchmarks\n\n[Refer to caption](/html/2407.08914/assets/x5.png)\n\nFigure 5: Comparison results of GDMTD3 and DRL benchmarks. (a) Reward per episode. (b) Average secrecy rate per step. (c) Average flight energy consumption per step.\n\nFig. [5](#S6.F5 \"Figure 5 ‣ 6.2.2 Comparisons with Other DRL Benchmarks ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\") shows the comparison results of GDMTD3 with five different DRL benchmarks, including TD3, PPO, DDPG, SAC and transformer-based TD3 methods. As shown in Fig. [5](#S6.F5 \"Figure 5 ‣ 6.2.2 Comparisons with Other DRL Benchmarks ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(a), the proposed GDMTD3 reports significantly higher rewards per episode than the other DRL methods. This superiority of GDMTD3 is originated from the incorporation of diffusion model in GDMTD3, which allows for more efficient exploration and exploitation of the state-action space, resulting in higher cumulative rewards. Moreover, Figs. [5](#S6.F5 \"Figure 5 ‣ 6.2.2 Comparisons with Other DRL Benchmarks ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(b) and [5](#S6.F5 \"Figure 5 ‣ 6.2.2 Comparisons with Other DRL Benchmarks ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\")(c) indicate that GDMTD3 achieves the highest average secrecy rate of the system and relatively low average flight energy consumption of the UAV swarm among the compared methods. In addition, although the transformer-based TD3 method outperforms traditional TD3, PPO, DDPG, and SAC methods, it does not reach the secrecy rate achieved by GDMTD3, highlighting the advantage of diffusion model in adapting to the complex secure communication scenario involving the mobile eavesdropper.\n\n#### 6.2.3 Impact of Algorithm Parameters\n\nIn this section, we evaluate effects of different parameters on the performance of GDMTD3 including the random seed, noise schedule function, and denoising step.\n\nEffect of Different Random Seeds. DRL algorithms are known to be sensitive to random seeds, which can significantly impact their performance, sometimes even causing the algorithm failing to converge when different seeds are used \\[[58](#bib.bib58)\\]. Specifically, this sensitivity arises because random seeds influence various aspects of the training process, such as the initialization of neural network weights, the order of data processing, and the exploration strategies. To this end, we compare the impact of different random seeds on the performance of the GDMTD3. As shown in Fig. [6](#S6.F6 \"Figure 6 ‣ 6.2.3 Impact of Algorithm Parameters ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), GDMTD3 consistently converges and achieves high rewards although the reward curves vary slightly depending on the random seed. This result demonstrates its robustness and stability across different initial conditions.\n\n[Refer to caption](/html/2407.08914/assets/x6.png)\n\nFigure 6: Comparison of reward curves of GDMTD3 with different random seeds.\n\nEffect of Different Noise Schedule Functions. Diffusion-based models are also affected by the selection of noise schedule functions, which determine how parameters such as noise levels are adjusted over time \\[[59](#bib.bib59)\\]. Specifically, this influence stems from the direct effect of noise schedule functions on the diffusion process, which depends on how effectively the model learns to generate high-quality samples. In our scenario, we evaluate the impact of different noise schedule functions on the performance of GDMTD3, which includes VP, linear and cosine noise schedule functions \\[[59](#bib.bib59)\\]. As illustrated in Fig. [7](#S6.F7 \"Figure 7 ‣ 6.2.3 Impact of Algorithm Parameters ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), the results show that the VP schedule leads to the highest reward and faster convergence among the three noise schedule functions. This result highlights the superior performance of the VP schedule when applying GDMTD3 method to address the formulated ASCEE-MOP.\n\n[Refer to caption](/html/2407.08914/assets/x7.png)\n\nFigure 7: Comparison of reward curves of GDMTD3 with different schedule strategies.\n\nEffect of Different Denoising Steps. The number of denoising steps in the diffusion reverse process is another critical factor that can significantly impact the performance of diffusion-based models. First, denoising steps determine how effectively the model can reduce noise and generate high-quality samples \\[[60](#bib.bib60)\\]. Second, an increase in denoising steps also leads to longer training time. Therefore, we compare the impact of varying the number of denoising steps on the performance of GDMTD3. As shown in Fig. [8](#S6.F8 \"Figure 8 ‣ 6.2.3 Impact of Algorithm Parameters ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), increasing the number of denoising steps generally improves the performance of the diffusion model by enabling more precise noise reduction. However, beyond a certain step, which is 4 in the context of our formulated ASCEE-MOP, the benefits of additional denoising steps diminish. This is because increasing the denoising steps can cause the model to overfit the noise pattern. As a result, unnecessary details appear in the generated actions, reducing their quality. The result demonstrates the importance of selecting an appropriate number of denoising steps to balance performance and computational efficiency in the specific problem.\n\n[Refer to caption](/html/2407.08914/assets/x8.png)\n\nFigure 8: Comparison of curves of GDMTD3 with different denoising steps.\n\n#### 6.2.4 Impact of Number of UAVs\n\nTo verify the impact of the number of UAVs on system performance, we performed a detailed simulation under varying numbers of UAVs. As shown in Fig. [9](#S6.F9 \"Figure 9 ‣ 6.2.4 Impact of Number of UAVs ‣ 6.2 Simulation Results ‣ 6 Simulation Results ‣ Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning\"), the average secrecy rate of the system improves significantly with the initial increase in the number of UAVs. Specifically, when the number of UAVs increases from 4 to 8, the average secrecy rate per step rises from $`5.58`$ bps/Hz to approximately $`7.24`$ bps/Hz. This improvement is mainly attributed to the more accurate CB capabilities provided by the denser UAV network. However, the increase in the number of UAVs also leads to higher overall flight energy consumption. For instance, when the number of UAVs increases from $`8`$ to $`16`$, the average flight energy consumption per step of the system rises from approximately $`1879.85`$ J to $`2850.38`$ J. Moreover, we can notice that after the number of UAVs reaches a certain threshold, the improvement in terms of secrecy rate tends to saturate, while energy consumption still continues to increase. This may be because as the density of UAVs in the fixed space increases, the distance between array elements decreases, potentially leading to increased mutual coupling and interference among UAVs. Consequently, adding more UAVs beyond this number does not significantly enhance the security performance of the system.\n\n[Refer to caption](/html/2407.08914/assets/x9.png)\n\nFigure 9: Comparison of curves of GDMTD3 with different UAV numbers.\n\n## 7 Conclusion\n\nIn this work, we investigated a novel UAV swarm-enabled secure surveillance network system, where a UAV swarm perform CB to enhance the security performance between UAV swarm and RBS so as to resist eavesdropping attacks from mobile eavesdroppers. Moreover, we formulated an ASCEE-MOP with an aim to maximize the secrecy rate of the system while minimizing the flight energy consumption of the UAV swarm by optimizing both the excitation current weights and positions of UAVs in conjunction. To solve the non-convex, NP-hard and dynamic optimization problem, we introduced GDMTD3, which effectively captures the high-dimensional probabilistic distributions required for optimal policy decisions. Simulation results demonstrated that the GDMDRL approach outperforms various deployment policies in terms of both the secrecy rate of the system and the flight energy consumption of the UAV swarm. Additionally, the results highlighted the superiority of the GDMTD3 algorithm over several advanced DRL benchmarks in solving the formulated ASCEE-MOP.\n\n## References\n\n- \\[1\\] T. Samad, J. S. Bay, and D. N. Godbole, “Network-centric systems for military operations in urban terrain: The role of uavs,” *Proc. IEEE*, vol. 95, no. 1, pp. 92–107, Jan. 2007.\n- \\[2\\] K. Liu and J. Zheng, “UAV trajectory optimization for time-constrained data collection in UAV-enabled environmental monitoring systems,” *IEEE Internet Things J.*, vol. 9, no. 23, pp. 24 300–24 314, Dec. 2022.\n- \\[3\\] R. W. L. Coutinho and A. Boukerche, “UAV-mounted cloudlet systems for emergency response in industrial areas,” *IEEE Trans. Ind. Informatics*, vol. 18, no. 11, pp. 8007–8016, Nov. 2022.\n- \\[4\\] B. Li, Z. Fei, and Y. Zhang, “UAV communications for 5G and beyond: Recent advances and future trends,” *IEEE Internet Things J.*, vol. 6, no. 2, pp. 2241–2263, Apr. 2019.\n- \\[5\\] H. Wang, H. Zhao, W. Wu, J. Xiong, D. Ma, and J. Wei, “Deployment algorithms of flying base stations: 5G and beyond with UAVs,” *IEEE Internet Things J.*, vol. 6, no. 6, pp. 10 009–10 027, Dec. 2019.\n- \\[6\\] Y. Takahashi, Y. Kawamoto, H. Nishiyama, N. Kato, F. Ono, and R. Miura, “A novel radio resource optimization method for relay-based unmanned aerial vehicles,” *IEEE Trans. Wirel. Commun.*, vol. 17, no. 11, pp. 7352–7363, Nov. 2018.\n- \\[7\\] Y. Zeng, Q. Wu, and R. Zhang, “Accessing from the sky: A tutorial on UAV communications for 5G and beyond,” *Proc. IEEE*, vol. 107, no. 12, pp. 2327–2375, Dec. 2019.\n- \\[8\\] R. Shakeri, M. A. Al-Garadi, A. Badawy, A. Mohamed, T. Khattab, A. K. Al-Ali, K. A. Harras, and M. Guizani, “Design challenges of multi-UAV systems in cyber-physical applications: A comprehensive survey and future directions,” *IEEE Commun. Surv. Tutorials*, vol. 21, no. 4, pp. 3340–3385, Fourthquarter 2019.\n- \\[9\\] R. Ye, Y. Peng, F. Al-Hazemi, and R. Boutaba, “A robust cooperative jamming scheme for secure UAV communication via intelligent reflecting surface,” *IEEE Trans. Commun.*, vol. 72, no. 2, pp. 1005–1019, Feb. 2024.\n- \\[10\\] Z. Liu, B. Zhu, Y. Xie, K. Ma, and X. Guan, “UAV-aided secure communication with imperfect eavesdropper location: Robust design for jamming power and trajectory,” *IEEE Trans. Veh. Technol.*, vol. 73, no. 5, pp. 7276–7286, May 2024.\n- \\[11\\] J. Li, H. Kang, G. Sun, S. Liang, Y. Liu, and Y. Zhang, “Physical layer secure communications based on collaborative beamforming for UAV networks: A multi-objective optimization approach,” in *40th IEEE Conference on Computer Communications (INFOCOM)*, May 2021, pp. 1–10.\n- \\[12\\] C. Zhang, G. Sun, Q. Wu, J. Li, S. Liang, D. Niyato, and V. C. Leung, “UAV swarm-enabled collaborative secure relay communications with time-domain colluding eavesdropper,” *IEEE Trans. Mob. Comput.*, pp. 1–18, Early Access, 2024, doi: 10.1109/TMC.2024.3350885.\n- \\[13\\] M. Mozaffari, W. Saad, M. Bennis, and M. Debbah, “Communications and control for wireless drone-based antenna array,” *IEEE Trans. Commun.*, vol. 67, no. 1, pp. 820–834, Jan. 2019.\n- \\[14\\] N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y. Liang, and D. I. Kim, “Applications of deep reinforcement learning in communications and networking: A survey,” *IEEE Commun. Surv. Tutorials*, vol. 21, no. 4, pp. 3133–3174, Fourthquarter 2019.\n- \\[15\\] Z. Wang, J. J. Hunt, and M. Zhou, “Diffusion policies as an expressive policy class for offline reinforcement learning,” arXiv:2208.06193 \\[cs\\], 2022, doi: 10.48550/ARXIV.2208.06193.\n- \\[16\\] H. Cao, C. Tan, Z. Gao, Y. Xu, G. Chen, P.-A. Heng, and S. Z. Li, “A survey on generative diffusion models,” *IEEE Trans. Knowledge Data Eng.*, pp. 1–20, Early Access, 2024, doi: 10.1109/TKDE.2024.3361474.\n- \\[17\\] G. Zhang, Q. Wu, M. Cui, and R. Zhang, “Securing UAV communications via joint trajectory and power control,” *IEEE Trans. Wirel. Commun.*, vol. 18, no. 2, pp. 1376–1389, Feb. 2019.\n- \\[18\\] F. Cheng, G. Gui, N. Zhao, Y. Chen, J. Tang, and H. Sari, “UAV-relaying-assisted secure transmission with caching,” *IEEE Trans. Commun.*, vol. 67, no. 5, pp. 3140–3153, May 2019.\n- \\[19\\] Y. Zhou, C. Pan, P. L. Yeoh, K. Wang, M. Elkashlan, B. Vucetic, and Y. Li, “Secure communications for UAV-enabled mobile edge computing systems,” *IEEE Trans. Commun.*, vol. 68, no. 1, pp. 376–388, Jan. 2020.\n- \\[20\\] X. Sun, W. Yang, and Y. Cai, “Secure communication in noma-assisted millimeter-wave SWIPT UAV networks,” *IEEE Internet Things J.*, vol. 7, no. 3, pp. 1884–1897, Mar. 2020.\n- \\[21\\] A. Li, Q. Wu, and R. Zhang, “UAV-enabled cooperative jamming for improving secrecy of ground wiretap channel,” *IEEE Wirel. Commun. Lett.*, vol. 8, no. 1, pp. 181–184, Feb. 2019.\n- \\[22\\] Y. Cai, Z. Wei, R. Li, D. W. K. Ng, and J. Yuan, “Joint trajectory and resource allocation design for energy-efficient secure UAV communication systems,” *IEEE Trans. Commun.*, vol. 68, no. 7, pp. 4536–4553, Jul. 2020.\n- \\[23\\] A. Gao, Q. Wang, Y. Hu, W. Liang, and J. Zhang, “Dynamic role switching scheme with joint trajectory and power control for multi-UAV cooperative secure communication,” *IEEE Trans. Wirel. Commun.*, vol. 23, no. 2, pp. 1260–1275, Feb. 2024.\n- \\[24\\] S. S. Hanna and D. Cabric, “Distributed transmit beamforming: Design and demonstration from the Lab to UAVs,” *IEEE Trans. Wirel. Commun.*, vol. 22, no. 2, pp. 778–792, Feb. 2023.\n- \\[25\\] M. T. Mamaghani, X. Zhou, N. Yang, and A. L. Swindlehurst, “Secure short-packet communications via UAV-enabled mobile relaying: Joint resource optimization and 3D trajectory design,” *IEEE Trans. Wirel. Commun.*, Early Access, 2023, doi: 10.1109/TWC.2023.3344802.\n- \\[26\\] W. Fan, Y. Wu, X. Sun, and W. Yang, “Robust secure UAV-enabled multiple user communication with fairness consideration,” in *2020 International Conference on Wireless Communications and Signal Processing (WCSP)*, Oct. 2020, pp. 1028–1033.\n- \\[27\\] Y. Gao, H. Tang, B. Li, and X. Yuan, “Securing energy-constrained UAV communications against both internal and external eavesdropping,” *IEEE Commun. Lett.*, vol. 25, no. 3, pp. 749–753, Mar. 2021.\n- \\[28\\] Ying Gao, H. Tang, B. Li, and X. Yuan, “Energy minimization for robust secure transmission in UAV networks with multiple colluding eavesdroppers,” *IEEE Commun. Lett.*, vol. 25, no. 7, pp. 2353–2357, Jul. 2021.\n- \\[29\\] W. Mao, K. Xiong, Y. Lu, P. Fan, and Z. Ding, “Energy consumption minimization in secure multi-antenna UAV-assisted MEC networks with channel uncertainty,” *IEEE Trans. Wirel. Commun.*, vol. 22, no. 11, pp. 7185–7200, Nov. 2023.\n- \\[30\\] R. Dong, B. Wang, and K. Cao, “Security enhancement of UAV swarm enabled relaying systems with joint beamforming and resource allocation,” *China Commun.*, vol. 18, pp. 71–87, Sep. 2021.\n- \\[31\\] X. Zhou, Q. Wu, S. Yan, F. Shu, and J. Li, “UAV-enabled secure communications: Joint trajectory and transmit power optimization,” *IEEE Trans. Veh. Technol.*, vol. 68, no. 4, pp. 4069–4073, Apr. 2019.\n- \\[32\\] L. Xiao, H. Li, S. Yu, Y. Zhang, L. Wang, and S. Ma, “Reinforcement learning based network coding for drone-aided secure wireless communications,” *IEEE Trans. Commun.*, vol. 70, no. 9, pp. 5975–5988, Sep. 2022.\n- \\[33\\] R. Dong, B. Wang, J. Tian, T. Cheng, and D. Diao, “Deep reinforcement learning based UAV for securing mmwave communications,” *IEEE Trans. Veh. Technol.*, vol. 72, no. 4, pp. 5429–5434, Apr. 2023.\n- \\[34\\] J. Li, G. Sun, L. Duan, and Q. Wu, “Multi-objective optimization for UAV swarm-assisted iot with virtual antenna arrays,” *IEEE Trans. Mob. Comput.*, vol. 23, no. 5, pp. 4890–4907, May 2024.\n- \\[35\\] H. Ochiai, P. Mitran, H. V. Poor, and V. Tarokh, “Collaborative beamforming for distributed wireless ad hoc sensor networks,” *IEEE Trans. Signal Process.*, vol. 53, no. 11, pp. 4110–4124, Nov. 2005.\n- \\[36\\] J. Feng, Y. Lu, B. Jung, D. Peroulis, and Y. C. Hu, “Energy-efficient data dissemination using beamforming in wireless sensor networks,” *ACM Trans. Sens. Networks*, vol. 9, no. 3, pp. 31:1–31:30, Jun. 2013.\n- \\[37\\] A. Al-Hourani, K. Sithamparanathan, and S. Lardner, “Optimal LAP altitude for maximum coverage,” *IEEE Wirel. Commun. Lett.*, vol. 3, no. 6, pp. 569–572, Dec. 2014.\n- \\[38\\] S. K. Nobar, M. H. Ahmed, Y. Morgan, and S. A. Mahmoud, “Resource allocation in cognitive radio-enabled UAV communication,” *IEEE Trans. Cogn. Commun. Netw.*, vol. 8, no. 1, pp. 296–310, Mar. 2022.\n- \\[39\\] A. Meng, X. Gao, Y. Zhao, and Z. Yang, “Three-dimensional trajectory optimization for energy-constrained UAV-enabled IoT system in probabilistic LoS channel,” *IEEE Internet Things J.*, vol. 9, no. 2, pp. 1109–1121, Jan. 2022.\n- \\[40\\] Y. Zeng, J. Xu, and R. Zhang, “Energy minimization for wireless communication with rotary-wing UAV,” *IEEE Trans. Wirel. Commun.*, vol. 18, no. 4, pp. 2329–2345, Apr. 2019.\n- \\[41\\] P. Goos, U. Syafitri, B. Sartono, and A. R. Vazquez, “A nonlinear multidimensional knapsack problem in the optimal design of mixture experiments,” *Eur. J. Oper. Res.*, vol. 281, no. 1, pp. 201–221, Jan. 2020.\n- \\[42\\] S. Fujimoto, H. van Hoof, and D. Meger, “Addressing function approximation error in actor-critic methods,” in *Proceedings of the 35th International Conference on Machine Learning (ICML)*, Jul. 2018, pp. 1582–1591.\n- \\[43\\] T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, “Continuous control with deep reinforcement learning,” arXiv:1509.02971 \\[cs\\], 2018, doi: 10.48550/ARXIV.1509.02971.\n- \\[44\\] R. S. Sutton and A. G. Barto, *Reinforcement learning: An introduction*, 2nd ed.   Cambridge, MA, US: MIT press, Nov. 2018.\n- \\[45\\] H. GM, M. K. Gourisaria, M. Pandey, and S. S. Rautaray, “A comprehensive survey and analysis of generative models in machine learning,” *Comput. Sci. Rev.*, vol. 38, p. 100285, Nov. 2020.\n- \\[46\\] L. Yang, Z. Zhang, Y. Song, S. Hong, R. Xu, Y. Zhao, W. Zhang, B. Cui, and M. Yang, “Diffusion models: A comprehensive survey of methods and applications,” *ACM Comput. Surv.*, vol. 56, no. 4, pp. 105:1–105:39, Apr. 2024.\n- \\[47\\] J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” in *Proceedings of the 34th International Conference on Neural Information Processing Systems (NIPS)*, Dec. 2020, pp. 6840––6851.\n- \\[48\\] Z. Xiao, K. Kreis, and A. Vahdat, “Tackling the generative learning trilemma with denoising diffusion GANs,” arXiv:2112.07804 \\[cs\\], 2022, doi: 10.48550/ARXIV.2112.07804.\n- \\[49\\] M. Chen, Z. Yang, W. Saad, C. Yin, H. V. Poor, and S. Cui, “A joint learning and communications framework for federated learning over wireless networks,” *IEEE Trans. Wirel. Commun.*, vol. 20, no. 1, pp. 269–283, Jan. 2021.\n- \\[50\\] R. He, B. Ai, G. L. Stüber, and Z. Zhong, “Non-stationary mobile-to-mobile channel modeling using the Gauss-Markov mobility model,” in *9th International Conference on Wireless Communications and Signal Processing (WCSP)*, Oct. 2017, pp. 1–6.\n- \\[51\\] R. I. B. Yaliniz, A. El-Keyi, and H. Yanikomeroglu, “Efficient 3-D placement of an aerial base station in next generation cellular networks,” in *2016 IEEE International Conference on Communications (ICC)*, May 2016, pp. 1–5.\n- \\[52\\] X. Glorot, A. Bordes, and Y. Bengio, “Deep sparse rectifier neural networks,” in *Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS)*, Apr. 2011, pp. 315–323.\n- \\[53\\] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv:1412.6980 \\[cs\\], 2015, doi: 10.48550/ARXIV.1412.6980.\n- \\[54\\] D. Misra, “Mish: A self regularized non-monotonic activation function,” arXiv:1908.08681 \\[cs\\], 2019, doi: 10.48550/ARXIV.1908.08681.\n- \\[55\\] T. Haarnoja, A. Zhou, K. Hartikainen, G. Tucker, S. Ha, J. Tan, V. Kumar, H. Zhu, A. Gupta, P. Abbeel, and S. Levine, “Soft actor-critic algorithms and applications,” arXiv:1812.05905 \\[cs\\], 2018, doi: 10.48550/ARXIV.1812.05905.\n- \\[56\\] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv:1707.06347 \\[cs\\], 2017, doi: 10.48550/ARXIV.1707.06347.\n- \\[57\\] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, “Attention is all you need,” in *Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS)*, Dec. 2017, pp. 5998–6008.\n- \\[58\\] C. Colas, O. Sigaud, and P. Oudeyer, “How many random seeds? statistical power analysis in deep reinforcement learning experiments,” arXiv:1806.08295 \\[cs\\], 2018, doi: 10.48550/arXiv.1806.08295.\n- \\[59\\] A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilistic models,” in *Proceedings of the 38th International Conference on Machine Learning (ICML)*, Jul. 2021, pp. 8162–8171.\n- \\[60\\] H. Du, Z. Li, D. Niyato, J. Kang, Z. Xiong, H. Huang, and S. Mao, “Diffusion-based reinforcement learning for edge-enabled AI-generated content services,” *IEEE Trans. Mob. Comput.*, pp. 1–16, Early Access, 2024, doi: 10.1109/TMC.2024.3356178.\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Chuang Zhang received the B.S. degree in computer science and technology from Jilin University, Changchun, China, in 2021, where he is currently pursuing the Ph.D. degree with the College of Computer Science and Technology. His current research interests include UAV communications, secure communications, distributed beamforming and multi-objective optimization.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Geng Sun (Senior Member, IEEE) received the B.S. degree in communication engineering from Dalian Polytechnic University, and the Ph.D. degree in computer science and technology from Jilin University, in 2011 and 2018, respectively. He was a Visiting Researcher with the School of Electrical and Computer Engineering, Georgia Institute of Technology, USA. He is an Associate Professor in College of Computer Science and Technology at Jilin University, and His research interests include wireless networks, UAV communications, collaborative beamforming and optimizations.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Jiahui Li received a BS degree in Software Engineering, and an MS degree in Computer Science and Technology from Jilin University, Changchun, China, in 2018 and 2021, respectively. He is currently studying Computer Science at Jilin University to get a Ph.D. degree. His current research focuses on UAV networks, antenna arrays, and optimization.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Qingqing Wu (Senior Member, IEEE) received the B.Eng. and the Ph.D. degrees in Electronic Engineering from South China University of Technology and Shanghai Jiao Tong University (SJTU) in 2012 and 2016, respectively. From 2016 to 2020, he was a Research Fellow in the Department of Electrical and Computer Engineering at National University of Singapore. He is currently an Associate Professor with Shanghai Jiao Tong University. His current research interest includes intelligent reflecting surface (IRS), unmanned aerial vehicle (UAV) communications, and MIMO transceiver design. He has coauthored more than 100 IEEE journal papers with 26 ESI highly cited papers and 8 ESI hot papers, which have received more than 30,000 Google citations. He was listed as the Clarivate ESI Highly Cited Researcher in 2022 and 2021, the Most Influential Scholar Award in AI-2000 by Aminer in 2021 and World’s Top 2% Scientist by Stanford University in 2020 and 2021. He was the recipient of the IEEE Communications Society Asia Pacific Best Young Researcher Award and Outstanding Paper Award in 2022, the IEEE Communications Society Young Author Best Paper Award in 2021, the Outstanding Ph.D. Thesis Award of China Institute of Communications in 2017, the Outstanding Ph.D. Thesis Funding in SJTU in 2016, the IEEE ICCC Best Paper Award in 2021, and IEEE WCSP Best Paper Award in 2015. He was the Exemplary Editor of IEEE Communications Letters in 2019 and the Exemplary Reviewer of several IEEE journals. He serves as an Associate Editor for IEEE Transactions on Communications, IEEE Communications Letters, IEEE Wireless Communications Letters, IEEE Open Journal of Communications Society (OJ COMS), and IEEE Open Journal of Vehicular Technology (OJVT). He is the Lead Guest Editor for IEEE Journal on Selected Areas in Communications on “UAV Communications in 5G and Beyond Networks”, and the Guest Editor for IEEE OJVT on “6G Intelligent Communications” and IEEE OJ-COMS on “Reconfigurable Intelligent Surface-Based Communications for 6G Wireless Networks”. He is the workshop co-chair for IEEE ICC 2019-2022 workshop on “Integrating UAVs into 5G and Beyond”, and the workshop co-chair for IEEE GLOBECOM 2020 and ICC 2021 workshop on “Reconfigurable Intelligent Surfaces for Wireless Communication for Beyond 5G”. He serves as the Workshops and Symposia Officer of Reconfigurable Intelligent Surfaces Emerging Technology Initiative and Research Blog Officer of Aerial Communications Emerging Technology Initiative. He is the IEEE Communications Society Young Professional Chair in Asia Pacific Region.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Jiacheng Wang is the research fellow in the College of Computing and Data Science at Nanyang Technological University, Singapore. Prior to that, he received the Ph.D. degree in School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing, China. His research interests include wireless sensing, semantic communications, and generative AI, Metaverse.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Dusit Niyato (Fellow, IEEE) received the B.Eng. degree from the King Mongkuts Institute of Technology Ladkrabang (KMITL), Thailand, in 1999, and the Ph.D. degree in electrical and computer engineering from the University of Manitoba, Canada, in 2008. He is currently a Professor with the College of Computing and Data Science, Nanyang Technological University, Singapore. His research interests include the Internet of Things (IoT), machine learning, and incentive mechanism design.</td>\n</tr>\n</tbody>\n</table>\n\n<table>\n<tbody>\n<tr>\n<td>[Uncaptioned image]</td>\n<td>Yuanwei Liu (Fellow, IEEE) received the PhD degree in electrical engineering from the Queen Mary University of London, U.K., in 2016. He was with the Department of Informatics, King’s College London, from 2016 to 2017, where he was a Post-Doctoral Research Fellow. He has been a Senior Lecturer (Associate Professor) with the School of Electronic Engineering and Computer Science, Queen Mary University of London, since Aug. 2021, where he was a Lecturer (Assistant Professor) from 2017 to 2021. His research interests include non-orthogonal multiple access, reconfigurable intelligent surface, near field communications, integrated sensing and communications, and machine learning. Yuanwei Liu is a Fellow of the IEEE, a Fellow of AAIA, a Web of Science Highly Cited Researcher, an IEEE Communication Society Distinguished Lecturer, an IEEE Vehicular Technology Society Distinguished Lecturer, the rapporteur of ETSI Industry Specification Group on Reconfigurable Intelligent Surfaces on work item of “Multi-functional Reconfigurable Intelligent Surfaces (RIS): Modelling, Optimisation, and Operation”, and the UK representative for the URSI Commission C on ”Radio communication Systems and Signal Processing”. He was listed as one of 35 Innovators Under 35 China in 2022 by MIT Technology Review. He received IEEE ComSoc Outstanding Young Researcher Award for EMEA in 2020. He received the 2020 IEEE Signal Processing and Computing for Communications (SPCC) Technical Committee Early Achievement Award, IEEE Communication Theory Technical Committee (CTTC) 2021 Early Achievement Award. He received IEEE ComSoc Outstanding Nominee for Best Young Professionals Award in 2021. He is the co-recipient of the Best Student Paper Award in IEEE VTC2022-Fall, the Best Paper Award in ISWCS 2022, the 2022 IEEE SPCC-TC Best Paper Award, the 2023 IEEE ICCT Best Paper Award, and the 2023 IEEE ISAP Best Emerging Technologies Paper Award. He serves as the Co-Editor-in-Chief of IEEE ComSoc TC Newsletter, an Area Editor of IEEE Communications Letters, an Editor of IEEE Communications Surveys & Tutorials, IEEE Transactions on Wireless Communications, IEEE Transactions on Vehicular Technology, IEEE Transactions on Network Science and Engineering, and IEEE Transactions on Communications (2018-2023). He serves as the (leading) Guest Editor for Proceedings of the IEEE on Next Generation Multiple Access, IEEE JSAC on Next Generation Multiple Access, IEEE JSTSP on Intelligent Signal Processing and Learning for Next Generation Multiple Access, and IEEE Network on Next Generation Multiple Access for 6G. He serves as the Publicity Co-Chair for IEEE VTC 2019 Fall, the Panel Co-Chair for IEEE WCNC 2024, Symposium Co-Chair for several flagship conferences such as IEEE GLOBECOM, ICC and VTC. He serves the academic Chair for the Next Generation Multiple Access Emerging Technology Initiative, vice chair of SPCC and Technical Committee on Cognitive Networks (TCCN).</td>\n</tr>\n</tbody>\n</table><|endoftext|>"
    }
  },
  "nuclear": {
    "train": {
      "total_tokens": 1006119874,
      "example": "# \\[nucl-th/0005044\\] Clocking hadronization in relativistic heavy ion collisions with balance functions\n\n# Clocking hadronization in relativistic heavy ion collisions with balance functions\n\nSteffen A. Bass, Pawel Danielewicz and Scott Pratt\n\n###### Abstract\n\nA novel state of matter has been hypothesized to exist during the early stage of relativistic heavy ion collisions, with normal hadrons not appearing until several fm/c after the start of the reaction. To test this hypothesis, correlations between charges and their associated anticharges are evaluated with the use of balance functions. It is shown that late-stage hadronization is characterized by tightly correlated charge-anticharge pairs when measured as a function of relative rapidity.\n\nRelativistic heavy ion collisions produce mesoscopic regions of enormous energy density, perhaps surpassing 3 GeV/fm<sup>3</sup> in Pb collisions at the CERN SPS \\[[1](#bib.bib1), [2](#bib.bib2)\\] with even higher energy densities expected at RHIC. At such energies hadronic degrees of freedom should be replaced by quark-gluon degrees of freedom. Several experimental measurements have been proposed as signals to the quark-gluon plasma\\[[3](#bib.bib3)\\]. Among these signals is an expected enhancement in strange-quark production which should take place 5-10 fm/c into the collision when the local temperature has dropped to near 160 MeV, but the system is still far from freeze-out. Strangeness enhancement has indeed been observed in heavy ion collisions \\[[4](#bib.bib4)\\], but alternative hadronic explanations have also been put forward assuming early-stage hadronization with medium modifications, referred to as color ropes \\[[5](#bib.bib5), [6](#bib.bib6)\\] or baryon junctions \\[[7](#bib.bib7)\\]. In this paper the use of balance functions is proposed as a means to determine whether quark production occurred at early times, $`\\tau < 1`$ fm/c, or according to a late-stage hadronization scenario, see e.g. \\[[8](#bib.bib8), [9](#bib.bib9)\\].\n\nLate-stage production of quarks could be attributed to three mechanisms: formation of hadrons from gluons, conversion of the non-perturbative vacuum energy into particles, or hadronization of a quark gas at constant temperature. Hadronization of a quark gas should approximately conserve the net number of particles due to the constraint of entropy conservation. Since hadrons are formed of two or more quarks, creation of quark-antiquark pairs should accompany hadronization. All three mechanisms for late-stage quark production involve a change in the degrees of freedom. Therefore, any signal that pinpoints the time where quarks first appear in a collision would provide valuable insight into understanding whether a novel state of matter has been formed and persisted for a substantial time. The fact that the hadronic phase has a higher concentration of charges than the QGP phase at the same entropy has been discussed in the context of charge fluctuations in \\[[10](#bib.bib10)\\].\n\nThe link between balance functions and the time at which quarks are created has a simple physical explanation. Charge-anticharge pairs are created at the same location in space-time, and are correlated in rapidity due to the strong collective expansion inherent to a relativistic heavy ion collision. Pairs created earlier can separate further in rapidity due to the higher initial temperature and due to the diffusive interactions with other particles. The balance function, which describes the momentum of the accompanying antiparticle, quantifies this correlation.\n\nThe balance functions employed here are similar to observables used to investigate hadronization in jets produced in $`p\\hspace{0pt}\\overline{p}`$ or $`e^{+}\\hspace{0pt}e^{-}`$ collisions \\[[11](#bib.bib11), [12](#bib.bib12)\\]. The balance function describes the conditional probability that a particle in the bin $`p_{1}`$ will be accompanied by a particle of opposite charge in the bin $`p_{2}`$. We define the balance function,\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>B​(p2|p1)≡12​{ρ​(b,p2|a,p1)−ρ​(b,p2|b,p1)+ρ​(a,p2|b,p1)−ρ​(a,p2|a,p1)},𝐵conditionalsubscript𝑝2subscript𝑝112𝜌𝑏conditionalsubscript𝑝2𝑎subscript𝑝1𝜌𝑏conditionalsubscript𝑝2𝑏subscript𝑝1𝜌𝑎conditionalsubscript𝑝2𝑏subscript𝑝1𝜌𝑎conditionalsubscript𝑝2𝑎subscript𝑝1B(p_{2}|p_{1})\\equiv\\frac{1}{2}\\left\\{\\rho(b,p_{2}|a,p_{1})-\\rho(b,p_{2}|b,p_{1})+\\rho(a,p_{2}|b,p_{1})-\\rho(a,p_{2}|a,p_{1})\\right\\},</td>\n<td></td>\n<td>(1)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\rho\\hspace{0pt}{(b,\\left. p_{2} \\middle| {a,p_{1}} \\right.)}`$ is the conditional probability of observing a particle of type $`b`$ in bin $`p_{2}`$ given the existence of a particle of type $`a`$ in bin $`p_{1}`$. The label $`a`$ might refer to all negative kaons with $`b`$ referring to all positive kaons, or $`a`$ might refer to all hadrons with a strange quark while $`b`$ refers to all hadrons with an antistrange quark. The conditional probability $`\\rho\\hspace{0pt}{(b,\\left. p_{2} \\middle| {a,p_{1}} \\right.)}`$ is generated by first counting the number $`N\\hspace{0pt}{(b,\\left. p_{2} \\middle| {a,p_{1}} \\right.)}`$ of pairs that satisfy both criteria and dividing by the number $`N\\hspace{0pt}{(a,p_{1})}`$ of particles of type $`a`$ that satisfy the first criteria.\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>ρ​(b,p2,a,p1)=N​(b,p2|a,p1)N​(a,p1).𝜌𝑏subscript𝑝2𝑎subscript𝑝1𝑁𝑏conditionalsubscript𝑝2𝑎subscript𝑝1𝑁𝑎subscript𝑝1\\rho(b,p_{2},a,p_{1})=\\frac{N(b,p_{2}|a,p_{1})}{N(a,p_{1})}.</td>\n<td></td>\n<td>(2)</td>\n</tr>\n</tbody>\n</table>\n\nBoth sums run over all events, though pairs only involve particles from the same event.\n\nAn example of binning might be that $`p_{1}`$ refers to a measurement anywhere in the detector, while $`p_{2}`$ refers to the relative rapidity $`|{y_{b} - y_{a}}|`$. Then the balance function would be a function of $`\\Delta\\hspace{0pt}y`$ only, and would represent the probability that the balancing charges were separated by $`\\Delta\\hspace{0pt}y`$ (in our formalism we include a division by $`\\Delta\\hspace{0pt}y`$ to express $`B\\hspace{0pt}{({\\Delta\\hspace{0pt}y})}`$ as a density).\n\nThe balance function is normalized to unity if $`a/b`$ refer to all particles with a positive/negative globally conserved charge.\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>∑p2B​(p2|p1)=12​{Mb−(Mb−1)+Ma−(Ma−1)}=1,subscriptsubscript𝑝2𝐵conditionalsubscript𝑝2subscript𝑝112subscript𝑀𝑏subscript𝑀𝑏1subscript𝑀𝑎subscript𝑀𝑎11\\sum_{p_{2}}B(p_{2}|p_{1})=\\frac{1}{2}\\left\\{M_{b}-(M_{b}-1)+M_{a}-(M_{a}-1)\\right\\}=1,</td>\n<td></td>\n<td>(3)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`M_{a}`$ and $`M_{b}`$ are the average multiplicities of the $`a`$ and $`b`$ particles. The normalization derives from the fact that for every extra positive charge there exists one extra negative charge. If the acceptance measures only a fraction of the charge, e.g. only kaons are measured and the strangeness in hyperons is excluded, the balance function would sum to that fraction. Balance functions can exploit any conserved charge: electric charge, strangeness, baryon number or charm. The first two terms in Eq. ([1](#E1 \"In Clocking hadronization in relativistic heavy ion collisions with balance functions\")) constitute the balance functions defined in several analyses of $`{e^{+}\\hspace{0pt}e^{-}}\\rightarrow`$ jets. By adding the last two terms the normalization properties are retained even for the case where there is a non-zero net charge, $`{M_{a} - M_{b}} \\neq 0`$.\n\nIf many charges are present in the event, the balance function represents the subtraction of two large numbers. However, large multiplicities also imply a large number of pairs from which to calculate the balance function. Since the number of uncorrelated pairs rises as the square of the multiplicity $`M`$, the statistical error in calculating the numerators of the conditional probabilities, which rises as the square root of the number of pairs, increases linearly with $`M`$. Since the denominator also rises linearly with $`M`$, the statistical error in the balance function is independent of multiplicity and is principally determined by the number of events:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>σB∝1Nev.proportional-tosubscript𝜎𝐵1subscript𝑁ev\\sigma_{B}\\propto\\\\ \\frac{1}{\\sqrt{N_{\\rm ev}}}.</td>\n<td></td>\n<td>(4)</td>\n</tr>\n</tbody>\n</table>\n\nThus, the baryon-antibaryon balance function which might involve a few dozen antibaryons would require the same number of events as the electric-charge balance function which might be constructed from a thousand particles. Typically, $`10^{5}`$ events are required to determine a balance function with statistical fluctuations at the level of $`10^{- 2}`$.\n\nBalance functions probe the dynamics of charge-anticharge pairs by quantifying the degree to which the charges are correlated in momentum space given the constraint of being created at the same space-time point in a system exhibiting strong position-momentum correlations such as a relativistic collision where source velocities might span several units of rapidity. In a globally equilibrated system with no collective flow, there would exist no correlation between the balancing charges, and the numerator in Eq. ([2](#E2 \"In Clocking hadronization in relativistic heavy ion collisions with balance functions\")) would factorize. The width of the balance function would then correspond to the extent of single-particle emission in momentum space.\n\nTo illustrate the way in which balance functions quantify the charge-anticharge correlations, we consider a Bjorken boost-invariant parameterization \\[[13](#bib.bib13)\\] of a source expanding along the $`z`$ axis with a collective velocity proportional to the position, $`v_{coll} = {z/t}`$. All intrinsic variables, such as density or temperature, depend only on the proper time $`\\tau = {({t^{2} - z^{2}})}^{1/2}`$. We first consider only direct production of hadrons, as the possibility of hadrons coalescing from quarks is discussed later in the paper. Particles and antiparticles of mass $`m`$ are generated in pairs at the same point in space-time following a local thermal distribution, and the relative rapidities are used to generate balance functions. The characteristic width of the balance function is determined by the ratio of the temperature to the mass. Non-relativistically, $`\\sigma_{y} = {({{2\\hspace{0pt}T}/m})}^{1/2}`$, and heavier particles are characterized by narrower balance functions. For particles with masses much less than the temperature, the balance functions become independent of the temperature.\n\nFigure [1](#F1 \"FIG. 1 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") displays balance functions assuming a Bjorken parameterization of an expanding pion gas and an expanding proton gas, for two temperatures, 225 MeV and 165 MeV. Clearly, the balance functions of the more massive particles are sensitive to the temperature. This suggests that the strangeness and baryon balance functions should provide more insight than the electric-charge balance function which would be largely dominated by pions.\n\nBalance functions in heavy ion collisions should be compared to those from $`p\\hspace{0pt}p`$ collisions at the same $`\\sqrt{s}`$ where hadronization is nearly instantaneous. Charged-pion balances measured in $`e^{+}\\hspace{0pt}e^{-}`$ collisions as a function of the rapidity defined along the jet axis have been reasonably explained by the string hadronization dynamics of the Lund model \\[[14](#bib.bib14)\\], e.g. as implemented in PYTHIA \\[[15](#bib.bib15)\\]. Thermally generated balance functions are compared to predictions of PYTHIA for $`p\\hspace{0pt}p`$ collisions at $`\\sqrt{s} = 200`$ GeV in Fig. [1](#F1 \"FIG. 1 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\"). The PYTHIA balance functions tend to be broader than those that are thermally generated, especially for the more massive protons and kaons. Assuming that experimental balance functions in $`p\\hspace{0pt}p`$ collisions would be well described by similar string dynamics, Fig. [1](#F1 \"FIG. 1 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") suggests that narrower balance functions might indeed point to thermal production at a lower temperature and thus at later times in the evolution of the heavy ion reaction.\n\nRescattering and annihilation should also affect balance functions. Rescattering may be qualitatively understood by considering the diffusion equation in Bjorken coordinates $`\\tau`$ and $`\\eta \\equiv {\\tanh^{- 1}{({z/t})}}`$, where $`\\eta`$ plays the role of the position in the $`z`$ direction and also equals the collective rapidity of the local matter. Rather than considering the diffusion constant $`D = {v_{t}/{({n\\hspace{0pt}\\sigma})}}`$ as a constant, it is more physical to incorporate the fact that the density $`n`$ falls inversely with $`\\tau`$ and to consider $`\\beta \\equiv {v_{t}/{({n\\hspace{0pt}\\tau\\hspace{0pt}\\sigma})}}`$ as a constant where $`v_{t}`$ is the thermal velocity and $`\\sigma`$ is a characteristic cross section. The diffusion equation then becomes\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>∂∂τ​f​(τ,η)=−βτ​∂2∂η2​f​(τ,η).𝜏𝑓𝜏𝜂𝛽𝜏superscript2superscript𝜂2𝑓𝜏𝜂\\frac{\\partial}{\\partial\\tau}f(\\tau,\\eta)=-\\frac{\\beta}{\\tau}\\frac{\\partial^{2}}{\\partial\\eta^{2}}f(\\tau,\\eta).</td>\n<td></td>\n<td>(5)</td>\n</tr>\n</tbody>\n</table>\n\nHere, $`f`$ is the probability of observing a particle at position $`\\eta`$ at time $`\\tau`$. With the initial condition of $`\\eta = 0`$ at $`\\tau_{0}`$, the solution to the diffusion equation is a Gaussian with variance $`\\sigma_{\\eta}^{2} = {2\\hspace{0pt}\\beta\\hspace{0pt}{\\ln{({\\tau/\\tau_{0}})}}}`$. This illustrates that collisions broaden the balance function by diffusing the charge in the effective spatial coordinate $`\\eta`$. However, in the limit of zero mean free path, the diffusion constant tends to zero and the particles do not then diffuse.\n\nThe overall width of the balance function in relative rapidity is a combination of the thermal rapidity spread $`\\sigma_{therm}`$ and the effect of diffusion in $`\\eta`$ of both particles:\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>σy2=σtherm2+4​β​ln⁡(τ/τ0).superscriptsubscript𝜎𝑦2superscriptsubscript𝜎therm24𝛽𝜏subscript𝜏0\\sigma_{y}^{2}=\\sigma_{\\rm therm}^{2}+4\\beta\\ln(\\tau/\\tau_{0}).</td>\n<td></td>\n<td>(6)</td>\n</tr>\n</tbody>\n</table>\n\nDue to cooling, the width $`\\sigma_{therm}`$ falls with time which provides a competition between diffusion which stretches the balance function, and cooling which narrows it. If the production occurs at early times, then $`\\ln{({\\tau/\\tau_{0}})}`$ is large and the effect of collisions is to significantly broaden the balance function.\n\nSome hadrons will contain coalesced quarks that were created at early times. The thermal contribution to $`\\sigma_{y}`$ described in Eq. ([6](#E6 \"In Clocking hadronization in relativistic heavy ion collisions with balance functions\")) should be unaffected by the past history of the constituent quarks. However, the diffusive contribution might significantly depend on the fact that the charge moved as a free quark rather than as a hadron during it’s early history. Balance functions constructed from hadrons can thus provide meaningful information regarding the creation and mobility of the constituent quarks.\n\nTo quantitatively illustrate the effect of rescattering, we model a pair of particles produced at an initial proper time $`\\tau_{0}`$ that collide $`N_{coll}`$ times before disassociating at a final time $`\\tau_{f}`$. Each collision is assumed to completely reorient the particle with the local collective velocity. The collision times are chosen randomly such that the number of collisions as a function of $`\\ln{(\\tau)}`$ is uniform. The temperature is chosen to vary linearly with the proper time, cooling from 225 MeV at $`\\tau = 1`$ fm/c to 120 MeV at $`\\tau = 15`$ fm/c. Figure [2](#F2 \"FIG. 2 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") shows the $`K_{+}\\hspace{0pt}K_{-}`$ balance function with $`N_{coll} = 0`$ and $`N_{coll} = 10`$ assuming kaons are created at $`\\tau = 1`$ fm/c and cease to collide at $`\\tau_{f} = 15`$fm/c. In this case collisions clearly broaden the balance function.\n\nAnnihilations should also broaden the balance function. Annihilation forms new correlated pairs with the surviving partners of the annihilated particles, which tend to be less correlated than the original pairs. Annihilation combined with an equal amount of creation does not affect the balance function since the relative rapidities of formed and annihilated pairs should be identical. Figure [2](#F2 \"FIG. 2 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") illustrates the effects of annihilation by considering the same case described above, but with the additional assumption that half the particles disappear due to annihilation. In hadronic models of heavy ion collisions, the number of both antibaryons and strange particles tend to decrease with time due to cooling, which should result in broadened balance functions.\n\nFigure [3](#F3 \"FIG. 3 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") displays the effect of collisions on balance functions for pions, kaons and protons, by considering the mean relative rapidity as a function of the number of collisions. For production at early times when the collective velocity gradient is high ($`{{{d\\hspace{0pt}v_{coll}}/d}\\hspace{0pt}z} = {1/\\tau}`$), collisions broaden the balance function. However, for very large numbers of collisions, the charge does not diffuse and the balance functions are narrowed due to the cooling. One would expect particles to undergo 10-20 collisions if created at $`\\tau = 1`$ fm/c, although the effective number of completely randomizing collisions might be closer to a half dozen. If created at $`\\tau = 9`$ fm/c when the temperature is 165 MeV, the effective number of completely randomizing collisions might be two or three. Figure [3](#F3 \"FIG. 3 ‣ ACKNOWLEDGMENTS ‣ Clocking hadronization in relativistic heavy ion collisions with balance functions\") suggests that the signal for late-stage quark production is significantly magnified by rescattering. Due to collisions, even charged-pion balance functions become strongly sensitive to the creation time.\n\nThe simple calculations presented here sidestep two issues: correlations from decays such as $`\\phi\\rightarrow{K^{+}\\hspace{0pt}K^{-}}`$, and experimental acceptance problems. Both problems can be addressed by modeling constrained by the multitude of other observables measured in a heavy ion collision. Although some open questions remain, it seems clear that the canonical picture of a heavy-ion reaction,quark-gluon plasma formation followed by late-stage hadronization, should have a clear signature in the balance functions. Compared to $`p\\hspace{0pt}p`$ collisions, one expects the peak in the balance function in nucleus-nucleus collisions to be narrower near $`{\\Delta\\hspace{0pt}y} = 0`$ due to the contribution of late-stage production of quark pairs, while the tails of balance function should become broader reflecting the extra diffusion of charge in the early stages of the collision. Finally, we remark that we have barely explored the possibilities of balance functions. The rich nature of the binnings $`(\\left. p_{2} \\middle| p_{1} \\right.)`$ should provide a powerful means for resolving many of the issue regarding creation and diffusion of quarks and hadrons in relativistic heavy ion collisions.\n\n## ACKNOWLEDGMENTS\n\nWe are grateful to T. Sjöstrand for providing valuable references. This work was supported by the National Science Foundation, Grants No. PHY-00-70818 and PHY-96-0527.\n\n## REFERENCES\n\n- \\[1\\]\n\n  T. Alber et al. Phys. Rev. Lett. 74 (1995) 1303.\n\n- \\[2\\]\n\n  M .Aggarwal et al. Nucl. Phys. A610 (1996) 200c.\n\n- \\[3\\]\n\n  J. Harris and B. Müller, Ann. Rev. Nucl. Part. Sci. 46 (1996) 71.\n  S.A. Bass, M. Gyulassy, H. Stöcker and W. Greiner, J. Phys. G25 (1999) R1.\n\n- \\[4\\] E. Andersen et al. (WA97 collaboration),\n\n  Phys. Lett B433 (1998) 209.\n  R. Lietava et al. (WA97 collaboration),\n\n  Journal of Physics G25 (1999) 181.\n  R. Caliandro et al. (WA97 collaboration),\n\n  Journal of Physics G25 (1999) 171.\n  S. Margetis et al. (NA49 collaboration),\n\n  Journal of Physics G25 (1999) 189.\n  F. Gabler et al. (NA49 collaboration),\n\n  Journal of Physics G25 (1999) 199.\n  D. Evans et al. (WA85 and WA94 collaborations), Journal of Physics G25 (1999) 209.\n\n- \\[5\\] T. S. Biro, H. B. Nielsen, J. Knoll,\n\n  Nucl. Phys. B245 (1984) 449.\n  J. Knoll, Z. Phys. C38 (1988) 187.\n\n- \\[6\\] H. Sorge, M. Berenguer, H. Stöcker, W. Greiner,\n\n  Phys. Lett. B289 (1992) 6.\n\n- \\[7\\]\n\n  S. E. Vance and M. Gyulassy, Phys. Rev. Lett.  83 (1999) 1735.\n\n- \\[8\\] J. Rafelski, B. Müller,\n\n  Phys. Rev. Lett. 48 (1982) 1066; Erratum-ibid.56 (1986) 2334.\n\n- \\[9\\] P. Koch, B. Müller, J. Rafelski,\n\n  Phys. Rep. 142 (1986) 167.\n\n- \\[10\\] S. Jeon and V. Koch, LANL report hep-ph/0003168;\n  M. Asakawa, U. Heinz and B. Müller, LANL report hep-ph/0003169.\n\n- \\[11\\]\n\n  D. Drijard et al., Nucl. Phys. B155 (1979) 269.\n  D. Drijard et al., Nucl. Phys. B166 (1980) 233.\n  I.V. Ajinenko et al., Z. Phys. C43 (1989) 37.\n\n- \\[12\\]\n\n  R. Brandelik et al., Phys. Lett. B100 (1981) 357.\n  M. Althoff et al., Z. Phys. C17 (1983) 5.\n  H. Aihara et al., Phys. Rev. Lett. 53 (1984) 2199.\n  H. Aihara et al., Phys. Rev. Lett. 57 (1986) 3140.\n  P.D. Acton et al., Phys. Lett. B305 (1993) 415.\n\n- \\[13\\]\n\n  J.D. Bjorken, Phys. Rev. D27 (1983) 140.\n\n- \\[14\\]\n\n  B. Anderson et al. Nucl. Phys. B281 (1987) 289.\n\n- \\[15\\]\n\n  H.-U. Bengtsson and T. Sjöstrand, Comp. Phys. Com. 46 (1987) 43.\n\n[Refer to caption](/html/nucl-th/0005044/assets/x1.png)\n\nFIG. 1.: Balance functions as predicted in a simple Bjorken thermal model are shown for two temperatures, 225 MeV and 165 MeV. Since heavier particles from cooler systems have smaller thermal velocities, they are more strongly correlated in rapidity and result in narrower balance functions. Also shown are balance functions as predicted by PYTHIA where the shape of the balance function is largely determined by string phenomenology.\n\n[Refer to caption](/html/nucl-th/0005044/assets/x2.png)\n\nFIG. 2.: Kaon balance functions are shown assuming an initial local temperature of 225 MeV and a production time of 1 fm/c. The balance function is broadened by the inclusion of randomizing collisions and annihilation.\n\n[Refer to caption](/html/nucl-th/0005044/assets/x3.png)\n\nFIG. 3.: The mean width of the balance function displayed as a function of the number of collisions, both for the case where particles are created early ($`\\tau = 1`$ fm/c, $`T = 225`$ MeV) and late ($`\\tau = 9`$ fm/c, $`T = 165`$ MeV).<|endoftext|>"
    },
    "test": {
      "total_tokens": 111290086,
      "example": "# \\[nucl-th/0110078\\] Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\n\n# Microscopic description of Coulomb and nuclear excitation of multiphonon states in <sup>40</sup>Ca + <sup>40</sup>Ca collisions\n\nM.V. Andrés\n\nDepartamento de Física Atómica, Molecular y Nuclear, Universidad de Sevilla,\nApdo 1065, E-41080 Sevilla, Spain\n\nF. Catara\n\nE. G. Lanza\n\nDipartimento di Fisica Universitá di Catania and INFN, Sezione di Catania,\nI-95129 Catania, Italy\n\nPh. Chomaz\n\nGANIL, B.P. 5027, F-14021 Caen Cedex, France\n\nM. Fallot\n\nJ. A. Scarpaci\n\nInstitut de Physique Nucléaire, IN2P3-CNRS, F-91406 Orsay Cedex, France\n\n###### Abstract\n\nWe calculate the inelastic scattering cross sections to populate one- and two-phonon states in heavy ion collisions with both Coulomb and nuclear excitations. Starting from a microscopic approach based on RPA, we go beyond it in order to treat anharmonicities and non-linear terms in the exciting field. These anharmonicities and non-linearities are shown to have important effects on the cross sections both in the low energy part of the spectrum and in the energy region of the Double Giant Quadrupole Resonance. By properly introducing an optical potential the inelastic cross section is calculated semiclassically by integrating the excitation probability over all impact parameters. A satisfactory agreement with the experimental results is obtained.\n\nPACS : 21.60.Ev; 21.60.Jz; 24.30.Cz; 25.55 Ci; 25.70.De\n\nKeywords: Coulomb and nuclear excitation, multiphonon states, anharmonicity and non-linearity in RPA, heavy ion collisions.\n\n## I Introduction\n\nAll theoretical approaches used to calculate the cross section for the multiple excitation of Giant Resonances (GR) in heavy ion collisions are based on a semiclassical description of the process \\[[1](#bib.bib1)\\], where the excitation of one reaction partner is assumed to be due to the action of the mean field of the other and is treated quantum mechanically while the relative motion is determined classically.\n\nFor each eigenstate $`\\alpha`$ of the internal hamiltonian of one nucleus, one can calculate its excitation probability $`P_{\\alpha}\\hspace{0pt}{(b)}`$ by perturbation theory or by solving a system of coupled equations. This is done by integrating the equations of motion along the classical relative motion trajectory corresponding to the impact parameter $`b`$. The total excitation cross section $`\\sigma_{\\alpha}`$ is then evaluated by integrating the probability over all the impact parameters starting from a minimum one $`b_{m\\hspace{0pt}i\\hspace{0pt}n}`$. In Coulomb excitation studies the value of the latter is chosen according to a systematics \\[[2](#bib.bib2)\\] following some prescription based on the condition that the contributions from the nuclear field should be eliminated. Even so, however, some ambiguities are present since the calculated cross sections can vary appreciably for small variations of $`b_{m\\hspace{0pt}i\\hspace{0pt}n}`$. Moreover, when the bombarding energy is not very high and the two nuclei are not very heavy, the nuclear excitation is the dominant process. In this situation one cannot apply that procedure because in principle one should add more internal trajectory for the determination of $`\\sigma_{\\alpha}`$. On the other hand, the trajectories corresponding to small impact parameters would not contribute too much to the inelastic cross section if the absorption due to all other channels is taken into account. This can be done by introducing an optical potential as was already done in a qualitative way in ref. \\[[3](#bib.bib3)\\].\n\nIn this paper we present calculations for the excitation cross section of one- and two-phonon states in the <sup>40</sup>Ca + <sup>40</sup>Ca reaction at 50 MeV/u for which experimental results exist \\[[4](#bib.bib4)\\]. The calculations are done within the extended RPA model described in our previous works \\[[5](#bib.bib5), [6](#bib.bib6), [7](#bib.bib7)\\] where we have introduced anharmonicities in the internal hamiltonian and non-linear terms in the external field. This model has been successful in the description of the excitation of the double giant resonances, reducing the discrepancy between the measured cross section and the standard theoretical estimate. Here the model is extended by introducing an optical potential in order to avoid the uncertainty on the integration over the impact parameter. Since the optical potential takes into account the absorption due to all channels, we have introduced a procedure in order to avoid to double count the effects of the channels explicitly included in our calculations. In the next section we will recall briefly our model and extensively describe its improvements. In section III we present our results and the quantitative comparison with the experimental findings. We then draw our conclusions and discuss some perspectives.\n\n## II The model\n\nThe best microscopic theory to describe collective excitations in nuclei is the RPA whose hamiltonian can be written as\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>HR​P​A=∑νEν​Qν†​Qνsubscript𝐻𝑅𝑃𝐴subscript𝜈subscript𝐸𝜈superscriptsubscript𝑄𝜈†subscript𝑄𝜈H_{RPA}=\\sum_{\\nu}E_{\\nu}Q_{\\nu}^{\\dagger}Q_{\\nu}</td>\n<td></td>\n<td>(1)</td>\n</tr>\n</tbody>\n</table>\n\nwhere the phonon creation operator is\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Qν†=∑p,h(Xp​hν​Bp​h†−Yp​hν​Bp​h).subscriptsuperscript𝑄†𝜈subscript𝑝ℎsubscriptsuperscript𝑋𝜈𝑝ℎsubscriptsuperscript𝐵†𝑝ℎsuperscriptsubscript𝑌𝑝ℎ𝜈subscript𝐵𝑝ℎQ^{\\dagger}_{\\nu}=\\sum_{p,h}(X^{\\nu}_{ph}B^{\\dagger}_{ph}-Y_{ph}^{\\nu}B_{ph}).</td>\n<td></td>\n<td>(2)</td>\n</tr>\n</tbody>\n</table>\n\nThe bosonic operators $`B`$ are the lowest order terms of the bosonic expansion of the fermionic operators \\[[8](#bib.bib8)\\]\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>ap†​ah→Bp​h†+(1−2)​∑p′​h′Bp′​h′†​Bp′​h†​Bp​h′+…→subscriptsuperscript𝑎†𝑝subscript𝑎ℎsuperscriptsubscript𝐵𝑝ℎ†12subscriptsuperscript𝑝′superscriptℎ′subscriptsuperscript𝐵†superscript𝑝′superscriptℎ′subscriptsuperscript𝐵†superscript𝑝′ℎsubscript𝐵𝑝superscriptℎ′…a^{\\dagger}_{p}a_{h}\\rightarrow B_{ph}^{\\dagger}+(1-\\sqrt{2})\\sum_{p^{\\prime}h^{\\prime}}B^{\\dagger}_{p^{\\prime}h^{\\prime}}B^{\\dagger}_{p^{\\prime}h}B_{ph^{\\prime}}+~{}...</td>\n<td></td>\n<td>(3)</td>\n</tr>\n</tbody>\n</table>\n\nHere, the index p (h) labels the particle (hole) states with respect to the Hartree-Fock ground state. The other terms after the first one correct for the Pauli principle.\n\nIn the harmonic RPA hamiltonian ([1](#E1 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) only the $`V_{{p\\hspace{0pt}h},{p'\\hspace{0pt}h'}}`$ and $`V_{{p\\hspace{0pt}p'},{h\\hspace{0pt}h'}}`$ terms of the residual interaction are taken into account. If we consider also the other terms $`V_{{p\\hspace{0pt}p'},{p^{\\operatorname{\\prime\\prime}}\\hspace{0pt}p^{\\operatorname{\\prime\\prime\\prime}}}}`$, $`V_{{h\\hspace{0pt}h'},{h^{\\operatorname{\\prime\\prime}}\\hspace{0pt}h^{\\operatorname{\\prime\\prime\\prime}}}}`$, $`V_{{p\\hspace{0pt}p'},{p^{\\operatorname{\\prime\\prime}}\\hspace{0pt}h}}`$ and $`V_{{p\\hspace{0pt}h},h',h^{\\operatorname{\\prime\\prime}}}`$ and introducing the mappings \\[[8](#bib.bib8)\\]\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>ap†​ap′⟶(ap†​ap′)B=∑hBp​h†​Bp′​hah​ah′†⟶(ah​ah′†)B=∑pBp​h†​Bp​h′⟶superscriptsubscript𝑎𝑝†subscript𝑎superscript𝑝′subscriptsuperscriptsubscript𝑎𝑝†subscript𝑎superscript𝑝′𝐵subscriptℎsuperscriptsubscript𝐵𝑝ℎ†subscript𝐵superscript𝑝′ℎ⟶subscript𝑎ℎsuperscriptsubscript𝑎superscriptℎ′†subscriptsubscript𝑎ℎsuperscriptsubscript𝑎superscriptℎ′†𝐵subscript𝑝superscriptsubscript𝐵𝑝ℎ†subscript𝐵𝑝superscriptℎ′\\begin{array}[]{l}a_{p}^{\\dagger}a_{p^{\\prime}}\\longrightarrow(a_{p}^{\\dagger}a_{p^{\\prime}})_{B}=\\sum_{h}B_{ph}^{\\dagger}B_{p^{\\prime}h}\\\\ a_{h}a_{h^{\\prime}}^{\\dagger}\\longrightarrow(a_{h}a_{h^{\\prime}}^{\\dagger})_{B}=\\sum_{p}B_{ph}^{\\dagger}B_{ph^{\\prime}}\\end{array}</td>\n<td></td>\n<td>(4)</td>\n</tr>\n</tbody>\n</table>\n\none ends up with a hamiltonian containing cubic, quartic, etc, terms in the phonon creation and annihilation operators. In the space spanned by one- and two-phonon states the bosonic hamiltonian is\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>H=∑νEν​Qν†​Qν+[∑ν1​ν2​ν3Vν1​ν2​ν321​Qν1†​Qν2†​Qν3+∑ν1​ν2​ν3​ν4Vν1​ν2,ν3​ν422​Qν1†​Qν2†​Qν3​Qν4]+h.c.formulae-sequence𝐻subscript𝜈subscript𝐸𝜈subscriptsuperscript𝑄†𝜈subscript𝑄𝜈delimited-[]subscriptsubscript𝜈1subscript𝜈2subscript𝜈3subscriptsuperscript𝑉21subscript𝜈1subscript𝜈2subscript𝜈3subscriptsuperscript𝑄†subscript𝜈1subscriptsuperscript𝑄†subscript𝜈2subscript𝑄subscript𝜈3subscriptsubscript𝜈1subscript𝜈2subscript𝜈3subscript𝜈4subscriptsuperscript𝑉22subscript𝜈1subscript𝜈2subscript𝜈3subscript𝜈4subscriptsuperscript𝑄†subscript𝜈1subscriptsuperscript𝑄†subscript𝜈2subscript𝑄subscript𝜈3subscript𝑄subscript𝜈4ℎ𝑐H=\\sum_{\\nu}E_{\\nu}Q^{\\dagger}_{\\nu}Q_{\\nu}+[\\sum_{\\nu_{1}\\nu_{2}\\nu_{3}}V^{21}_{\\nu_{1}\\nu_{2}\\nu_{3}}Q^{\\dagger}_{\\nu_{1}}Q^{\\dagger}_{\\nu_{2}}Q_{\\nu_{3}}+\\sum_{\\nu_{1}\\nu_{2}\\nu_{3}\\nu_{4}}V^{22}_{\\nu_{1}\\nu_{2},\\nu_{3}\\nu_{4}}Q^{\\dagger}_{\\nu_{1}}Q^{\\dagger}_{\\nu_{2}}Q_{\\nu_{3}}Q_{\\nu_{4}}]+h.c.</td>\n<td></td>\n<td>(5)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`V^{21}`$ ($`V^{22}`$) are the matrix elements connecting one- with two-phonon states (two- with two-phonon states). The eigenstates of the hamiltonian ([5](#E5 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) are\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>|Φα>=∑νcνα​|ν>+∑ν1​ν2dν1​ν2α​|ν1​ν2>ketsubscriptΦ𝛼subscript𝜈subscriptsuperscript𝑐𝛼𝜈ket𝜈subscriptsubscript𝜈1subscript𝜈2subscriptsuperscript𝑑𝛼subscript𝜈1subscript𝜈2ketsubscript𝜈1subscript𝜈2|\\Phi_{\\alpha}>=\\sum_{\\nu}c^{\\alpha}_{\\nu}|\\nu>+\\sum_{\\nu_{1}\\nu_{2}}d^{\\alpha}_{\\nu_{1}\\nu_{2}}|\\nu_{1}\\nu_{2}></td>\n<td></td>\n<td>(6)</td>\n</tr>\n</tbody>\n</table>\n\nand the corresponding eigenvalues do not form a harmonic spectrum.\n\nIn the semiclassical models of grazing ion-ion collisions the excitation of one of the two nuclei is due to the mean field of the other. Since the mean field is a one-body operator, the excitation operator has the following form\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>W​(t)=∑α,β<α|UB​(𝐑​(t))|β>​aα†​aβ𝑊𝑡subscript𝛼𝛽quantum-operator-product𝛼subscript𝑈𝐵𝐑𝑡𝛽superscriptsubscript𝑎𝛼†subscript𝑎𝛽W(t)=\\sum_{\\alpha,\\beta}<\\alpha|U_{B}({\\bf R}(t))|\\beta>a_{\\alpha}^{\\dagger}a_{\\beta}</td>\n<td></td>\n<td>(7)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`U_{B}`$ is the mean field of the other nucleus. The time dependence comes in through the relative distance R between the two nuclei. In the standard approach $`W\\hspace{0pt}{(t)}`$ is linear in the phonon operators because only the ph terms of eq. ([7](#E7 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) are considered and the lowest order boson expansion is taken. If we include also the pp and hh terms, their mapping (eq. [4](#E4 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) leads to a quadratic form in $`Q`$\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>W=W00+∑νWν10​Qν†+h.c.+∑ν​ν′Wν​ν′11​Qν†​Qν′+∑ν​ν′Wν​ν′20​Qν†​Qν′†+h.c.formulae-sequence𝑊superscript𝑊00subscript𝜈subscriptsuperscript𝑊10𝜈subscriptsuperscript𝑄†𝜈ℎ𝑐subscript𝜈superscript𝜈′subscriptsuperscript𝑊11𝜈superscript𝜈′subscriptsuperscript𝑄†𝜈subscript𝑄superscript𝜈′subscript𝜈superscript𝜈′subscriptsuperscript𝑊20𝜈superscript𝜈′subscriptsuperscript𝑄†𝜈subscriptsuperscript𝑄†superscript𝜈′ℎ𝑐W=W^{00}+\\sum_{\\nu}W^{10}_{\\nu}Q^{\\dagger}_{\\nu}+h.c.+\\sum_{\\nu\\nu^{\\prime}}W^{11}_{\\nu\\nu^{\\prime}}Q^{\\dagger}_{\\nu}Q_{\\nu^{\\prime}}+\\sum_{\\nu\\nu^{\\prime}}W^{20}_{\\nu\\nu^{\\prime}}Q^{\\dagger}_{\\nu}Q^{\\dagger}_{\\nu^{\\prime}}+h.c.</td>\n<td></td>\n<td>(8)</td>\n</tr>\n</tbody>\n</table>\n\nThe first term in eq. ([8](#E8 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) represents the interaction of the two colliding nuclei in their ground state, in the present case it has also an imaginary part which describes the absorption due to the non elastic channels. The $`W^{10}`$ part connects states differing by one phonon, the $`W^{11}`$ term couples excited states with the same number of phonons, while $`W^{20}`$ allows transitions from the ground state to two-phonon states. All the form factors $`W`$ are calculated by double-folding the Coulomb and nuclear nucleon-nucleon interactions with the Hartree-Fock ground state density of the projectile and with the ground state density or the transition densities of the considered excited states of the target.\n\nIn the space of the ground state and the $`|\\Phi_{\\alpha}>`$ states we can cast the Schrödinger equation into a set of linear differential coupled equations for the time dependent amplitude probabilities $`A_{\\alpha}\\hspace{0pt}{(t)}`$. Then the cross section is calculated non-perturbatively as described in ref. \\[[6](#bib.bib6)\\] where we integrated the probability of exciting the state $`|\\Phi_{\\alpha}>`$ starting from a minimum impact parameter. In the calculation presented here we integrate over all impact parameters since we have introduced in $`W^{00}`$ the optical potential which, in an effective way, takes care of the most inner trajectories.\n\nThe imaginary part $`W_{i\\hspace{0pt}m}`$ of the optical potential is usually determined by fitting the experimental elastic cross section. This potential describes the absorption due to all non-elastic channels. Therefore, it cannot be inserted directly in $`W^{00}`$ (eq. ([7](#E7 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\"))) since the absorption due to the inelastic channels explicitly included in the coupled equations would be counted twice.\n\nLet us first discuss how to solve this problem when no anharmonicities are present and therefore, the states $`|\\Phi_{\\alpha}>`$ are pure multiphonon states. In such a case one can solve the Schrödinger equation in a semiclassical approach by integrating it along each classical relative motion trajectory. The state of the system $`|\\Psi>`$ is a coherent state and the probability to excite $`m_{\\nu}`$ times a phonon $`\\nu`$ is \\[[9](#bib.bib9)\\]\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pν,mν0=(Nν)mνmν!​Pg.s.0superscriptsubscript𝑃𝜈subscript𝑚𝜈0superscriptsubscript𝑁𝜈subscript𝑚𝜈subscript𝑚𝜈superscriptsubscript𝑃formulae-sequence𝑔𝑠0P_{\\nu,m_{\\nu}}^{0}={(N_{\\nu})^{m_{\\nu}}\\over m_{\\nu}!}\\,P_{g.s.}^{0}</td>\n<td></td>\n<td>(9)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`N_{\\nu}`$ is the average number of $`\\nu`$-phonons in $`|\\Psi>`$. In the above equation, as well as in the following discussion, the dependence on the impact parameter $`b`$ is understood. The superscript “$`0`$” refers to the fact that only the absorption due to the multiple excitation of phonons is taken into account. In such a case\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pg.s.0=e−𝒩superscriptsubscript𝑃formulae-sequence𝑔𝑠0superscript𝑒𝒩P_{g.s.}^{0}=e^{-\\cal N}</td>\n<td></td>\n<td>(10)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`\\mathcal{N} = {\\sum_{\\nu}N_{\\nu}}`$. We stress that the same survival probability of the ground state appears as a factor in all the probabilities in eq. ([9](#E9 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")).\n\nThe survival probability associated with the imaginary optical potential $`W_{i\\hspace{0pt}m}`$ is calculated as\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pg.s.W=exp⁡{2ℏ​c​∫−∞+∞Wi​m​(t)​𝑑t}superscriptsubscript𝑃formulae-sequence𝑔𝑠𝑊2Planck-constant-over-2-pi𝑐superscriptsubscriptsubscript𝑊𝑖𝑚𝑡differential-d𝑡P_{g.s.}^{W}=\\exp{\\{{2\\over{\\hbar c}}\\int_{-\\infty}^{+\\infty}W_{im}(t)\\,dt\\}}</td>\n<td></td>\n<td>(11)</td>\n</tr>\n</tbody>\n</table>\n\nwhere the integral is again done along a classical trajectory. The de-population of the ground state due only to the neglected channels can be in principle calculated as in eq. ([11](#E11 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) but with an auxiliary imaginary potential $`\\overline{W}`$ which does not contain the absorption due to the adopted ones. Then\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pg.s.W=Pg.s.0×Pg.s.W¯superscriptsubscript𝑃formulae-sequence𝑔𝑠𝑊superscriptsubscript𝑃formulae-sequence𝑔𝑠0superscriptsubscript𝑃formulae-sequence𝑔𝑠¯𝑊P_{g.s.}^{W}=P_{g.s.}^{0}\\times P_{g.s.}^{\\bar{W}}</td>\n<td></td>\n<td>(12)</td>\n</tr>\n</tbody>\n</table>\n\nand\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pν,mν=Pν,mν0×Pg.s.W¯.subscript𝑃𝜈subscript𝑚𝜈superscriptsubscript𝑃𝜈subscript𝑚𝜈0superscriptsubscript𝑃formulae-sequence𝑔𝑠¯𝑊P_{\\nu,m_{\\nu}}=P_{\\nu,m_{\\nu}}^{0}\\times P_{g.s.}^{\\bar{W}}\\,.</td>\n<td></td>\n<td>(13)</td>\n</tr>\n</tbody>\n</table>\n\nWhen anharmonicities are taken into account, the state of the system is no more a coherent state. The probability to excite the state $`|\\Phi_{\\alpha}>`$ is equal to\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pα0=|Aα|2superscriptsubscript𝑃𝛼0superscriptsubscript𝐴𝛼2P_{\\alpha}^{0}=|A_{\\alpha}|^{2}</td>\n<td></td>\n<td>(14)</td>\n</tr>\n</tbody>\n</table>\n\nwhere $`A_{\\alpha}`$ is solution of the coupled equations of motion without any imaginary potential. Therefore, $`P_{\\alpha}^{0}`$ contains only the absorption due to all the adopted channels. The remaining part, due to all the other channels, can be introduced by writing, in analogy with eq. ([13](#E13 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")),\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pα=Pα0×Pg.s.w¯.subscript𝑃𝛼superscriptsubscript𝑃𝛼0superscriptsubscript𝑃formulae-sequence𝑔𝑠¯𝑤P_{\\alpha}=P_{\\alpha}^{0}\\times P_{g.s.}^{\\bar{w}}\\,.</td>\n<td></td>\n<td>(15)</td>\n</tr>\n</tbody>\n</table>\n\nThis equation can be formally derived by assuming that the absorption due to the excluded channels is the same in all the adopted ones. This is certainly an approximation, however we would like to emphasize that many important inelastic channels are explicitly taken into account in the coupled equations and that we solve the latter exactly. Therefore, the corresponding absorption is calculated correctly, including the Q-value effects. The unknown auxiliary imaginary potential $`\\overline{W}`$ can be eliminated by inserting eq. ([12](#E12 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) in eq. ([15](#E15 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\"))\n\n<table>\n<tbody>\n<tr>\n<td></td>\n<td>Pα=Pα0×Pg.s.wPg.s.0subscript𝑃𝛼superscriptsubscript𝑃𝛼0superscriptsubscript𝑃formulae-sequence𝑔𝑠𝑤superscriptsubscript𝑃formulae-sequence𝑔𝑠0P_{\\alpha}=P_{\\alpha}^{0}\\times{P_{g.s.}^{w}\\over P_{g.s.}^{0}}</td>\n<td></td>\n<td>(16)</td>\n</tr>\n</tbody>\n</table>\n\nwhich is the expression we have used in order to calculate the inelastic cross section. We would like to stress that the part of the nuclear absorption that corresponds to non inelastic channels is often taken into account as a sharp cut off transmission coefficient. So the introduction of the imaginary potential can be seen as an important improvement.\n\n## III Results and discussion\n\nThe above described model has been applied to the reaction <sup>40</sup>Ca on <sup>40</sup>Ca at E/u = 50 MeV. The one-phonon basis has been obtained with a self-consistent HF+RPA calculation with Skyrme interaction SGII \\[[10](#bib.bib10)\\]. Only the most collective one-phonon states, exhausting at least $`5\\%`$ of the relevant EWSR, are taken into account. They are listed in table [I](#T1 \"TABLE I ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\"). We then have considered all possible two-phonon states that can be constructed out of them, with all possible values of the total angular momentum L, and in this space we have diagonalised the hamiltonian ([5](#E5 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")) to get the states ([6](#E6 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")). In table [II](#T2 \"TABLE II ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") we have reported some properties of the quadrupole states, each one labeled with the name of its main component and whose unperturbed energy is given in the second column. In the third column there are the energy shifts due to the anharmonicities. Their overlaps with the single and double ISGQR states are shown in the last two columns. Similar tables for the GDR states are reported in \\[[6](#bib.bib6)\\].\n\nThe elementary nuclear form factors $`W`$ to pure one- and two-phonon configurations (eq.([8](#E8 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\"))) were calculated by double folding the M3Y nucleon-nucleon interaction \\[[11](#bib.bib11)\\] with the RPA transition densities. The transition matrix elements between mixed states $`|\\Phi_{\\alpha}>`$ were computed by mixing the elementary form factors according to the unitary transformation ([6](#E6 \"In II The model ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")). The same procedure was used with the Coulomb interaction to calculate the Coulomb form factors. The relative motion trajectories were determined by solving the classical equation of motion in the presence of both Coulomb field and real part of the nuclear potential.\n\nThe real part of the optical potential was obtained by double folding the M3Y nucleon-nucleon potential with the Hartree-Fock densities of the two nuclei while its imaginary part was chosen with the same geometry and multiplied by a scale factor whose value (0.627) was determined by a fit to the experimental elastic cross section for the collision <sup>40</sup>Ca on <sup>40</sup>Ca at E/u = 50 MeV of ref. \\[[12](#bib.bib12)\\].\n\nIn these calculations both the nuclear and Coulomb excitations were included. Actually, the Coulomb excitation alone does not produce a sizable cross section because the colliding nuclei are not very heavy, but when it is considered together with the nuclear excitation it produces an interference effect which can be important. This is due to the fact that on one hand we have a coupled channel effect and, on the other hand, some two-phonon states are excited only when both fields are acting. This was clearly demonstrated in our previous work \\[[7](#bib.bib7)\\].\n\nSince our calculations are based on a discrete RPA we get a discrete excitation spectrum and a cross section $`\\sigma_{\\alpha}`$ corresponding to each state $`|\\Phi_{\\alpha}>`$. The energy differential cross sections presented in fig. [1](#F1 \"FIG. 1 ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") are obtained by summing up all the contributions coming from the states $`|\\Phi_{\\alpha}>`$ after a smoothing of each individual line by a Lorentzian with a 3 MeV width. The dashed line refers to a calculation where the internal hamiltonian is harmonic and the external field is linear. The solid line corresponds to a calculation where the anharmonicity and non-linearity were introduced, which produce a sizable increase with respect to the standard case. In the figure we can clearly distinguish three energy regions. The cross sections given in tables [III](#T3 \"TABLE III ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") to [V](#T5 \"TABLE V ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") are obtained by summing up the $`\\sigma_{\\alpha}`$’s for the discrete states $`|\\Phi_{\\alpha}>`$ lying in each region. As already observed in ref. \\[[6](#bib.bib6), [7](#bib.bib7)\\], the increase at low energies is due both to the anharmonicities and non-linearities. In particular, the anharmonicities are important because the low lying two-phonon states can be excited by the $`W^{10}`$ part of the external field through their large one-phonon component. At high energies the main contribution comes from the non-linearities because their presence increases the number of excitation routes. This is seen better in table [IV](#T4 \"TABLE IV ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") where the excitation cross section in the double giant quadrupole resonance energy region is reported. For each multipolarity we have summed the excitation cross section in the energy region between 28 and 38 MeV, and this is done for four different cases as shown in the table. The L=3 contribution is due to the HEOR at 31.33 MeV, while the L=0,2 and 4 contributions are dominated by the double excitation of the double ISGQR. As we can see in table [V](#T5 \"TABLE V ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\"), the non-linear terms are also responsible for the increase of the cross section in the ISGQR region, especially for the L=2 state whose main component is the ISGQR. This is at variance with the relativistic Coulomb excitation studied in ref. \\[[6](#bib.bib6)\\] because the Coulomb interaction very selectively populates dipole transitions and therefore cannot excite the most important two-phonon components of the ISGQR which are built with monopole and quadrupole phonons (see table [II](#T2 \"TABLE II ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")).\n\nThe obtained ratio between cross section in the Giant Resonance region and that in the two phonon one varies from 3.7 in the anharmonic and non-linear case to 4.6 in the harmonic and linear calculation. If we only consider the cross section to the single and double isoscalar giant quadrupole resonance those ratios increase to 6.5 and 9.6, respectively. Those values are smaller than the ones reported in ref. \\[[3](#bib.bib3)\\] for the cross sections at the grazing angle. This difference can be traced back to the present availability of the experimental elastic cross section needed to fix the imaginary part of the optical potential and to the fact that the theoretical approach has been improved in several aspects, especially in the calculation of the form factors.\n\nOur calculation can be compared with the experimental data of ref. \\[[4](#bib.bib4)\\] where the reaction <sup>40</sup>Ca+<sup>40</sup>Ca at 50 MeV/u has been studied. Let us resume the important results of ref. \\[[4](#bib.bib4)\\] and the most critical points. We discuss first the inclusive spectrum and later on we will analyse the one obtained in coincidence with backward emitted particles. The inelastic spectrum was extracted for ejectiles scattered between 3.4 and 10 degrees in the center of mass frame. The GR contribution was obtained from the inclusive inelastic spectrum by deconvolution of the angular distributions into inelastic excitations and a non-inelastic background. For the inelastic excitation, a DWBA prediction was used. As for the background, its angular distribution was assumed to be similar to the one of the energy region located immediately above the GRs. This procedure gave 113 mb/sr between 12 and 22 MeV for the inelastic excitation corresponding to 40% of the quadrupole EWSR. However, it should be noticed that the estimate of the non-inelastic background underlying the GR is not unambiguous. Indeed, if inelastic excitation is still present in the region above the resonance as expected from fig.1, the assumed background is overestimated. In this case the extracted value should be understood as a minimum. The maximum inelastic contribution compatible with the measured angular distribution is 223 mb/sr. This corresponds to the other extreme when no non-inelastic background is considered. Therefore the GR cross section extracted from the inclusive spectrum is between 113 and 223 mb/sr depending upon the background hypothesis. The associated EWSR would thus range between 40 and 80% if the whole cross section is assumed to be coming from quadrupole states.\n\nIn order to get the total cross section one has to extrapolate the measured differential cross section beyond the solid angle covered by the ejectile detector. This was done by assuming that the DWBA angular distribution used to fit the measured angular distribution in ref. \\[[4](#bib.bib4)\\] was also valid in the region in which no data are available. The ratio between the integrals of the DWBA cross section over the full angular range and that over the angles covered by the detector is 3.16. Taking into account the fraction of the solid angle covered by the spectrometer one gets a total compensating factor of 6.67x10<sup>-2</sup>. Such factor transforms the double differential cross section into the energy differential one. The resulting total cross section is then 7.5 and 15 mb respectively. These values have to be compared with the theoretical inelastic cross section which, in the anharmonic and non-linear case, adds up to 22 mb in the GR region. Keeping into account the uncertainties of the analysis of the experimental data and the fact that our theoretical results are obtained without adjusting any parameter, the comparison can be considered satisfactory. In order to draw quantitative conclusions one should elaborate on different issues both from the experimental and theoretical sides. A recent experiment on the same reaction \\[[13](#bib.bib13)\\] using an improved apparatus is expected to eliminate most of the experimental uncertainties. These new data will allow a more reliable determination of some parameters entering in the theoretical calculation, mainly in the optical potential.\n\nCoincidences with backward emitted particles provide an unambiguous signal for the inelastic excitations and could in principle be used to avoid the non-inelastic background problems. This was the idea of ref. \\[[4](#bib.bib4)\\], but some other sources of uncertainties appear. The coincidence rate with backward emitted protons was converted into a differential cross section correcting for the energy dependence of the proton multiplicity. At that time it was already stressed that this correction factor can be subject to many uncertainties. First of all, this proton multiplicity function was calculated with a statistical decay code which does not include any direct decay component. Furthermore, due to the absence of out-of-plane detectors, the azimuthal angular distribution was not measured and was assumed to be uniform. This procedure gives a cross section for the GR extracted from the coincidence data (339 mb/sr) larger than the one obtained from the inclusive inelastic spectrum (between 113 and 223 mb/sr). This shows that the hypotheses used are not correct. The use of a 4$`\\pi`$ detector in a recent experiment \\[[13](#bib.bib13)\\] should solve these ambiguities since it will provide the angular distribution of the emitted protons and there will be no need to rely on a statistical code to infer their multiplicity. That was not the case in the experiment of ref. \\[[4](#bib.bib4)\\]. Therefore, only the ratio was deduced from the coincidence data. Two values of this ratio were reported by assuming two backgrounds for the two-phonon region, while the GR peak was considered with no background subtraction in the coincidence spectrum. The values of the second phonon cross section were, after subtraction of the two backgrounds, 30 and 17 mb/sr respectively for an energy running from 28 to 40 MeV, while the GR cross section was 339 mb/sr in the coincidence spectrum in the range of 12 to 22 MeV, leading to the ratios 11 and 20 quoted in ref. \\[[4](#bib.bib4)\\]\n\n<sup>\\*</sup>\n\n<sup>\\*</sup>\\*one digit inversion error was spot in the text of fig. 16 c) and d) of ref.\\[4\\] (erratum to be published)\n\n. Such values are the ratios between the single GR cross section and only a small fraction of the DGR cross section. We want to stress here that the correct procedure should be not to subtract any background in the two-phonon region. Indeed, on one hand, coincidence with backward emitted particles avoids any contribution from non-inelastic background in the experimental data. On the other hand, in our theoretical calculation, not only double GQR has been included but many contributions from different inelastic excitations have been taken into account. These two remarks plead in favour of a direct comparison of the one and the two-phonon regions with no background subtraction.\n\nIn order to have a more direct comparison we present, in fig. 2, the experimental coincidence inelastic spectrum of ref. \\[[4](#bib.bib4)\\] (fig. 16 b) with no background subtraction. The right scale is the double differential cross section while the left scale is the energy differential cross section obtained with the above mentioned factor of 6.67x10<sup>-2</sup>. In the figure we present the theoretical results smoothed by a Lorentzian of 5 MeV width, rather than the 3 MeV used in fig. 1. From the figure we see that with this value the shape of the experimental peak in the GR region is well reproduced. It should be noticed that some contribution to the experimental cross section is present just below 14 MeV. However, due to the proximity of the proton emission threshold, the correction for the multiplicity is more delicate in that energy region. Disregarding these two points, the overall agreement between theory and experiment is rather satisfactory. A rough estimate of the one-phonon and two-phonon cross-sections can be obtained by integrating both curves in the energy ranges shown in fig.2 as shadowed areas. By doing that one would get an experimental and theoretical ratio of 2.4 and 2.3, respectively. We want to stress that the experimental ratio quoted above is different from the one deduced in ref. \\[[4](#bib.bib4)\\] because the latter is the ratio between the full peak of the single GR and the DGR with background subtraction, while the former one is obtained without background subtraction in both single GR and DGR. Furthermore the first two experimental points in fig. 2 were not included as explained before. Finally, we would like to comment on the dependence of the theoretical ratios upon the smoothing width. This ratio is decreasing with the increasing width, due to the fact that while the integral of the single GR is decreasing the one over the region of the DGR remains almost unchanged. This is related to the fact that in the single GR energy region the peaks of the single $`\\Phi_{\\alpha}`$ states are quite separate while the density of states in the DGR region is very high. In any case the dependence on the width is not very strong: by varying $`\\Gamma`$ from 3 to 6 MeV the ratio changes from 2.75 to 2.20. These values cannot be directly compared with the values reported in tables [III](#T3 \"TABLE III ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\")-[V](#T5 \"TABLE V ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") because the latter have been obtained just by summing the cross sections associated with each discrete state.\n\n## IV Conclusions\n\nWe have calculated the inelastic scattering cross sections of one- and two-phonon states for the <sup>40</sup>Ca + <sup>40</sup>Ca collision at E/u=50 MeV. Several effects have been evidenced. In particular, we have analyzed the role played by anharmonicities in the excitation spectrum and non-linearities in the operator describing the mutual interaction of the collision partners. The anharmonicities are particularly important at relatively low energy where the excitation comes through the one-phonon component of the mixed states. The non-linearities give their main contribution at high energy, in particular in the region of the double quadrupole giant resonance. Namely, in the interval between 28 and 38 MeV, they give an increase of about 40% with respect to a harmonic and linear calculation. This increase is due to the excitation of other two-phonon states which are populated because of the presence of the anharmonicities and non-linearities. With all the previously discussed caveats, the comparison of the smoothed theoretical result with the experimental coincidence inelastic spectrum of ref. \\[[4](#bib.bib4)\\] is satisfactory. The inclusion of three-phonon states in the calculation will increase the inelastic cross section at higher excitation energies. At the same time, a fraction of the population of the two-phonon states will move to higher energies. In ref. \\[[14](#bib.bib14)\\] it has been shown within a simple model that the spectrum calculated by diagonalizing in a space including up to three phonons the hamiltonian obtained by a boson expansion truncated at the quartic order is in reasonable agreement with the exact one. A similar calculation is feasible also in a realistic case. This, together with the results shown here, encourages us to proceed in the direction of calculating the three-phonon excitation cross section for the system <sup>40</sup>Ca + <sup>40</sup>Ca at E/u=50 MeV for which experiments have already been done \\[[13](#bib.bib13)\\].\n\n###### Acknowledgements.\n\nThis work has been partially supported by the Spanish DGICyT under contract PB98-1111, by the Spanish-Italian agreement between the CICyT and the INFN and by the Spanish-French agreement between the CICyT and the IN2P3.\n\n## REFERENCES\n\n- \\[1\\]\n\n  H. Emling, Prog. Part. Nucl. Phys. 33 (1994) 729; Ph. Chomaz and N. Frascaria, Phys. Rep. 252 (1995) 275; T. Aumann, P. F. Bortignon and H. Emling, Annu. Rev. Nucl. Part. Sci. vol. 48 (1998).\n\n- \\[2\\]\n\n  C. Benesh, B. Cook and J. Vary, Phys. Rev. C 40 (1989) 1198.\n\n- \\[3\\]\n\n  F. Catara, Ph. Chomaz and A. Vitturi, Nucl. Phys. A 471 (1987) 661.\n\n- \\[4\\]\n\n  J. A. Scarpaci et al., Phys. Rev. C 56 (1997) 3187; J. A. Scarpaci et al., Phys. Rev. Lett. 71 (1993) 3766.\n\n- \\[5\\]\n\n  C. Volpe, F. Catara, Ph. Chomaz, M.V. Andrés and E.G. Lanza, Nucl. Phys. A 589 (1995) 521; Nucl. Phys. A 599 (1996) 347c.\n\n- \\[6\\]\n\n  E. G. Lanza, M. V. Andrés, F. Catara, Ph. Chomaz and C. Volpe, Nucl. Phys. A 613 (1997) 445; Nucl. Phys. A 654 (1999) 792c.\n\n- \\[7\\]\n\n  E. G. Lanza, M. V. Andrés, F. Catara, Ph. Chomaz and C. Volpe, Nucl. Phys. A 636 (1998) 452.\n\n- \\[8\\]\n\n  M. Hage-Hassan and M. Lambert, Nucl. Phys. A188 (1972) 545.\n\n- \\[9\\]\n\n  K. Alder and A. Winther, Electromagnetic Excitation, North-Holland, Amsterdam (1975).\n\n- \\[10\\]\n\n  N. V. Giai and H. Sagawa, Phys. Lett. 106B (1981) 379.\n\n- \\[11\\]\n\n  G. R. Satchler and W. G. Love, Phys. Rep. 55 (1979) 183.\n\n- \\[12\\] J. A. Scarpaci, Ph.D thesis, Université d’Orsay, France,1990.\n\n- \\[13\\]\n\n  N. Frascaria, Nucl. Phys. A 687 (2001) 154 and private communications.\n\n- \\[14\\]\n\n  C.Volpe, Ph. Chomaz, M.V. Andrés, F. Catara and E.G. Lanza, Nucl. Phys. A 647 (1999) 246.\n\nFIG. 1.: Inelastic cross section for the system <sup>40</sup>Ca + <sup>40</sup>Ca at 50 MeV/u as function of the excitation energy. Both curves are the result of a smoothing procedure with a Lorentzian with a width $`\\Gamma`$=3 MeV. The shadowed areas are the energy regions over which we have summed the cross sections reported in the tables.\n\nFIG. 2.: The dots represent the experimental coincidence inelastic spectrum of ref. \\[4\\] (Fig.16 b) with no background subtraction (right scale). The solid line is the result of a smoothing procedure with a Lorentzian with a width $`\\Gamma`$=5 MeV of the theoretical inelastic cross section for the anharmonic and non-linear case (left scale). The shadowed areas are the energy regions over which we have integrated the energy differential cross sections. The resulting values in mb are the numbers reported in the two areas. Those above the curves refer to the theoretical results, while the ones below refer to the experimental data. In the inset we report the ratios between the single GR cross section and the DGR ones for the two cases.\n\nTABLE I.: RPA one-phonon basis for the nucleus <sup>40</sup>Ca. For each state its spin, parity, isospin, energy and percentage of the EWSR are reported.\n<table>\n<tbody>\n<tr>\n<td>Phonons</td>\n<td>Jπsuperscript𝐽𝜋J^{\\pi}</td>\n<td>T𝑇T</td>\n<td>E​(M​e​V)𝐸𝑀𝑒𝑉E(MeV)</td>\n<td>%EWSR\\%EWSR</td>\n</tr>\n<tr>\n<td>G​M​R1𝐺𝑀subscript𝑅1GMR_{1}</td>\n<td>0+superscript00^{+}</td>\n<td>00</td>\n<td>18.2518.2518.25</td>\n<td>303030</td>\n</tr>\n<tr>\n<td>G​M​R2𝐺𝑀subscript𝑅2GMR_{2}</td>\n<td>0+superscript00^{+}</td>\n<td>00</td>\n<td>22.4722.4722.47</td>\n<td>545454</td>\n</tr>\n<tr>\n<td>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}</td>\n<td>1−superscript11^{-}</td>\n<td>111</td>\n<td>17.7817.7817.78</td>\n<td>565656</td>\n</tr>\n<tr>\n<td>G​D​R2𝐺𝐷subscript𝑅2GDR_{2}</td>\n<td>1−superscript11^{-}</td>\n<td>111</td>\n<td>22.0322.0322.03</td>\n<td>101010</td>\n</tr>\n<tr>\n<td>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅ISGQR</td>\n<td>2+superscript22^{+}</td>\n<td>00</td>\n<td>16.9116.9116.91</td>\n<td>858585</td>\n</tr>\n<tr>\n<td>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅IVGQR</td>\n<td>2+superscript22^{+}</td>\n<td>111</td>\n<td>29.5929.5929.59</td>\n<td>262626</td>\n</tr>\n<tr>\n<td>3−superscript33^{-}</td>\n<td>3−superscript33^{-}</td>\n<td>00</td>\n<td>4.944.944.94</td>\n<td>141414</td>\n</tr>\n<tr>\n<td>L​E​O​R𝐿𝐸𝑂𝑅LEOR</td>\n<td>3−superscript33^{-}</td>\n<td>00</td>\n<td>9.719.719.71</td>\n<td>555</td>\n</tr>\n<tr>\n<td>H​E​O​R𝐻𝐸𝑂𝑅HEOR</td>\n<td>3−superscript33^{-}</td>\n<td>00</td>\n<td>31.3331.3331.33</td>\n<td>252525</td>\n</tr>\n</tbody>\n</table>\n\nTABLE II.: Characteristics of the $`|\\Phi_{\\alpha}>`$ quadrupole 2<sup>+</sup> states whose major components are in the first column. In the second column we show the energies of the major components in the harmonic approach. The shift in the energy produced by the anharmonicities is indicated by $`\\Delta\\hspace{0pt}E`$ (in KeV). We can compare these values with the diagonal matrix elements of the residual interaction, $`\\Delta\\hspace{0pt}E_{0}`$ (in KeV). In the last columns we report the amplitude with which the single and double ISGQR components appear in the mixed states.\n<table>\n<thead>\n<tr>\n<th>Quadrupole</th>\n<th></th>\n<th>States</th>\n<th>E0subscript𝐸0E_{0}(MeV)</th>\n<th>Δ​EΔ𝐸\\Delta E</th>\n<th>(Δ​E0)Δsubscript𝐸0(\\Delta E_{0})</th>\n<th>cI​S​G​Q​Rsubscript𝑐𝐼𝑆𝐺𝑄𝑅c_{{}_{ISGQR}}</th>\n<th>cI​S​G​Q​R×I​S​G​Q​Rsubscript𝑐𝐼𝑆𝐺𝑄𝑅𝐼𝑆𝐺𝑄𝑅c_{{}_{ISGQR\\times ISGQR}}</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅ISGQR</th>\n<th></th>\n<th></th>\n<th>16.91016.91016.910</th>\n<td>−402.402-402.</td>\n<td>0.00.</td>\n<td>0.9850.9850.985</td>\n<td>−0.0140.014-0.014</td>\n</tr>\n<tr>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅IVGQR</th>\n<th></th>\n<th></th>\n<th>29.59429.59429.594</th>\n<td>−506.506-506.</td>\n<td>0.00.</td>\n<td>−0.0050.005-0.005</td>\n<td>0.0170.0170.017</td>\n</tr>\n<tr>\n<th>G​M​R1𝐺𝑀subscript𝑅1GMR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅\\!\\!ISGQR</th>\n<th>35.15535.15535.155</th>\n<td>87.8787.</td>\n<td>−11.11-11.</td>\n<td>−0.0730.073-0.073</td>\n<td>−0.0280.028-0.028</td>\n</tr>\n<tr>\n<th>G​M​R1𝐺𝑀subscript𝑅1GMR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅\\!\\!IVGQR</th>\n<th>47.84547.84547.845</th>\n<td>−42.42-42.</td>\n<td>−187.187-187.</td>\n<td>−0.0000.000-0.000</td>\n<td>0.0020.0020.002</td>\n</tr>\n<tr>\n<th>G​M​R2𝐺𝑀subscript𝑅2GMR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅\\!\\!ISGQR</th>\n<th>39.37839.37839.378</th>\n<td>246.246246.</td>\n<td>−31.31-31.</td>\n<td>−0.1080.108-0.108</td>\n<td>−0.0140.014-0.014</td>\n</tr>\n<tr>\n<th>G​M​R2𝐺𝑀subscript𝑅2GMR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅\\!\\!IVGQR</th>\n<th>52.06752.06752.067</th>\n<td>190.190190.</td>\n<td>−178.178-178.</td>\n<td>−0.0020.002-0.002</td>\n<td>0.0030.0030.003</td>\n</tr>\n<tr>\n<th>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>G​D​R1𝐺𝐷subscript𝑅1\\!\\!GDR_{1}</th>\n<th>35.56035.56035.560</th>\n<td>−464.464-464.</td>\n<td>−505.505-505.</td>\n<td>0.0340.0340.034</td>\n<td>0.0870.0870.087</td>\n</tr>\n<tr>\n<th>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>G​D​R2𝐺𝐷subscript𝑅2\\!\\!GDR_{2}</th>\n<th>39.81439.81439.814</th>\n<td>−436.436-436.</td>\n<td>−439.439-439.</td>\n<td>0.0090.0090.009</td>\n<td>0.0060.0060.006</td>\n</tr>\n<tr>\n<th>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>3−superscript3\\!\\!3^{-}</th>\n<th>22.72222.72222.722</th>\n<td>−31.31-31.</td>\n<td>−35.35-35.</td>\n<td>0.0290.0290.029</td>\n<td>−0.0000.000-0.000</td>\n</tr>\n<tr>\n<th>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>L​E​O​R𝐿𝐸𝑂𝑅\\!\\!LEOR</th>\n<th>27.48627.48627.486</th>\n<td>−444.444-444.</td>\n<td>−442.442-442.</td>\n<td>−0.0130.013-0.013</td>\n<td>−0.0070.007-0.007</td>\n</tr>\n<tr>\n<th>G​D​R1𝐺𝐷subscript𝑅1GDR_{1}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>H​E​O​R𝐻𝐸𝑂𝑅\\!\\!HEOR</th>\n<th>49.11049.11049.110</th>\n<td>−278.278-278.</td>\n<td>−288.288-288.</td>\n<td>−0.0050.005-0.005</td>\n<td>0.0060.0060.006</td>\n</tr>\n<tr>\n<th>G​D​R2𝐺𝐷subscript𝑅2GDR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>G​D​R2𝐺𝐷subscript𝑅2\\!\\!GDR_{2}</th>\n<th>44.06844.06844.068</th>\n<td>−435.435-435.</td>\n<td>−436.436-436.</td>\n<td>0.0040.0040.004</td>\n<td>0.0020.0020.002</td>\n</tr>\n<tr>\n<th>G​D​R2𝐺𝐷subscript𝑅2GDR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>3−superscript3\\!\\!3^{-}</th>\n<th>26.97626.97626.976</th>\n<td>−6.6-6.</td>\n<td>7.77.</td>\n<td>0.0030.0030.003</td>\n<td>0.0010.0010.001</td>\n</tr>\n<tr>\n<th>G​D​R2𝐺𝐷subscript𝑅2GDR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>L​E​O​R𝐿𝐸𝑂𝑅\\!\\!LEOR</th>\n<th>31.74031.74031.740</th>\n<td>−307.307-307.</td>\n<td>−309.309-309.</td>\n<td>0.0000.0000.000</td>\n<td>−0.0070.007-0.007</td>\n</tr>\n<tr>\n<th>G​D​R2𝐺𝐷subscript𝑅2GDR_{2}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>H​E​O​R𝐻𝐸𝑂𝑅\\!\\!HEOR</th>\n<th>53.36453.36453.364</th>\n<td>−212.212-212.</td>\n<td>−217.217-217.</td>\n<td>0.0000.0000.000</td>\n<td>0.0000.0000.000</td>\n</tr>\n<tr>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅ISGQR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅\\!\\!ISGQR</th>\n<th>33.81933.81933.819</th>\n<td>0.00.</td>\n<td>4.44.</td>\n<td>−0.0200.020-0.020</td>\n<td>0.9950.9950.995</td>\n</tr>\n<tr>\n<th>I​S​G​Q​R𝐼𝑆𝐺𝑄𝑅ISGQR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅\\!\\!IVGQR</th>\n<th>46.50846.50846.508</th>\n<td>39.3939.</td>\n<td>40.4040.</td>\n<td>0.0020.0020.002</td>\n<td>0.0020.0020.002</td>\n</tr>\n<tr>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅IVGQR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>I​V​G​Q​R𝐼𝑉𝐺𝑄𝑅\\!\\!IVGQR</th>\n<th>59.19859.19859.198</th>\n<td>−247.247-247.</td>\n<td>−250.250-250.</td>\n<td>−0.0070.007-0.007</td>\n<td>−0.0040.004-0.004</td>\n</tr>\n<tr>\n<th>3−superscript33^{-}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>3−superscript3\\!\\!3^{-}</th>\n<th>9.8849.884~{}9.884</th>\n<td>750.750750.</td>\n<td>776.776776.</td>\n<td>−0.0450.045-0.045</td>\n<td>−0.0050.005-0.005</td>\n</tr>\n<tr>\n<th>3−superscript33^{-}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>L​E​O​R𝐿𝐸𝑂𝑅\\!\\!LEOR</th>\n<th>14.64814.64814.648</th>\n<td>−267.267-267.</td>\n<td>−241.241-241.</td>\n<td>0.0860.0860.086</td>\n<td>0.0010.0010.001</td>\n</tr>\n<tr>\n<th>3−superscript33^{-}\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>H​E​O​R𝐻𝐸𝑂𝑅\\!\\!HEOR</th>\n<th>36.27236.27236.272</th>\n<td>−104.104-104.</td>\n<td>−120.120-120.</td>\n<td>0.0250.0250.025</td>\n<td>−0.0030.003-0.003</td>\n</tr>\n<tr>\n<th>L​E​O​R𝐿𝐸𝑂𝑅LEOR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>L​E​O​R𝐿𝐸𝑂𝑅\\!\\!LEOR</th>\n<th>19.41319.41319.413</th>\n<td>−271.271-271.</td>\n<td>−269.269-269.</td>\n<td>−0.0210.021-0.021</td>\n<td>−0.0000.000-0.000</td>\n</tr>\n<tr>\n<th>L​E​O​R𝐿𝐸𝑂𝑅LEOR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>H​E​O​R𝐻𝐸𝑂𝑅\\!\\!HEOR</th>\n<th>41.03741.03741.037</th>\n<td>−192.192-192.</td>\n<td>−197.197-197.</td>\n<td>−0.0050.005-0.005</td>\n<td>0.0020.0020.002</td>\n</tr>\n<tr>\n<th>H​E​O​R𝐻𝐸𝑂𝑅HEOR\\!\\!</th>\n<th>⊗tensor-product\\otimes</th>\n<th>H​E​O​R𝐻𝐸𝑂𝑅\\!\\!HEOR</th>\n<th>62.66062.66062.660</th>\n<td>−212.212-212.</td>\n<td>−215.215-215.</td>\n<td>−0.0060.006-0.006</td>\n<td>−0.0010.001-0.001</td>\n</tr>\n</tbody>\n</table>\n\nTABLE III.: Coulomb plus nuclear excitation cross section for <sup>40</sup>Ca + <sup>40</sup>Ca at 50 MeV/u. Each multipolarity contribution is shown for several anharmonic and non–linear combinations. The values for L=1 and 5 are very small and they are not shown. The cross sections (in mb) are summed over the energy region (0 $`\\leq E \\leq`$ 12 MeV).\n<table>\n<thead>\n<tr>\n<th>Phonons</th>\n<th>harm. & lin.</th>\n<th>harm. & non-lin.</th>\n<th>anh. & lin.</th>\n<th>anh. & non-lin.</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>L=0</td>\n<td> 0.1</td>\n<td> 0.3</td>\n<td> 1.3</td>\n<td> 2.3</td>\n</tr>\n<tr>\n<td>L=2</td>\n<td> 0.2</td>\n<td> 0.4</td>\n<td> 0.2</td>\n<td> 0.1</td>\n</tr>\n<tr>\n<td>L=3</td>\n<td>14.2</td>\n<td>16.9</td>\n<td>14.3</td>\n<td>16.8</td>\n</tr>\n<tr>\n<td>L=4</td>\n<td> 0.2</td>\n<td> 0.3</td>\n<td> 0.2</td>\n<td> 0.3</td>\n</tr>\n<tr>\n<td>L=6</td>\n<td> 0.5</td>\n<td> 0.7</td>\n<td> 0.4</td>\n<td> 0.7</td>\n</tr>\n<tr>\n<td>total</td>\n<td>15.2</td>\n<td>18.6</td>\n<td>16.4</td>\n<td>20.2</td>\n</tr>\n</tbody>\n</table>\n\nTABLE IV.: Same as table [III](#T3 \"TABLE III ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") but for the double ISGQR region. The cross sections (in mb) are summed over the energy region (28 MeV $`\\leq E \\leq`$ 38 MeV). The values in parentheses correspond to the double ISGQR state.\n<table>\n<thead>\n<tr>\n<th>Phonons</th>\n<th>harm. & lin.</th>\n<th>harm. & non-lin.</th>\n<th>anh. & lin.</th>\n<th>anh. & non-lin.</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>L=0</td>\n<td> 0.2  (0.15)</td>\n<td> 0.3  (0.26)</td>\n<td> 0.2  (0.14)</td>\n<td> 0.3  (0.21)</td>\n</tr>\n<tr>\n<td>L=2</td>\n<td> 0.6  (0.33)</td>\n<td> 1.0  (0.51)</td>\n<td> 0.6  (0.33)</td>\n<td> 1.1  (0.53)</td>\n</tr>\n<tr>\n<td>L=3</td>\n<td> 2.2</td>\n<td> 2.5</td>\n<td> 2.3</td>\n<td> 2.5</td>\n</tr>\n<tr>\n<td>L=4</td>\n<td> 1.0  (0.90)</td>\n<td> 1.9  (1.83)</td>\n<td> 0.9  (0.85)</td>\n<td> 1.8  (1.73)</td>\n</tr>\n<tr>\n<td>L=6</td>\n<td> 0.2</td>\n<td> 0.2</td>\n<td> 0.2</td>\n<td> 0.2</td>\n</tr>\n<tr>\n<td>total</td>\n<td> 4.2  (1.38)</td>\n<td> 5.9  (2.60)</td>\n<td> 4.2  (1.32)</td>\n<td> 5.9  (2.47)</td>\n</tr>\n</tbody>\n</table>\n\nTABLE V.: Same as table [III](#T3 \"TABLE III ‣ Microscopic description of Coulomb and nuclear excitation of multiphonon states in 40Ca + 40Ca collisions\") but for the ISGQR region. The cross sections (in mb) are summed over the energy region (14 MeV $`\\leq E \\leq`$ 20 MeV). In this region there are no states with L=3 and 5. The values in parentheses correspond to the ISGQR state.\n<table>\n<thead>\n<tr>\n<th>Phonons</th>\n<th>har. & lin.</th>\n<th>harm. & non-lin.</th>\n<th>anh. & lin.</th>\n<th>anh. & non-lin.</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>L=0</td>\n<td> 2.7</td>\n<td> 2.8</td>\n<td> 2.2</td>\n<td> 2.2</td>\n</tr>\n<tr>\n<td>L=1</td>\n<td> 3.0</td>\n<td> 2.9</td>\n<td> 3.6</td>\n<td> 3.3</td>\n</tr>\n<tr>\n<td>L=2</td>\n<td>13.3  (13.2)</td>\n<td>16.1  (16.0)</td>\n<td>13.8  (13.6)</td>\n<td>16.0  (16.0)</td>\n</tr>\n<tr>\n<td>L=4</td>\n<td> 0.1</td>\n<td> 0.1</td>\n<td> 0.1</td>\n<td> 0.1</td>\n</tr>\n<tr>\n<td>L=6</td>\n<td> 0.2</td>\n<td> 0.3</td>\n<td> 0.2</td>\n<td> 0.3</td>\n</tr>\n<tr>\n<td>total</td>\n<td>19.3</td>\n<td>22.2</td>\n<td>19.9</td>\n<td>21.9</td>\n</tr>\n</tbody>\n</table><|endoftext|>"
    }
  },
  "all": {
    "total_tokens_train": 2413690506,
    "total_tokens_test": 268814704,
    "tokenizer": "EleutherAI/gpt-neo-125M",
    "vocab_size": 50257,
    "max_length": -1,
    "column": "category",
    "labels": [
      "biology",
      "cyber",
      "nuclear"
    ],
    "length_strategy": "none"
  }
}