% \documentclass{uai2023} % for initial submission
\documentclass[accepted]{uai2023} % after acceptance, for a revised
                                    % version; also before submission to
                                    % see how the non-anonymous paper
                                    % would look like
%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2023} % ptmx math instead of Computer
                                         % Modern (has noticable issues)
% \documentclass[mathfont=newtx]{uai2023} % newtx fonts (improves upon
                                          % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}

%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{tikz} % nice language for creating drawings and diagrams
\usepackage{subfig}
\usepackage{graphicx}

\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{mathtools}
\usepackage{amsthm}

\usepackage{hyperref}
\usepackage{cleveref}

\usepackage{colortbl}  %彩色表格需要加载的宏包
\usepackage{xcolor}
\usepackage{array}
\usepackage{marvosym}

%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \footnotesize, we provide the original \footnotesize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
% \newcommand{\swap}[3][-]{#3#1#2} % just an example

\title{Memory Mechanism for Unsupervised Anomaly Detection}

% The standard author block has changed for UAI 2023 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is authomatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
\author[1,2]{\href{mailto:www.lijiahao@live.cn}{Jiahao Li}}
\author[1,2]{\href{mailto:yqchen@ict.ac.cn}{Yiqiang Chen\textsuperscript{\Letter}}}
% \author[1,2]{\href{mailto:yqchen@ict.ac.cn}{Yiqiang Chen}}
\author[1,2]{\href{mailto:xingyunbing@ict.ac.cn}{Yunbing Xing}}


% Add affiliations after the authors
\affil[1]{%
    Institute of Computing Technology\\
    Chinese Academy of Sciences \\
    Beijing, CN
}

\affil[2]{%
    University of Chinese Academy of Sciences\\
    Beijing, CN
}
% \affil[3]{%
%     The Beijing Key Laboratory of Mobile Computing and Pervasive Device\\
%     Beijing, CN
% }

  
  \begin{document}
\maketitle

\begin{abstract}
  Unsupervised anomaly detection is a binary classification that detects anomalies in unseen samples given only unlabeled normal data. Reconstruction-based approaches are widely used, which perform reconstruction error minimization on training data to learn normal patterns and quantify the degree of anomalies by reconstruction errors on testing data. However, this approach tends to miss anomalies when the normal data has multi-pattern. Because the model generalizes unrestrictedly beyond normal patterns even to include anomaly patterns. In this paper, we proposed a memory mechanism that memorizes typical normal patterns through a capacity-controlled external differentiable matrix so that the generalization of the model to anomalies is limited by the retrieval of the matrix. We achieved state-of-the-art performance on several public benchmarks.

% powered by ChatGPT
  % Detecting anomalies in unseen samples using only unlabeled normal data is a challenging task that requires unsupervised anomaly detection using binary classification. To achieve this, reconstruction-based approaches have been widely used. These approaches minimize the reconstruction error on the training data to learn normal patterns and then quantify the degree of anomalies by measuring the reconstruction errors on testing data. However, these approaches tend to miss anomalies when the normal data consists of multiple patterns, as the model can generalize unrestrictedly beyond normal patterns and even include anomaly patterns. To address this issue, we propose a novel memory mechanism that memorizes typical normal patterns using a capacity-controlled external differentiable matrix. This approach ensures that the generalization of the model to anomalies is limited by the retrieval of the matrix. Our method has been tested on several public benchmarks and has achieved state-of-the-art performance.
\end{abstract}

\section{Introduction}
\label{sec:intro}
Overconfident models can lead to silent failures. Once a trained model is deployed into an open-world scenario, it will inevitably produce silent failures\citep{gonzalez2022distance}, meaning that the model is overconfident in subsuming unknown classes into known classes without making any declarations. The cost of silent failure is unacceptable in areas such as medical diagnosis, military decision-making, and financial risk control. Therefore, it is necessary to equip the model with the ability to truthfully report unknowns.

\begin{figure}[t]
        \begin{minipage}{0.99\linewidth}
            \subfloat[Test set challenge.\label{fig:Test Chn}]{
		\centering
		\includegraphics[width=0.95\textwidth]{pictures/test.png}}
	\end{minipage}
	\hfill
	\begin{minipage}{0.99\linewidth}
            \subfloat[Training set challenge.\label{fig:Train Chn}]{
		\centering
		\includegraphics[width=0.95\textwidth]{pictures/train.png}}
	\end{minipage}
  \caption{Two major challenges of OGP. \textbf{Test Set Challenge:} Anomalies are represented in red and normal in purple. The anomalies shown on the left side of the dashed line are clearly different from the training set and can be detected easily. The right side of the dashed line shows the semantic anomalies. Hexagons and trapezoids represent semantic-level anomalies that are homologous to the original dataset, which is more insidious and difficult to detect. A large percentage of non-semantic anomalies can yield seemingly good performance. But when all the anomalies in the test set are semantic anomalies, the performance of the model is exposed realistically. \textbf{Training Set Challenge:} The unsupervised learning dilemma in the unlabeled multi-pattern training set is shown on the left. The training sets for the unlabeled and labeled scenarios are shown on the left and right, respectively.}
  \label{fig:Udemo}
\end{figure}


The unsupervised anomaly detection (UAD) task is kind of known-or-unknown judgment on unseen data given unlabeled known (normal) data, which requires the model to detect unknowns (anomalies) based on the generalization of the known (normal) data\citep{yang2021visual}. The reconstruction-based approach as shown in \cref{fig:AD framework} is the classical paradigm of UAD, which minimizes the reconstruction error on the normal data with the help of autoencoder (AE) framework for training, and then detects anomalies by reconstruction error\citep{bengio2006greedy,baldi2012autoencoders,ruff2021unifying}. AD framework expect small reconstruction errors on normal samples and relatively large ones on anomaly samples. However, some studies have found and pointed out the failure case\citep{zong2018deep,gong2019memorizing}, i.e., the anomalies are also well generalized thus failing to produce significant reconstruction errors. To visualize the failure case of the overgeneralized anomalies, we show the illustration in \cref{fig:OGP demo}. The overgeneralized anomalies will lead to reconstruction errors that are difficult to distinguish from normal ones. The overgeneralization problem (OGP) has the following challenges.


\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/a.png}
  \caption{The reconstruction-based AD approaches.}
  \label{fig:AD framework}
\end{figure}

\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/c.png}
  \caption{The visual illustration of overgeneralized anomalies. We selected the numbers 1 to 9 as the normal and 0 as the anomalies from MNIST. We use a fully connected AE as a reconstruction model to train on normal classes. The purple box in the figure shows the result of the model reconstructing the known classes. The lower right corner of the red box shows the reconstruction results of the anomalies. We expect the model to produce the reconstruction shown in the picture marked in the green box for normal classes, resulting in a sufficiently large reconstruction error. But in fact, the model outputs the picture marked by the orange box, which means that the model also generalizes well to the anomalies.}
  \label{fig:OGP demo}
\end{figure}


One challenge comes from the unlabeled training set, where the data may be non-single patterns. The lack of pattern labels leads to two dilemmas, as shown in \cref{fig:Train Chn}. First, it is impossible to know what pattern an instance belongs to when given one. Second, it is impossible to know how many patterns the training data has when given one. In other words, neither the boundaries nor the number of patterns is available. Label-free guided AE networks need to generalize patterns in isolation. This is why unsupervised networks are unable to sensitively extract patterns in the data leading to overconfident models. 


Another challenge comes from the test set, where the data may be semantic anomalies\citep{DBLP:journals/corr/abs-1908-04388}. A semantic anomaly is an anomaly that differs from the normal pattern only at the semantic level. For example, in 2D graphical anomaly detection with known normal data, the anomalies in the test set can be roughly divided into two categories, as shown in \cref{fig:Test Chn}. The two categories of anomalies on either side of the dotted line are 3D graphics and 2D graphics, respectively. For the anomaly detection model, detecting 3D anomalies is very simple because there is a big difference between 3D and 2D. However, detecting 2D anomalies requires further analysis of the number of edges, corners, and angles of the graph, which places a higher demand on pattern analysis at a finer granularity. In other words, semantic anomalies are more difficult to distinguish from normal ones.





Many methods have been proposed one after another to try to solve the OGP. MemAE\citep{gong2019memorizing} proposed a memory module that makes progress on a class of classification scenarios with a combination of prototype learning and sparse attention mechanisms. MNAD\citep{park2020learning} proposed a memory module that learns in a clustering-like manner without the aid of gradient updating. SSPCAB\citep{ristea2022self} proposed a convolutional attention block to improve anomaly detection. It needs to be affirmed that the academic community has recognized that the reconstruction false-negative problem is caused by the model falling into the OGP, which means that the model simply generalizes a reconstruction constant mapping in a one-sided manner like memoryless learning.

However, no study has yet combined the two previously mentioned challenges(\cref{fig:Test Chn,fig:Train Chn}), i.e., detecting semantic anomalies under unlabeled multi-pattern normal data. Beyond that, all the existing methods inevitably introduce multiple hyperparameters both in the module and penalty term, which makes the models need to be well-tuned for deployment in real industrial applications in advance. And once the task is changed, the previous optimal combination of hyperparameters may need to be tuned again in order to achieve usable performance. These methods are not user-friendly in terms of comprehensibility and implementation in practical deployments. We propose a memory mechanism that can be performed simultaneously with model training and does not introduce any additional penalty term in the reconstruction loss. The memory mechanism can be well encapsulated by platforms such as PyTorch\citep{paszke2019pytorch} with only one line of code to equip existing models. The memory mechanism can effectively cope with the coexistence of training and test set challenges because the memory space learns to extract data patterns instead of just unilaterally generalizing the reconstruction mapping. 

Overall, the core contributions of this paper are as follows:

\begin{itemize}
  \item We proposed a capacity-controlled memory mechanism with a mapping-sharing strategy (\cref{sec: Memory Mechanism}), which could be a viable solution to the OGP to cope with the coexistence of training and test set challenges.
  \item We proposed a memory-based autoencoder, called a Memorizer (\cref{sec: Memorizer}), which uses a multi-round memory mechanism for learning (\cref{sec: Multi-Round}).
  \item We proposed a challenging experimental setup under the unlabeled non-single class normal data condition (\cref{sec: OGP Problem}) conforming to real-world scenarios (\cref{fig:Udemo}) different from the previous work with a non-single class of normal data, following the latest recommendations from academia.
  \item We reached state-of-the-art on several public benchmarks, proving the effectiveness of the memory mechanism.
\end{itemize}



\section{Related Work}
\label{sec: Related Work}

\paragraph{Anomaly Detection.}
Anomaly detection is a complex problem because anomalies are unknown and rare\citep{pang2021deep}. Anomaly detection has been intensively studied under statistical techniques, such as Gaussian method\citep{barnett1976ordering,barnett1984outliers,beckman1983outlier,ye2001anomaly}, mixed parameter distributions method\citep{lauer2001mixture,10.5555/645529.658128,abraham1989outlier,box1968bayesian,agarwal2005empirical}, histograms method\citep{10.5555/645529.658128,1702202,helman1997statistically}, kernel functions method\citep{yeung2002parzen,bishop1994novelty}, and so on. However, these methods cannot effectively deal with high-dimensional data. With the development of deep learning techniques, deep anomaly detection models emerged\citep{chalapathy2019deep}. Supervised methods are built on the basis that each normal class instance has a class label\citep{shilton2013combined,jumutc2014multi,kim2015deep,erfani2017shared}. However, such precisely labeled data for a mount of normal instances is often not available\citep{chalapathy2019deep}. In contrast, unsupervised methods do not require data labeling but also face the following challenges\citep{chalapathy2019deep,gong2019memorizing,zong2018deep}. First, learning the commonality of normal data in high-dimensional space. Second, how to choose the hyperparameters of the autoencoder to obtain optimal performance. Third, the autoencoder suffers from the OGP and fails to produce large reconstruction errors for anomalies. 

\paragraph{Representation Learning.}
Several studies in recent years have been devoted to addressing the shortcomings of the unsupervised approach. Memory-based approaches are seen as promising solutions. MemAE\citep{gong2019memorizing} proposed memory modules that use the encoder output of the latent space to apply attention mechanisms to the memory prototypes in the module to obtain weights. The sparse weights are then used to weigh and sum the memory prototypes as decoder inputs. The weight sparsification loss is introduced as a penalty term in the loss function. MNAD\citep{park2020learning} draws on KMeans clustering\citep{lloyd1982least} to update the memory prototype with a non-gradient style. The compact loss and separation loss are introduced into the loss function as penalty terms. SSPCAB\citep{ristea2022self} proposed a masked convolution and attention block to improve anomaly detection. TrustMAE\citep{tan2021trustmae} proposed the concept of trusted regions based on MemAE to further prevent the autoencoder suffering from the OGP. Six additional penalty terms are introduced into the loss function. In summary, the improvement of memory shows a trend of more and more penalty terms and more complex structures. Excellent performance is constantly broken but the number of hyperparameters and module complexity is increasing. Is there a simple and elegant structure that can achieve good performance without introducing penalty terms and numerous hyperparameters? More relevant studies are waiting to be conducted.

\begin{figure}[t]
  \centering
  \includegraphics[width=0.99\linewidth]{pictures/fig1.png}
  \caption{Memory versus Attention. On the left is our proposed memory mechanism and on the right is the self-attention mechanism.}
  \label{fig:memory}
\end{figure}
\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/fig4.png}
  \caption{Overview of Memorizer.}
  \label{fig:Memorizer}
\end{figure}
\begin{figure*}[t]
  \centering
  \includegraphics[width=0.99\linewidth]{pictures/fig3.png}
  \caption{Multi-Round Memory.}
  \label{fig:Unfold}
\end{figure*}
\paragraph{Attention Mechanisms.}
The early success of the attention mechanism in the field of machine translation is unprecedented\citep{bahdanau2014neural}, which is a technique that uses the query to compare keys to obtain weights to weigh the sum of values. After this, various variants of the attention mechanism emerged. General attention proposed trainable mapping matrices\citep{luong2015effective}. Hard attention proposed the concept of stochastic key\citep{xu2015show}. The self-attention mechanism proposed to rely only on itself for attention operations\citep{yang2016hierarchical}. Transformer proposed a multi-headed attention mechanism and obtained a breakthrough in the field of computer vision\citep{vaswani2017attention}. SENet proposed an attention mechanism for feature map channels\citep{hu2018squeeze}. More and more research is going deeper with the application of attention mechanisms and improvements in Transformers. The attentional mechanism and its variants show outstanding generalizability in experimental results. Can we do the opposite by using attention mechanisms to suppress overgeneralization to alleviate the OGP in UAD? More variants are to be studied.




\section{Methodology}
\label{sec: Methodology}
The main goal of the memory mechanism is to solve the problem of overgeneralization of unlabeled non-single-class data in UAD. \cref{fig:memory} illustrates the memory mechanism and how it compares to the self-attention mechanism. The attention mechanism mainly consists of mapping shared strategies and independent capacity-controlled memory, which are described in detail in \cref{sec: Memory Mechanism}. \cref{fig:Unfold} demonstrates the multi-round memory structure based on the memory mechanism, details of which are expanded in \cref{sec: Multi-Round}. The memory-based autoencoder called Memorizer is shown in \cref{fig:Memorizer} and described in detail in \cref{sec: Memorizer}.







\subsection{Definition of the Overgeneralization Problem (OGP)}
\label{sec: OGP Problem}
Given a training set $D$, it is known that $D=\left\{(x_1,y_1),(x_2,y_2),\cdots,(x_n,y_n)\right\}$ contains only normal data, where $(x,y)$ is the sample-label pair, $n$ is the number of samples, and $d$ is the dimension of sample $\forall x_i\in D,x_i\in\mathbb{R}^d$. Consider the label set $Y=\left\{y_i\right\}_{i=1}^n$ of dataset $D$, which is divided into $\xi$ classes, i.e., $Y=\bigcup_{k=1}^{\xi}\left\{\left.y_j\right|y_j=y_{c_k},\forall j\in\left[1,n\right]\right\}$. Given a test set $T=\left\{({\widetilde{x}}_1,{\widetilde{y}}_1),({\widetilde{x}}_2,{\widetilde{y}}_2),\cdots,({\widetilde{x}}_{n^\prime},{\widetilde{y}}_{n^\prime})\right\}$ containing both normal and anomalous, where $(\widetilde{x},\widetilde{y})$ is the sample-label pair, and $n^\prime$ is the number of samples $\forall{\widetilde{x}}_i\in T,{\widetilde{x}}_i\in\mathbb{R}^d$. The label set $\widetilde{Y}=\left\{{\widetilde{y}}_i\right\}_{i=1}^{n^\prime}$ of $T$ is divided into two classes as positive and negative respectively, i.e., $\widetilde{Y}=\bigcup_{c_k\in\left\{pos,neg\right\}}\left\{\left.{\widetilde{y}}_j\right|{\widetilde{y}}_j=y_{c_k},\forall j\in\left[1,n^\prime\right]\right\}$, where negative means normal and positive means anomaly. The OGP is defined as follows. Under the unlabeled non-single class normal data(UNSCND) condition $\{\xi>1,Y=\emptyset\}$, the performance $Q$ of the reconstruction-based model $\Omega$ on $T$ decreases as the number of classes $\xi$ or model capacity $\mathcal{O}\left(\mathrm{\Omega}\right)$ increases, i.e., $Q_\mathrm{\Omega}\left(T\right)\propto{(\xi\mathcal{O}\left(\mathrm{\Omega}\right))}^{-1}$. 

To truthfully and exclusively study the OGP without generating embellished scores in the experimental results that obscure the OGP, two key points needed to be stated. First, following the recommendation of paper\citep{DBLP:journals/corr/abs-1908-04388} for the academic community, the anomaly should be semantic level, i.e., $D$ and $T$ come from the \textbf{SAME} dataset as shown on the right side of the dashed line in \cref{fig:Udemo}. Second. it is important to note the distinction between traditional unlabeled single class normal data (USCND) condition $\{\xi=1,Y=\emptyset\}$ and \textbf{UNSCND} condition $\{\xi>1,Y=\emptyset\}$, as the smaller $\xi$ tends to obscure the OGP in $Q_\mathrm{\Omega}\left(T\right)$.






\subsection{Memory Mechanism}
\label{sec: Memory Mechanism}
The input of the memory mechanism is denoted as $\mathcal{H}\in\mathbb{R}^{B\times{(W*H*C)}}$, where $B,W,H,C$ are the batch size, width, height, and the number of channels, respectively. The main component of the memory mechanism consists of a trainable matrix $\mathcal{M}\in\mathbb{R}^{N\times F}$, where $N$ is the memory capacity and $F=W*H*C$. $W_{S}\in\mathbb{R}^{F\times F}$ is a trainable linear mapping shared by $\mathcal{H}$ and $\mathcal{M}$. Denote the SoftMax\citep{bridle1989training} function as $\sigma$ in the direction of dimension $N$. The output $\widetilde{\mathcal{H}}$ of the memory mechanism is defined by \cref{eq:QKV memory} as follows.

\begin{equation}
    \widetilde{\mathcal{H}}=Memory\left(\mathcal{H},\mathcal{M}\right)=\sigma\left(\mathcal{H}{W_S\left(\mathcal{M}W_S\right)}^T\right)\mathcal{M}  
      \label{eq:QKV memory}
\end{equation}

Notice that $\widetilde{\mathcal{H}}\in\mathbb{R}^{B\times{F}}$ in \cref{eq:QKV memory} is in the same space as $\mathcal{H}$, and $\mathbb{R}^{B\times{(W*H*C)}}$ is the flattened form of $\mathbb{R}^{B\times{H}\times{W}\times{C}}$.







\subsection{Multi-Round Memory}
\label{sec: Multi-Round}

In addition to the mapping sharing strategy $W_{S}$ mentioned by \cref{sec: Memory Mechanism} to pull $\mathcal{H}$ and $\mathcal{M}$ into the same space for comparison, we found that multiple rounds of memory for the memory matrix $\mathcal{H}$ are also beneficial for overgeneralization suppression. Multi-round memory of learning helps memory to extract and consolidate the intrinsic patterns of the data in the form described by \cref{eq:multi mem}.

\begin{equation}
    {\widetilde{\mathcal{H}}}_r=Memory\left({\widetilde{\mathcal{H}}}_{r-1}+\mathcal{H},\mathcal{M}_r\right),\ \ ({\widetilde{\mathcal{H}}}_0=\mathcal{H})
      \label{eq:multi mem}
\end{equation}

${\widetilde{\mathcal{H}}}_r$ and $\mathcal{M}_r(\forall r\in\left[1,R\right])$ represents the $r$-th round of $\mathcal{H}$ and $\mathcal{M}$ in the serial R-round memory learning. The output $\mathcal{H}_R$ of the final R-round memory is differentiable with respect to the first-round input $\mathcal{H}$ as described in \cref{eq:chine law}.

\begin{equation}
\begin{aligned}
    \frac{\partial \widetilde{\mathcal{H}}_R}{\partial \mathcal{H}}&=\frac{\partial \widetilde{\mathcal{H}}_R}{\partial \widetilde{\mathcal{H}}_0}\\&=\prod_{i=1}^{R}\frac{\partial{\widetilde{\mathcal{H}}}_i}{\partial{\widetilde{\mathcal{H}}}_{i-1}}\\&=\frac{\partial \widetilde{\mathcal{H}}_R}{\partial \widetilde{\mathcal{H}}_{R-1}} \frac{\partial \widetilde{\mathcal{H}}_{R-1}}{\partial \widetilde{\mathcal{H}}_{R-2}} \cdots \frac{\partial \widetilde{\mathcal{H}}_1}{\partial \widetilde{\mathcal{H}}_0}
      \label{eq:chine law}
\end{aligned}
\end{equation}

It can be noted that the final output after the $R$-th round of memory can be back-propagated to the encoder by the chain derivative law for the gradient, which ensures that the multi-round memorization process is differentiable.




\subsection{Memorizer: Memory-based Auto-Encoder}
\label{sec: Memorizer}
Memorizers are composed on an autoencoder framework. Encoder $f_{\theta_E}:\mathbb{R}^d\rightarrow\mathbb{R}^F$ and decoder $g_{\theta_D}:\mathbb{R}^F\rightarrow\mathbb{R}^d$ are nonlinear learning functions, respectively. The structure of the Memorizer is described in \cref{eq:encode,eq:mem encoder outs,eq:decode}.

\begin{equation}
    \mathcal{H}=f_{\theta_E}\left(X\right)
      \label{eq:encode}
\end{equation}

\begin{equation}
    \widetilde{\mathcal{H}}=\left(Memory\left({\widetilde{\mathcal{H}}}_{r-1}+\mathcal{H},\mathcal{M}_r\right)\right)_{r=1:R}
      \label{eq:mem encoder outs}
\end{equation}

\begin{equation}
    \widetilde{X}=g_{\theta_D}\left(\widetilde{\mathcal{H}}\right)      \label{eq:decode}
\end{equation}

The \cref{eq:encode,eq:decode} in which $X=\left\{x_{i_j}\right\}_{j=1}^B\in\mathbb{R}^{B\times d}$ denotes the input data of one batch and $\widetilde{X}$ denotes the reconstructed data. The loss function of the Memorizer is as follows.

\begin{equation}
    L=\frac{1}{B} \sum_{j=1}^B\left\|x_{i_j}-\widetilde{x}_{i_j}\right\|_2^2
      \label{eq:recon loss}
\end{equation}

$x_{i_j}$ represents the data in $D$ that is shuffled into the batch. Notice that the decoder input $\widetilde{H}$ of the memorizer comes from a linear weighted sum of $\widetilde{M}$. Therefore, the model generalizability is suppressed by and only by the memory capacity $N$ and no longer depends on the model capacity $\mathcal{O}\left(\mathrm{\Omega}\right)$. So the essence of the Memorizer is a controlled transformation of the OGP using the memory mechanism as described in \cref{eq:OGP trans}.


\begin{equation}
    Q_\mathrm{\Omega}\left(T\right)\propto{(\xi\mathcal{O}\left(\mathrm{\Omega}\right))}^{-1}\Rightarrow Q_\mathcal{M}\left(T\right)\propto\frac{\gamma}{\xi \mathcal{O}\left(\mathcal{M}\right)}
      \label{eq:OGP trans}
\end{equation}

Notice that $\mathcal{O}\left(\mathcal{M}\right)$ is a correlation function on $N$. Without loss of generality, $\exists\delta,\gamma\in\mathbb{Z}^+$, $\lim\limits_{R\rightarrow\gamma}Q_\mathcal{M}\left(T\right)=\delta$ holds for fixed $\xi$ and $N$.


\begin{figure*}
\begin{center}
    \captionsetup{type=figure}
    \includegraphics[width=.99\textwidth]{pictures/show.png}
    \captionof{figure}{The memory mechanism alleviates the OGP in the face of the combined challenge of the training and test sets.}
    \label{fig:show}
\end{center}
\end{figure*}


\section{Experiment}
\label{sec: Experiment}
In this section, we conduct parallel comparison experiments for the OGP on multiple public benchmarks to verify the effectiveness of the Memorizer for overgeneralization inhibition with the principle of absolute fair comparison. Finally, we performed an ablation study and sensitivity analysis to ensure that the mechanisms and structures proposed in this paper are positive for solving the OGP. 

% \begin{table*}[t]
% \caption{The numerical results.}
% \label{tab:main}
% \resizebox{\linewidth}{!}{
% \begin{tabular}{ccccccccccccccccccccccccc}
% \textbf{ROCO}      & \multicolumn{8}{c}{\textbf{AUROC}}                                                                                                                 & \multicolumn{8}{c}{\textbf{AUPRC}}                                                                                                        & \multicolumn{8}{c}{\textbf{F1}}                                                                                                                    \\
% \textbf{MNIST}     & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% 0                  & 79.16±0.30 & 82.57±0.44          & 82.86±1.06 & 94.68±1.73          & 86.16±1.2           & 79.88±0.33 & 80.95±0.12          & \textbf{97.79±0.64} & 25.83±0.69 & 35.59±0.66          & 31.61±1.87 & 62.26±0.53          & 39.98±3.23 & 26.50±0.61 & 28.75±0.34          & \textbf{80.24±1.52} & 36.52±0.52 & 41.35±0.62          & 41.15±1.76 & 63.97±6.58          & 44.85±1.38          & 36.63±0.59 & 38.73±0.42          & \textbf{80.16±0.39} \\
% 1                  & 12.45±0.94 & 19.21±3.36          & 14.05±0.99 & 38.99±0.40          & 17.16±2.93          & 16.90±1.43 & 11.39±0.58          & \textbf{41.61±0.03} & 6.27±0.04  & 6.67±0.24           & 6.37±0.06  & 8.47±0.05           & 6.57±0.16  & 6.51±0.08  & 6.23±0.04           & \textbf{8.81±0.01}  & 20.39±0.00 & 20.41±0.03          & 20.41±0.02 & 20.92±0.04          & 20.42±0.05          & 20.39±0.00 & 20.39±0             & \textbf{22.78±0.06} \\
% 2                  & 86.20±0.22 & 89.26±0.15          & 89.23±1.76 & 97.14±0.08          & 93.80±1.35          & 86.63±0.64 & 86.80±0.02          & \textbf{97.95±0.27} & 44.93±0.52 & 56.38±1.92          & 53.31±5.13 & 83.11±0.36          & 67.98±6.65 & 46.16±1.21 & 46.10±1.12          & \textbf{86.06±0.68} & 48.47±0.78 & 56.51±1.07          & 54.72±3.89 & 75.93±0.45          & 65.39±4.26          & 49.24±1.17 & 49.85±0.48          & \textbf{78.40±0.94} \\
% 3                  & 64.68±0.10 & 67.91±3.88          & 68.62±2.51 & 95.11±0.07          & 82.05±3.12          & 66.62±0.41 & 66.64±0.24          & \textbf{94.97±0.61} & 16.84±0.22 & 17.96±3.65          & 19.02±1.82 & \textbf{64.20±0.85} & 31.06±4.58 & 17.31±0.27 & 17.62±0.18          & 61.25±4.30          & 22.98±0.04 & 25.70±2.78          & 24.99±1.15 & 65.36±0.28          & 38.26±3.51          & 23.91±0.14 & 23.95±0.18          & \textbf{65.52±0.84} \\
% 4                  & 59.13±0.45 & 66.88±0.51          & 62.34±3.68 & 92.80±0.12          & 77.04±3.14          & 65.29±4.48 & 59.78±0.48          & \textbf{89.84±0.81} & 13.24±0.25 & 16.54±0.86          & 14.20±1.41 & \textbf{60.84±0.04} & 25.84±3.25 & 16.44±2.61 & 14.13±0.42          & 48.88±0.46          & 20.40±0.11 & 24.18±0.15          & 21.77±1.25 & \textbf{59.84±0.03} & 33.11±2.57          & 23.85±3.06 & 20.33±0.14          & 52.42±0.70          \\
% 5                  & 71.34±0.15 & 75.78±0.73          & 73.77±3.06 & \textbf{96.45±0.09} & 81.16±2.69          & 72.13±0.62 & 71.56±0.76          & 96.21±0.18          & 16.26±0.13 & 19.92±0.74          & 18.16±2.11 & \textbf{73.92±0.33} & 25.80±3.49 & 17.08±0.40 & 16.60±0.62          & 69.30±0.49          & 25.83±0.26 & 30.16±1.11          & 28.12±2.68 & \textbf{70.32±0.29} & 35.61±3.29          & 26.72±0.46 & 25.79±0.57          & 67.24±0.75          \\
% 6                  & 84.92±0.30 & 82.87±0.36          & 86.82±1.96 & 93.46±0.13          & 89.64±0.52          & 85.61±1.49 & 84.80±0.36          & \textbf{94.97±0.44} & 41.85±0.83 & 35.49±0.74          & 48.60±4.87 & 66.06±0.70          & 54.59±1.00 & 42.96±2.49 & 43.48±0.74          & \textbf{70.68±0.54} & 47.46±0.77 & 39.98±0.80          & 51.55±3.40 & 62.80±0.27          & 56.01±1.21          & 47.64±2.18 & 48.38±0.9           & \textbf{66.39±0.48} \\
% 7                  & 61.91±0.85 & 58.75±1.91          & 62.93±1.97 & 74.92±1.18          & 61.20±3.09          & 67.45±2.13 & 59.79±0.43          & \textbf{84.14±1.05} & 19.13±0.69 & 13.91±0.75          & 21.06±1.76 & 25.22±1.84          & 15.15±1.25 & 23.09±1.66 & 19.64±0.31          & \textbf{36.51±0.27} & 22.78±0.75 & 21.00±0.86          & 24.01±1.23 & 31.85±1.18          & 21.73±1.32          & 27.69±1.39 & 22.3±0.51           & \textbf{42.97±0.32} \\
% 8                  & 67.84±0.16 & 76.32±0.81          & 73.22±4.39 & 96.02±0.38          & 91.06±0.54          & 75.90±1.58 & 69.22±0.09          & \textbf{96.16±0.73} & 14.98±0.06 & 23.62±0.93          & 18.84±2.88 & \textbf{72.64±1.38} & 49.03±2.02 & 20.97±1.42 & 16.06±0.17          & 66.82±3.82          & 24.98±0.15 & 30.74±0.44          & 28.37±2.63 & 68.02±0.52          & 51.88±1.52          & 29.96±1.36 & 25.42±0.04          & \textbf{70.27±0.94} \\
% 9                  & 44.38±1.70 & 47.90±6.11          & 43.79±1.02 & 71.61±0.04          & 53.54±3.41          & 51.29±2.24 & 44.69±1.04          & \textbf{78.99±0.70} & 8.82±0.31  & 9.86±1.61           & 8.71±0.18  & \textbf{17.81±0.06} & 10.31±0.76 & 9.98±0.49  & 8.69±0.10           & 26.78±0.35          & 19.77±0.05 & 19.60±0.94          & 20.07±0.13 & 27.65±0.08          & 21.38±1.02          & 20.68±0.3  & 19.51±0.08          & \textbf{36.26±0.31} \\
% AVG                & 63.20±0.52 & 66.75±1.83          & 65.76±2.24 & 85.12±0.42          & 73.33±2.20          & 66.77±1.53 & 63.56±0.41          & \textbf{87.26±0.55} & 20.81±0.37 & 23.59±1.11          & 23.99±2.21 & 53.45±1.61          & 32.63±2.64 & 22.70±1.12 & 21.73±0.40          & \textbf{55.53±1.24} & 28.96±0.34 & 30.96±0.88          & 31.52±1.81 & 54.67±0.97          & 38.86±2.01          & 30.67±1.07 & 29.46±0.33          & \textbf{58.24±0.57} \\
% \textbf{Fashion}   & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% T-shirt            & 57.03±0.24 & 44.28±2.63          & 58.08±1.00 & 58.34±0.22          & 55.66±0.92          & 57.15±1.99 & 57.63±0.12          & \textbf{60.33±0.08} & 11.57±0.03 & 9.68±0.50           & 11.99±0.19 & 12.99±0.05          & 11.62±0.19 & 11.67±0.72 & 11.74±0.12          & \textbf{13.7±0.12}  & 20.80±0.12 & 18.63±0.33          & 21.21±0.47 & 20.77±0.04          & 20.21±0.37          & 20.95±0.7  & 20.91±0.06          & \textbf{21.58±0.01} \\
% Trouser            & 71.03±1.25 & 88.37±1.25          & 73.80±2.22 & 84.94±0.18          & \textbf{88.78±0.48} & 70.06±2.11 & 63.66±1.35          & 84.28±0.37          & 15.37±0.50 & \textbf{39.17±3.44} & 16.87±1.24 & 31.17±0.08          & 37.73±1.33 & 14.76±1.15 & 12.25±0.43          & 29.00±0.80          & 28.67±0.96 & \textbf{49.32±2.22} & 30.74±1.75 & 42.95±0.27          & 50.60±0.92          & 28.45±0.95 & 25.29±0.67          & 42.33±0.94          \\
% Pullover           & 43.64±0.25 & 33.31±0.21          & 47.41±1.24 & 58.58±0.19          & 52.75±0.41          & 45.19±1.13 & 43.76±0.16          & \textbf{58.71±0.10} & 8.91±0.03  & 7.41±0.08           & 9.39±0.17  & 11.91±0.11          & 10.25±0.09 & 9.03±0.20  & 8.76±0.03           & \textbf{11.92±0.08} & 18.81±0.14 & 18.18±0.00          & 19.30±0.19 & 21.36±0.04          & 20.24±0.09          & 18.95±0.12 & 18.87±0.09          & \textbf{21.41±0.02} \\
% Dress              & 61.30±0.97 & 65.09±1.48          & 63.79±2.24 & \textbf{70.84±0.42} & 67.52±1.16          & 63.47±1.87 & 64.48±0.58          & 70.61±0.27          & 12.32±0.28 & 14.15±0.56          & 12.87±0.59 & \textbf{16.65±0.33} & 14.27±0.44 & 12.90±0.73 & 13.18±0.17          & 15.91±0.17          & 23.18±0.39 & 24.71±1.20          & 24.31±1.04 & \textbf{27.72±0.34} & 25.79±0.63          & 24.10±0.72 & 24.65±0.41          & 27.40±0.34          \\
% Coat               & 48.93±0.16 & 45.47±4.27          & 51.31±0.25 & 54.98±0.28          & 52.32±1.50          & 48.74±0.21 & 49.95±0.32          & \textbf{56.35±0.11} & 8.94±0.03  & 8.64±0.74           & 9.34±0.05  & 10.37±0.04          & 9.64±0.26  & 8.87±0.05  & 9.09±0.07           & \textbf{10.60±0.03} & 20.33±0.01 & 18.57±0.08          & 20.79±0.09 & 20.91±0.04          & 20.81±0.47          & 20.29±0.07 & 20.76±0.05          & \textbf{21.54±0.09} \\
% Sandal             & 92.28±0.46 & \textbf{92.63±0.30} & 90.87±1.14 & 87.38±0.29          & 89.25±0.34          & 91.85±0.92 & 91.70±0.23          & 86.17±0.19          & 49.03±1.46 & \textbf{52.72±1.07} & 48.16±1.47 & 43.19±0.81          & 44.75±0.55 & 49.71±2.24 & 51.87±0.79          & 39.48±0.12          & 55.15±1.43 & \textbf{56.79±0.65} & 51.09±3.00 & 45.45±0.81          & 47.93±0.44          & 53.68±2.73 & 53.59±0.81          & 42.98±0.51          \\
% Shirt              & 51.15±0.52 & 34.04±1.53          & 50.23±0.54 & \textbf{54.47±0.03} & 52.63±0.96          & 51.20±0.53 & 51.59±0.20          & 54.23±0.35          & 11.34±0.15 & 7.64±0.34           & 10.93±0.22 & 10.90±0.05          & 11.08±0.22 & 11.12±0.13 & \textbf{11.37±0.08} & 10.94±0.06          & 19.05±0.08 & 18.21±0.00          & 18.87±0.08 & \textbf{19.72±0.03} & 19.45±0.18          & 19.18±0.06 & 19.20±0.06          & 19.65±0.02          \\
% Sneaker            & 64.85±0.40 & \textbf{70.58±3.16} & 61.49±2.84 & 65.22±0.52          & 60.66±1.26          & 61.75±2.77 & 61.62±0.49          & 64.01±0.52          & 12.33±0.14 & \textbf{14.72±1.36} & 11.38±0.82 & 12.88±0.23          & 11.20±0.32 & 11.45±0.73 & 11.33±0.14          & 12.27±0.12          & 26.77±0.19 & \textbf{29.03±1.76} & 25.52±1.11 & 25.16±0.02          & 24.31±0.65          & 25.46±1.18 & 25.85±0.12          & 25.04±0.26          \\
% Bag                & 95.56±0.17 & 88.74±0.29          & 96.18±0.51 & 94.07±0.26          & 94.84±0.36          & 96.13±0.18 & \textbf{96.39±0.17} & 95.15±0.29          & 66.06±1.29 & 43.59±0.50          & 69.71±2.62 & 56.53±1.37          & 58.42±2.10 & 69.47±1.26 & \textbf{75.12±0.76} & 64.72±2.23          & 70.06±0.78 & 52.57±0.45          & 71.56±1.23 & 63.09±0.68          & 65.35±1.10          & 71.45±0.17 & \textbf{74.18±0.42} & 66.85±1.46          \\
% Ankle boot         & 83.41±0.64 & 83.36±0.47          & 84.71±0.78 & 77.28±0.01          & 82.22±0.65          & 83.78±0.61 & \textbf{86.73±0.58} & 81.32±0.57          & 27.69±0.54 & 31.08±0.92          & 31.25±1.93 & 22.42±0.13          & 27.30±0.81 & 28.93±1.36 & \textbf{33.29±0.91} & 30.52±1.37          & 38.82±0.83 & 39.53±0.32          & 40.62±0.96 & 32.62±0.27          & 36.98±0.75          & 39.36±0.65 & \textbf{43.88±0.83} & 37.85±0.92          \\
% AVG                & 66.92±0.51 & 64.59±1.56          & 67.79±1.28 & 70.61±0.24          & 69.66±0.8           & 66.93±1.23 & 66.75±0.42          & \textbf{71.12±0.29} & 22.36±0.45 & 22.88±0.95          & 23.19±0.93 & 22.90±0.32          & 23.63±0.63 & 22.79±0.86 & 23.80±0.35          & \textbf{23.91±0.51} & 32.16±0.49 & 32.55±0.70          & 32.4±0.99  & 31.97±0.25          & \textbf{33.17±0.56} & 32.19±0.73 & 32.72±0.35          & 32.66±0.46          \\
% \textbf{Kuzushiji} & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% U+304A             & 71.50±0.03 & 71.11±2.57          & 71.36±0.86 & 81.42±0.33          & 76.92±0.18          & 71.96±0.05 & 71.56±0.35          & \textbf{85.14±0.43} & 15.92±0.10 & 16.73±1.44          & 15.86±0.52 & 31.21±0.75          & 22.07±0.44 & 16.08±0.39 & 16.27±0.95          & \textbf{37.93±1.23} & 28.78±0.06 & 28.17±1.92          & 28.39±0.55 & 37.54±0.39          & 37.54±0.39          & 29.01±0.23 & 28.91±0.07          & \textbf{43.97±0.87} \\
% U+304D             & 50.33±0.28 & 51.36±0.49          & 49.25±0.40 & 68.00±0.15          & 60.88±0.76          & 50.66±0.24 & 50.00±0.07          & \textbf{71.04±0.03} & 9.45±0.04  & 9.82±0.08           & 9.33±0.08  & 17.65±0.26          & 13.65±0.54 & 9.57±0.09  & 9.40±0.03           & \textbf{20.17±0.01} & 19.26±0.07 & 19.41±0.08          & 19.03±0.13 & 25.24±0.17          & 25.24±0.17          & 19.25±0.06 & 19.18±0.02          & \textbf{27.17±0.17} \\
% U+3059             & 43.32±0.07 & 46.93±0.31          & 42.95±0.42 & 59.05±0.50          & 52.05±0.54          & 44.41±0.16 & 43.18±0.09          & \textbf{63.14±0.2}  & 9.09±0.03  & 10.40±0.55          & 9.13±0.19  & 13.01±0.04          & 11.27±0.11 & 9.66±0.03  & 9.11±0.16           & \textbf{15.04±0.15} & 18.40±0.03 & 18.88±0.06          & 18.44±0.06 & 21.09±0.24          & 21.09±0.24          & 18.35±0.03 & 18.37±0.03          & \textbf{21.99±0.02} \\
% U+3064             & 68.78±0.17 & 72.65±0.54          & 69.96±0.41 & 78.83±0.01          & 73.77±1.46          & 69.62±0.24 & 69.36±0.04          & \textbf{82.37±0.55} & 20.23±0.17 & 24.64±0.48          & 20.40±0.53 & 29.75±0.25          & 25.52±1.58 & 21.36±0.64 & 20.39±0.39          & \textbf{32.09±0.45} & 26.73±0.12 & 29.35±0.70          & 27.07±0.09 & 36.21±0.11          & 36.21±0.11          & 27.63±0.07 & 27.27±0.05          & \textbf{39.50±0.86} \\
% U+306A             & 57.31±0.08 & 61.51±0.76          & 60.10±1.07 & 83.86±0.38          & 77.46±1.05          & 56.61±1.29 & 58.08±0.07          & \textbf{87.38±0.28} & 10.92±0.04 & 12.25±0.30          & 11.73±0.36 & 33.67±1.25          & 22.52±1.30 & 10.79±0.30 & 11.13±0.04          & \textbf{42.99±0.55} & 21.44±0.07 & 22.72±0.28          & 22.47±0.43 & 42.02±0.96          & 42.02±0.96          & 21.08±0.76 & 21.65±0.03          & \textbf{49.20±0.12} \\
% U+306F             & 19.35±0.17 & 22.19±1.15          & 20.51±0.93 & 52.66±0.45          & 37.40±0.05          & 19.90±0.27 & 20.08±0.26          & \textbf{61.14±0.74} & 5.81±0.01  & 6.03±0.08           & 5.89±0.07  & 9.66±0.06           & 7.30±0.02  & 5.83±0.01  & 5.85±0.02           & \textbf{12.36±0.14} & 18.18±0.00 & 18.19±0.00          & 18.19±0.00 & 20.10±0.11          & 20.10±0.11          & 18.18±0.00 & 18.18±0.00          & \textbf{22.45±0.40} \\
% U+307E             & 39.85±0.07 & 45.03±1.61          & 40.75±0.36 & 65.23±0.40          & 55.94±0.17          & 39.16±0.05 & 40.07±0.25          & \textbf{68.24±0.49} & 9.00±0.11  & 10.75±0.60          & 9.46±0.13  & 22.52±0.12          & 16.27±0.36 & 8.96±0.07  & 9.20±0.08           & \textbf{26.15±0.62} & 18.18±0.00 & 18.19±0.00          & 18.22±0.02 & 24.78±0.23          & 24.78±0.23          & 18.20±0.01 & 18.21±0.02          & \textbf{27.61±0.15} \\
% U+3084             & 80.46±0.10 & 80.79±0.58          & 81.46±0.57 & 90.33±0.42          & 88.66±0.27          & 80.80±0.86 & 80.79±0.05          & \textbf{92.11±0.07} & 25.26±0.27 & 24.35±0.40          & 26.39±0.64 & 53.26±2.44          & 46.60±0.25 & 25.56±1.31 & 25.55±0.08          & \textbf{58.98±1.12} & 36.14±0.01 & 36.89±1.14          & 36.68±0.67 & 54.85±1.17          & 54.85±1.17          & 36.73±0.69 & 36.46±0.02          & \textbf{57.92±0.19} \\
% U+308C             & 62.19±0.27 & 59.89±0.92          & 60.38±0.46 & 68.26±0.03          & 64.37±0.34          & 63.99±0.26 & 61.11±0.60          & \textbf{73.01±0.91} & 13.90±0.08 & 11.92±0.48          & 12.56±0.08 & 15.27±0.03          & 13.39±0.06 & 13.56±0.26 & 13.05±0.39          & \textbf{18.16±0.70} & 22.60±0.01 & 21.85±0.27          & 21.84±0.23 & 26.44±0.10          & 26.44±0.10          & 23.60±0.23 & 22.10±0.34          & \textbf{30.29±1.05} \\
% U+3092             & 65.95±0.05 & 63.24±1.00          & 66.65±0.70 & 78.07±0.24          & 73.44±0.42          & 66.45±0.11 & 66.43±0.11          & \textbf{81.41±0.55} & 15.01±0.12 & 13.97±0.48          & 15.45±0.46 & 27.36±0.49          & 19.81±0.39 & 15.51±0.40 & 15.48±0.08          & \textbf{31.10±1.28} & 24.22±0.24 & 22.87±0.50          & 24.26±0.30 & 35.46±0.07          & 35.46±0.07          & 24.26±0.01 & 24.04±0.13          & \textbf{39.02±0.63} \\
% AVG                & 55.90±0.13 & 57.47±0.99          & 56.34±0.62 & 72.57±0.29          & 66.09±0.52          & 56.36±0.35 & 56.07±0.19          & \textbf{76.50±0.42} & 13.46±0.10 & 14.09±0.49          & 13.62±0.31 & 25.34±0.57          & 19.84±0.50 & 13.69±0.35 & 13.54±0.22          & \textbf{29.50±0.62} & 23.39±0.06 & 23.65±0.5           & 23.46±0.25 & 32.37±0.35          & 32.37±0.35          & 23.63±0.21 & 23.44±0.07          & \textbf{35.91±0.45}
% \end{tabular}}
% \end{table*}
% Please add the following required packages to your document preamble:
% \usepackage{booktabs}
% \begin{table*}[t]
% \caption{The numerical results.}
% \label{tab:main}
% \resizebox{\linewidth}{!}{
% \begin{tabular}{@{}ccccccccccccccccccc@{}}
% \toprule
% \textbf{ROCO}      & \multicolumn{6}{c}{\textbf{AUROC}}                                                                                       & \multicolumn{6}{c}{\textbf{AUPRC}}                                                                     & \multicolumn{6}{c}{\textbf{F1-Score}}                                                                                          \\ \midrule
% \textbf{MNIST}     & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% 0                  & 79.16±0.30          & 94.68±1.73          & 86.16±1.2           & 79.88±0.33 & 80.95±0.12          & \textbf{97.79±0.64} & 25.83±0.69 & 62.26±0.53          & 39.98±3.23 & 26.50±0.61 & 28.75±0.34          & \textbf{80.24±1.52} & 36.52±0.52          & 63.97±6.58          & 44.85±1.38          & 36.63±0.59 & 38.73±0.42          & \textbf{80.16±0.39} \\
% 1                  & 12.45±0.94          & 38.99±0.40          & 17.16±2.93          & 16.90±1.43 & 11.39±0.58          & \textbf{41.61±0.03} & 6.27±0.04  & 8.47±0.05           & 6.57±0.16  & 6.51±0.08  & 6.23±0.04           & \textbf{8.81±0.01}  & 20.39±0.00          & 20.92±0.04          & 20.42±0.05          & 20.39±0.00 & 20.39±0             & \textbf{22.78±0.06} \\
% 2                  & 86.20±0.22          & 97.14±0.08          & 93.80±1.35          & 86.63±0.64 & 86.80±0.02          & \textbf{97.95±0.27} & 44.93±0.52 & 83.11±0.36          & 67.98±6.65 & 46.16±1.21 & 46.10±1.12          & \textbf{86.06±0.68} & 48.47±0.78          & 75.93±0.45          & 65.39±4.26          & 49.24±1.17 & 49.85±0.48          & \textbf{78.40±0.94} \\
% 3                  & 64.68±0.10          & 95.11±0.07          & 82.05±3.12          & 66.62±0.41 & 66.64±0.24          & \textbf{94.97±0.61} & 16.84±0.22 & \textbf{64.20±0.85} & 31.06±4.58 & 17.31±0.27 & 17.62±0.18          & 61.25±4.30          & 22.98±0.04          & 65.36±0.28          & 38.26±3.51          & 23.91±0.14 & 23.95±0.18          & \textbf{65.52±0.84} \\
% 4                  & 59.13±0.45          & 92.80±0.12          & 77.04±3.14          & 65.29±4.48 & 59.78±0.48          & \textbf{89.84±0.81} & 13.24±0.25 & \textbf{60.84±0.04} & 25.84±3.25 & 16.44±2.61 & 14.13±0.42          & 48.88±0.46          & 20.40±0.11          & \textbf{59.84±0.03} & 33.11±2.57          & 23.85±3.06 & 20.33±0.14          & 52.42±0.70          \\
% 5                  & 71.34±0.15          & \textbf{96.45±0.09} & 81.16±2.69          & 72.13±0.62 & 71.56±0.76          & 96.21±0.18          & 16.26±0.13 & \textbf{73.92±0.33} & 25.80±3.49 & 17.08±0.40 & 16.60±0.62          & 69.30±0.49          & 25.83±0.26          & \textbf{70.32±0.29} & 35.61±3.29          & 26.72±0.46 & 25.79±0.57          & 67.24±0.75          \\
% 6                  & 84.92±0.30          & 93.46±0.13          & 89.64±0.52          & 85.61±1.49 & 84.80±0.36          & \textbf{94.97±0.44} & 41.85±0.83 & 66.06±0.70          & 54.59±1.00 & 42.96±2.49 & 43.48±0.74          & \textbf{70.68±0.54} & 47.46±0.77          & 62.80±0.27          & 56.01±1.21          & 47.64±2.18 & 48.38±0.9           & \textbf{66.39±0.48} \\
% 7                  & 61.91±0.85          & 74.92±1.18          & 61.20±3.09          & 67.45±2.13 & 59.79±0.43          & \textbf{84.14±1.05} & 19.13±0.69 & 25.22±1.84          & 15.15±1.25 & 23.09±1.66 & 19.64±0.31          & \textbf{36.51±0.27} & 22.78±0.75          & 31.85±1.18          & 21.73±1.32          & 27.69±1.39 & 22.3±0.51           & \textbf{42.97±0.32} \\
% 8                  & 67.84±0.16          & 96.02±0.38          & 91.06±0.54          & 75.90±1.58 & 69.22±0.09          & \textbf{96.16±0.73} & 14.98±0.06 & \textbf{72.64±1.38} & 49.03±2.02 & 20.97±1.42 & 16.06±0.17          & 66.82±3.82          & 24.98±0.15          & 68.02±0.52          & 51.88±1.52          & 29.96±1.36 & 25.42±0.04          & \textbf{70.27±0.94} \\
% 9                  & 44.38±1.70          & 71.61±0.04          & 53.54±3.41          & 51.29±2.24 & 44.69±1.04          & \textbf{78.99±0.70} & 8.82±0.31  & \textbf{17.81±0.06} & 10.31±0.76 & 9.98±0.49  & 8.69±0.10           & 26.78±0.35          & 19.77±0.05          & 27.65±0.08          & 21.38±1.02          & 20.68±0.3  & 19.51±0.08          & \textbf{36.26±0.31} \\
% \rowcolor{gray!20} AVG                & 63.20±0.52          & 85.12±0.42          & 73.33±2.20          & 66.77±1.53 & 63.56±0.41          & \textbf{87.26±0.55} & 20.81±0.37 & 53.45±1.61          & 32.63±2.64 & 22.70±1.12 & 21.73±0.40          & \textbf{55.53±1.24} & 28.96±0.34          & 54.67±0.97          & 38.86±2.01          & 30.67±1.07 & 29.46±0.33          & \textbf{58.24±0.57} \\ \midrule
% \textbf{Fashion}   & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% T-shirt            & 57.03±0.24          & 58.34±0.22          & 55.66±0.92          & 57.15±1.99 & 57.63±0.12          & \textbf{60.33±0.08} & 11.57±0.03 & 12.99±0.05          & 11.62±0.19 & 11.67±0.72 & 11.74±0.12          & \textbf{13.7±0.12}  & 20.80±0.12          & 20.77±0.04          & 20.21±0.37          & 20.95±0.7  & 20.91±0.06          & \textbf{21.58±0.01} \\
% Trouser            & 71.03±1.25          & 84.94±0.18          & \textbf{88.78±0.48} & 70.06±2.11 & 63.66±1.35          & 84.28±0.37          & 15.37±0.50 & 31.17±0.08          & 37.73±1.33 & 14.76±1.15 & 12.25±0.43          & \textbf{29.00±0.80} & 28.67±0.96          & \textbf{42.95±0.27} & 50.60±0.92          & 28.45±0.95 & 25.29±0.67          & 42.33±0.94          \\
% Pullover           & 43.64±0.25          & 58.58±0.19          & 52.75±0.41          & 45.19±1.13 & 43.76±0.16          & \textbf{58.71±0.10} & 8.91±0.03  & 11.91±0.11          & 10.25±0.09 & 9.03±0.20  & 8.76±0.03           & \textbf{11.92±0.08} & 18.81±0.14          & 21.36±0.04          & 20.24±0.09          & 18.95±0.12 & 18.87±0.09          & \textbf{21.41±0.02} \\
% Dress              & 61.30±0.97          & \textbf{70.84±0.42} & 67.52±1.16          & 63.47±1.87 & 64.48±0.58          & 70.61±0.27          & 12.32±0.28 & \textbf{16.65±0.33} & 14.27±0.44 & 12.90±0.73 & 13.18±0.17          & 15.91±0.17          & 23.18±0.39          & \textbf{27.72±0.34} & 25.79±0.63          & 24.10±0.72 & 24.65±0.41          & 27.40±0.34          \\
% Coat               & 48.93±0.16          & 54.98±0.28          & 52.32±1.50          & 48.74±0.21 & 49.95±0.32          & \textbf{56.35±0.11} & 8.94±0.03  & 10.37±0.04          & 9.64±0.26  & 8.87±0.05  & 9.09±0.07           & \textbf{10.60±0.03} & 20.33±0.01          & 20.91±0.04          & 20.81±0.47          & 20.29±0.07 & 20.76±0.05          & \textbf{21.54±0.09} \\
% Sandal             & \textbf{92.28±0.46} & 87.38±0.29          & 89.25±0.34          & 91.85±0.92 & 91.70±0.23          & 86.17±0.19          & 49.03±1.46 & 43.19±0.81          & 44.75±0.55 & 49.71±2.24 & \textbf{51.87±0.79} & 39.48±0.12          & \textbf{55.15±1.43} & 45.45±0.81          & 47.93±0.44          & 53.68±2.73 & 53.59±0.81          & 42.98±0.51          \\
% Shirt              & 51.15±0.52          & \textbf{54.47±0.03} & 52.63±0.96          & 51.20±0.53 & 51.59±0.20          & 54.23±0.35          & 11.34±0.15 & 10.90±0.05          & 11.08±0.22 & 11.12±0.13 & \textbf{11.37±0.08} & 10.94±0.06          & 19.05±0.08          & \textbf{19.72±0.03} & 19.45±0.18          & 19.18±0.06 & 19.20±0.06          & 19.65±0.02          \\
% Sneaker            & \textbf{64.85±0.40} & 65.22±0.52          & 60.66±1.26          & 61.75±2.77 & 61.62±0.49          & 64.01±0.52          & 12.33±0.14 & \textbf{12.88±0.23} & 11.20±0.32 & 11.45±0.73 & 11.33±0.14          & 12.27±0.12          & \textbf{26.77±0.19} & 25.16±0.02          & 24.31±0.65          & 25.46±1.18 & 25.85±0.12          & 25.04±0.26          \\
% Bag                & 95.56±0.17          & 94.07±0.26          & 94.84±0.36          & 96.13±0.18 & \textbf{96.39±0.17} & 95.15±0.29          & 66.06±1.29 & 56.53±1.37          & 58.42±2.10 & 69.47±1.26 & \textbf{75.12±0.76} & 64.72±2.23          & 70.06±0.78          & 63.09±0.68          & 65.35±1.10          & 71.45±0.17 & \textbf{74.18±0.42} & 66.85±1.46          \\
% Ankle boot         & 83.41±0.64          & 77.28±0.01          & 82.22±0.65          & 83.78±0.61 & \textbf{86.73±0.58} & 81.32±0.57          & 27.69±0.54 & 22.42±0.13          & 27.30±0.81 & 28.93±1.36 & \textbf{33.29±0.91} & 30.52±1.37          & 38.82±0.83          & 32.62±0.27          & 36.98±0.75          & 39.36±0.65 & \textbf{43.88±0.83} & 37.85±0.92          \\
% \rowcolor{gray!20} AVG                & 66.92±0.51          & 70.61±0.24          & 69.66±0.8           & 66.93±1.23 & 66.75±0.42          & \textbf{71.12±0.29} & 22.36±0.45 & 22.90±0.32          & 23.63±0.63 & 22.79±0.86 & 23.80±0.35          & \textbf{23.91±0.51} & 32.16±0.49          & 31.97±0.25          & \textbf{33.17±0.56} & 32.19±0.73 & 32.72±0.35          & 32.66±0.46          \\ \midrule
% \textbf{Kuzushiji} & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                & AE         & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                & AE                  & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
% U+304A             & 71.50±0.03          & 81.42±0.33          & 76.92±0.18          & 71.96±0.05 & 71.56±0.35          & \textbf{85.14±0.43} & 15.92±0.10 & 31.21±0.75          & 22.07±0.44 & 16.08±0.39 & 16.27±0.95          & \textbf{37.93±1.23} & 28.78±0.06          & 37.54±0.39          & 37.54±0.39          & 29.01±0.23 & 28.91±0.07          & \textbf{43.97±0.87} \\
% U+304D             & 50.33±0.28          & 68.00±0.15          & 60.88±0.76          & 50.66±0.24 & 50.00±0.07          & \textbf{71.04±0.03} & 9.45±0.04  & 17.65±0.26          & 13.65±0.54 & 9.57±0.09  & 9.40±0.03           & \textbf{20.17±0.01} & 19.26±0.07          & 25.24±0.17          & 25.24±0.17          & 19.25±0.06 & 19.18±0.02          & \textbf{27.17±0.17} \\
% U+3059             & 43.32±0.07          & 59.05±0.50          & 52.05±0.54          & 44.41±0.16 & 43.18±0.09          & \textbf{63.14±0.2}  & 9.09±0.03  & 13.01±0.04          & 11.27±0.11 & 9.66±0.03  & 9.11±0.16           & \textbf{15.04±0.15} & 18.40±0.03          & 21.09±0.24          & 21.09±0.24          & 18.35±0.03 & 18.37±0.03          & \textbf{21.99±0.02} \\
% U+3064             & 68.78±0.17          & 78.83±0.01          & 73.77±1.46          & 69.62±0.24 & 69.36±0.04          & \textbf{82.37±0.55} & 20.23±0.17 & 29.75±0.25          & 25.52±1.58 & 21.36±0.64 & 20.39±0.39          & \textbf{32.09±0.45} & 26.73±0.12          & 36.21±0.11          & 36.21±0.11          & 27.63±0.07 & 27.27±0.05          & \textbf{39.50±0.86} \\
% U+306A             & 57.31±0.08          & 83.86±0.38          & 77.46±1.05          & 56.61±1.29 & 58.08±0.07          & \textbf{87.38±0.28} & 10.92±0.04 & 33.67±1.25          & 22.52±1.30 & 10.79±0.30 & 11.13±0.04          & \textbf{42.99±0.55} & 21.44±0.07          & 42.02±0.96          & 42.02±0.96          & 21.08±0.76 & 21.65±0.03          & \textbf{49.20±0.12} \\
% U+306F             & 19.35±0.17          & 52.66±0.45          & 37.40±0.05          & 19.90±0.27 & 20.08±0.26          & \textbf{61.14±0.74} & 5.81±0.01  & 9.66±0.06           & 7.30±0.02  & 5.83±0.01  & 5.85±0.02           & \textbf{12.36±0.14} & 18.18±0.00          & 20.10±0.11          & 20.10±0.11          & 18.18±0.00 & 18.18±0.00          & \textbf{22.45±0.40} \\
% U+307E             & 39.85±0.07          & 65.23±0.40          & 55.94±0.17          & 39.16±0.05 & 40.07±0.25          & \textbf{68.24±0.49} & 9.00±0.11  & 22.52±0.12          & 16.27±0.36 & 8.96±0.07  & 9.20±0.08           & \textbf{26.15±0.62} & 18.18±0.00          & 24.78±0.23          & 24.78±0.23          & 18.20±0.01 & 18.21±0.02          & \textbf{27.61±0.15} \\
% U+3084             & 80.46±0.10          & 90.33±0.42          & 88.66±0.27          & 80.80±0.86 & 80.79±0.05          & \textbf{92.11±0.07} & 25.26±0.27 & 53.26±2.44          & 46.60±0.25 & 25.56±1.31 & 25.55±0.08          & \textbf{58.98±1.12} & 36.14±0.01          & 54.85±1.17          & 54.85±1.17          & 36.73±0.69 & 36.46±0.02          & \textbf{57.92±0.19} \\
% U+308C             & 62.19±0.27          & 68.26±0.03          & 64.37±0.34          & 63.99±0.26 & 61.11±0.60          & \textbf{73.01±0.91} & 13.90±0.08 & 15.27±0.03          & 13.39±0.06 & 13.56±0.26 & 13.05±0.39          & \textbf{18.16±0.70} & 22.60±0.01          & 26.44±0.10          & 26.44±0.10          & 23.60±0.23 & 22.10±0.34          & \textbf{30.29±1.05} \\
% U+3092             & 65.95±0.05          & 78.07±0.24          & 73.44±0.42          & 66.45±0.11 & 66.43±0.11          & \textbf{81.41±0.55} & 15.01±0.12 & 27.36±0.49          & 19.81±0.39 & 15.51±0.40 & 15.48±0.08          & \textbf{31.10±1.28} & 24.22±0.24          & 35.46±0.07          & 35.46±0.07          & 24.26±0.01 & 24.04±0.13          & \textbf{39.02±0.63} \\
% \rowcolor{gray!20} AVG                & 55.90±0.13          & 72.57±0.29          & 66.09±0.52          & 56.36±0.35 & 56.07±0.19          & \textbf{76.50±0.42} & 13.46±0.10 & 25.34±0.57          & 19.84±0.50 & 13.69±0.35 & 13.54±0.22          & \textbf{29.50±0.62} & 23.39±0.06          & 32.37±0.35          & 32.37±0.35          & 23.63±0.21 & 23.44±0.07          & \textbf{35.91±0.45} \\ \bottomrule
% \end{tabular}}
% \end{table*}


% Please add the following required packages to your document preamble:
% \usepackage[table,xcdraw]{xcolor}
% If you use beamer only pass "xcolor=table" option, i.e. \documentclass[xcolor=table]{beamer}
\begin{table*}[t]
\caption{The AUROC results under ROCO protocol.}
\label{tab:main}
\resizebox{\linewidth}{!}{
\begin{tabular}{ccccccccc}
\hline
\textbf{MNIST}     & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
0                  & 79.16±0.30 & 82.57±0.44          & 82.86±1.06 & 94.68±1.73          & 86.16±1.2           & 79.88±0.33 & 80.95±0.12          & \textbf{97.79±0.64} \\
1                  & 12.45±0.94 & 19.21±3.36          & 14.05±0.99 & 38.99±0.40          & 17.16±2.93          & 16.90±1.43 & 11.39±0.58          & \textbf{41.61±0.03} \\
2                  & 86.20±0.22 & 89.26±0.15          & 89.23±1.76 & 97.14±0.08          & 93.80±1.35          & 86.63±0.64 & 86.80±0.02          & \textbf{97.95±0.27} \\
3                  & 64.68±0.10 & 67.91±3.88          & 68.62±2.51 & 95.11±0.07          & 82.05±3.12          & 66.62±0.41 & 66.64±0.24          & \textbf{94.97±0.61} \\
4                  & 59.13±0.45 & 66.88±0.51          & 62.34±3.68 & 92.80±0.12          & 77.04±3.14          & 65.29±4.48 & 59.78±0.48          & \textbf{89.84±0.81} \\
5                  & 71.34±0.15 & 75.78±0.73          & 73.77±3.06 & \textbf{96.45±0.09} & 81.16±2.69          & 72.13±0.62 & 71.56±0.76          & 96.21±0.18          \\
6                  & 84.92±0.30 & 82.87±0.36          & 86.82±1.96 & 93.46±0.13          & 89.64±0.52          & 85.61±1.49 & 84.80±0.36          & \textbf{94.97±0.44} \\
7                  & 61.91±0.85 & 58.75±1.91          & 62.93±1.97 & 74.92±1.18          & 61.20±3.09          & 67.45±2.13 & 59.79±0.43          & \textbf{84.14±1.05} \\
8                  & 67.84±0.16 & 76.32±0.81          & 73.22±4.39 & 96.02±0.38          & 91.06±0.54          & 75.90±1.58 & 69.22±0.09          & \textbf{96.16±0.73} \\
9                  & 44.38±1.70 & 47.90±6.11          & 43.79±1.02 & 71.61±0.04          & 53.54±3.41          & 51.29±2.24 & 44.69±1.04          & \textbf{78.99±0.70} \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 63.20±0.52 & 66.75±1.83          & 65.76±2.24 & 85.12±0.42          & 73.33±2.20          & 66.77±1.53 & 63.56±0.41          & \textbf{87.26±0.55} \\ \hline
\textbf{Fashion}   & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
T-shirt            & 57.03±0.24 & 44.28±2.63          & 58.08±1.00 & 58.34±0.22          & 55.66±0.92          & 57.15±1.99 & 57.63±0.12          & \textbf{60.33±0.08} \\
Trouser            & 71.03±1.25 & 88.37±1.25          & 73.80±2.22 & 84.94±0.18          & \textbf{88.78±0.48} & 70.06±2.11 & 63.66±1.35          & 84.28±0.37          \\
Pullover           & 43.64±0.25 & 33.31±0.21          & 47.41±1.24 & 58.58±0.19          & 52.75±0.41          & 45.19±1.13 & 43.76±0.16          & \textbf{58.71±0.10} \\
Dress              & 61.30±0.97 & 65.09±1.48          & 63.79±2.24 & \textbf{70.84±0.42} & 67.52±1.16          & 63.47±1.87 & 64.48±0.58          & 70.61±0.27          \\
Coat               & 48.93±0.16 & 45.47±4.27          & 51.31±0.25 & 54.98±0.28          & 52.32±1.50          & 48.74±0.21 & 49.95±0.32          & \textbf{56.35±0.11} \\
Sandal             & 92.28±0.46 & \textbf{92.63±0.30} & 90.87±1.14 & 87.38±0.29          & 89.25±0.34          & 91.85±0.92 & 91.70±0.23          & 86.17±0.19          \\
Shirt              & 51.15±0.52 & 34.04±1.53          & 50.23±0.54 & \textbf{54.47±0.03} & 52.63±0.96          & 51.20±0.53 & 51.59±0.20          & 54.23±0.35          \\
Sneaker            & 64.85±0.40 & \textbf{70.58±3.16} & 61.49±2.84 & 65.22±0.52          & 60.66±1.26          & 61.75±2.77 & 61.62±0.49          & 64.01±0.52          \\
Bag                & 95.56±0.17 & 88.74±0.29          & 96.18±0.51 & 94.07±0.26          & 94.84±0.36          & 96.13±0.18 & \textbf{96.39±0.17} & 95.15±0.29          \\
Ankle boot         & 83.41±0.64 & 83.36±0.47          & 84.71±0.78 & 77.28±0.01          & 82.22±0.65          & 83.78±0.61 & \textbf{86.73±0.58} & 81.32±0.57          \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 66.92±0.51 & 64.59±1.56          & 67.79±1.28 & 70.61±0.24          & 69.66±0.8           & 66.93±1.23 & 66.75±0.42          & \textbf{71.12±0.29} \\ \hline
\textbf{Kuzushiji} & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
U+304A             & 71.50±0.03 & 71.11±2.57          & 71.36±0.86 & 81.42±0.33          & 76.92±0.18          & 71.96±0.05 & 71.56±0.35          & \textbf{85.14±0.43} \\
U+304D             & 50.33±0.28 & 51.36±0.49          & 49.25±0.40 & 68.00±0.15          & 60.88±0.76          & 50.66±0.24 & 50.00±0.07          & \textbf{71.04±0.03} \\
U+3059             & 43.32±0.07 & 46.93±0.31          & 42.95±0.42 & 59.05±0.50          & 52.05±0.54          & 44.41±0.16 & 43.18±0.09          & \textbf{63.14±0.2}  \\
U+3064             & 68.78±0.17 & 72.65±0.54          & 69.96±0.41 & 78.83±0.01          & 73.77±1.46          & 69.62±0.24 & 69.36±0.04          & \textbf{82.37±0.55} \\
U+306A             & 57.31±0.08 & 61.51±0.76          & 60.10±1.07 & 83.86±0.38          & 77.46±1.05          & 56.61±1.29 & 58.08±0.07          & \textbf{87.38±0.28} \\
U+306F             & 19.35±0.17 & 22.19±1.15          & 20.51±0.93 & 52.66±0.45          & 37.40±0.05          & 19.90±0.27 & 20.08±0.26          & \textbf{61.14±0.74} \\
U+307E             & 39.85±0.07 & 45.03±1.61          & 40.75±0.36 & 65.23±0.40          & 55.94±0.17          & 39.16±0.05 & 40.07±0.25          & \textbf{68.24±0.49} \\
U+3084             & 80.46±0.10 & 80.79±0.58          & 81.46±0.57 & 90.33±0.42          & 88.66±0.27          & 80.80±0.86 & 80.79±0.05          & \textbf{92.11±0.07} \\
U+308C             & 62.19±0.27 & 59.89±0.92          & 60.38±0.46 & 68.26±0.03          & 64.37±0.34          & 63.99±0.26 & 61.11±0.60          & \textbf{73.01±0.91} \\
U+3092             & 65.95±0.05 & 63.24±1.00          & 66.65±0.70 & 78.07±0.24          & 73.44±0.42          & 66.45±0.11 & 66.43±0.11          & \textbf{81.41±0.55} \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 55.90±0.13 & 57.47±0.99          & 56.34±0.62 & 72.57±0.29          & 66.09±0.52          & 56.36±0.35 & 56.07±0.19          & \textbf{76.50±0.42} \\ \hline
\end{tabular}}
\end{table*}


\begin{table}[t]
\caption{The AUPRC results under ROCO protocol.}
\label{tab:main2}
\resizebox{\linewidth}{!}{
\begin{tabular}{ccccccccc}
\hline
\textbf{MNIST}     & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                \\
0                  & 25.83±0.69 & 35.59±0.66          & 31.61±1.87 & 62.26±0.53          & 39.98±3.23 & 26.50±0.61 & 28.75±0.34          & \textbf{80.24±1.52} \\
1                  & 6.27±0.04  & 6.67±0.24           & 6.37±0.06  & 8.47±0.05           & 6.57±0.16  & 6.51±0.08  & 6.23±0.04           & \textbf{8.81±0.01}  \\
2                  & 44.93±0.52 & 56.38±1.92          & 53.31±5.13 & 83.11±0.36          & 67.98±6.65 & 46.16±1.21 & 46.10±1.12          & \textbf{86.06±0.68} \\
3                  & 16.84±0.22 & 17.96±3.65          & 19.02±1.82 & \textbf{64.20±0.85} & 31.06±4.58 & 17.31±0.27 & 17.62±0.18          & 61.25±4.30          \\
4                  & 13.24±0.25 & 16.54±0.86          & 14.20±1.41 & \textbf{60.84±0.04} & 25.84±3.25 & 16.44±2.61 & 14.13±0.42          & 48.88±0.46          \\
5                  & 16.26±0.13 & 19.92±0.74          & 18.16±2.11 & \textbf{73.92±0.33} & 25.80±3.49 & 17.08±0.40 & 16.60±0.62          & 69.30±0.49          \\
6                  & 41.85±0.83 & 35.49±0.74          & 48.60±4.87 & 66.06±0.70          & 54.59±1.00 & 42.96±2.49 & 43.48±0.74          & \textbf{70.68±0.54} \\
7                  & 19.13±0.69 & 13.91±0.75          & 21.06±1.76 & 25.22±1.84          & 15.15±1.25 & 23.09±1.66 & 19.64±0.31          & \textbf{36.51±0.27} \\
8                  & 14.98±0.06 & 23.62±0.93          & 18.84±2.88 & \textbf{72.64±1.38} & 49.03±2.02 & 20.97±1.42 & 16.06±0.17          & 66.82±3.82          \\
9                  & 8.82±0.31  & 9.86±1.61           & 8.71±0.18  & \textbf{17.81±0.06} & 10.31±0.76 & 9.98±0.49  & 8.69±0.10           & 26.78±0.35          \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 20.81±0.37 & 23.59±1.11          & 23.99±2.21 & 53.45±1.61          & 32.63±2.64 & 22.70±1.12 & 21.73±0.40          & \textbf{55.53±1.24} \\ \hline
\textbf{Fashion}   & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                \\
T-shirt            & 11.57±0.03 & 9.68±0.50           & 11.99±0.19 & 12.99±0.05          & 11.62±0.19 & 11.67±0.72 & 11.74±0.12          & \textbf{13.7±0.12}  \\
Trouser            & 15.37±0.50 & \textbf{39.17±3.44} & 16.87±1.24 & 31.17±0.08          & 37.73±1.33 & 14.76±1.15 & 12.25±0.43          & 29.00±0.80          \\
Pullover           & 8.91±0.03  & 7.41±0.08           & 9.39±0.17  & 11.91±0.11          & 10.25±0.09 & 9.03±0.20  & 8.76±0.03           & \textbf{11.92±0.08} \\
Dress              & 12.32±0.28 & 14.15±0.56          & 12.87±0.59 & \textbf{16.65±0.33} & 14.27±0.44 & 12.90±0.73 & 13.18±0.17          & 15.91±0.17          \\
Coat               & 8.94±0.03  & 8.64±0.74           & 9.34±0.05  & 10.37±0.04          & 9.64±0.26  & 8.87±0.05  & 9.09±0.07           & \textbf{10.60±0.03} \\
Sandal             & 49.03±1.46 & \textbf{52.72±1.07} & 48.16±1.47 & 43.19±0.81          & 44.75±0.55 & 49.71±2.24 & 51.87±0.79          & 39.48±0.12          \\
Shirt              & 11.34±0.15 & 7.64±0.34           & 10.93±0.22 & 10.90±0.05          & 11.08±0.22 & 11.12±0.13 & \textbf{11.37±0.08} & 10.94±0.06          \\
Sneaker            & 12.33±0.14 & \textbf{14.72±1.36} & 11.38±0.82 & 12.88±0.23          & 11.20±0.32 & 11.45±0.73 & 11.33±0.14          & 12.27±0.12          \\
Bag                & 66.06±1.29 & 43.59±0.50          & 69.71±2.62 & 56.53±1.37          & 58.42±2.10 & 69.47±1.26 & \textbf{75.12±0.76} & 64.72±2.23          \\
Ankle boot         & 27.69±0.54 & 31.08±0.92          & 31.25±1.93 & 22.42±0.13          & 27.30±0.81 & 28.93±1.36 & \textbf{33.29±0.91} & 30.52±1.37          \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 22.36±0.45 & 22.88±0.95          & 23.19±0.93 & 22.90±0.32          & 23.63±0.63 & 22.79±0.86 & 23.80±0.35          & \textbf{23.91±0.51} \\ \hline
\textbf{Kuzushiji} & AE         & SAE                 & DAE        & VAE                 & MemAE      & MNAD       & SSPCAB              & Ours                \\
U+304A             & 15.92±0.10 & 16.73±1.44          & 15.86±0.52 & 31.21±0.75          & 22.07±0.44 & 16.08±0.39 & 16.27±0.95          & \textbf{37.93±1.23} \\
U+304D             & 9.45±0.04  & 9.82±0.08           & 9.33±0.08  & 17.65±0.26          & 13.65±0.54 & 9.57±0.09  & 9.40±0.03           & \textbf{20.17±0.01} \\
U+3059             & 9.09±0.03  & 10.40±0.55          & 9.13±0.19  & 13.01±0.04          & 11.27±0.11 & 9.66±0.03  & 9.11±0.16           & \textbf{15.04±0.15} \\
U+3064             & 20.23±0.17 & 24.64±0.48          & 20.40±0.53 & 29.75±0.25          & 25.52±1.58 & 21.36±0.64 & 20.39±0.39          & \textbf{32.09±0.45} \\
U+306A             & 10.92±0.04 & 12.25±0.30          & 11.73±0.36 & 33.67±1.25          & 22.52±1.30 & 10.79±0.30 & 11.13±0.04          & \textbf{42.99±0.55} \\
U+306F             & 5.81±0.01  & 6.03±0.08           & 5.89±0.07  & 9.66±0.06           & 7.30±0.02  & 5.83±0.01  & 5.85±0.02           & \textbf{12.36±0.14} \\
U+307E             & 9.00±0.11  & 10.75±0.60          & 9.46±0.13  & 22.52±0.12          & 16.27±0.36 & 8.96±0.07  & 9.20±0.08           & \textbf{26.15±0.62} \\
U+3084             & 25.26±0.27 & 24.35±0.40          & 26.39±0.64 & 53.26±2.44          & 46.60±0.25 & 25.56±1.31 & 25.55±0.08          & \textbf{58.98±1.12} \\
U+308C             & 13.90±0.08 & 11.92±0.48          & 12.56±0.08 & 15.27±0.03          & 13.39±0.06 & 13.56±0.26 & 13.05±0.39          & \textbf{18.16±0.70} \\
U+3092             & 15.01±0.12 & 13.97±0.48          & 15.45±0.46 & 27.36±0.49          & 19.81±0.39 & 15.51±0.40 & 15.48±0.08          & \textbf{31.10±1.28} \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 13.46±0.10 & 14.09±0.49          & 13.62±0.31 & 25.34±0.57          & 19.84±0.50 & 13.69±0.35 & 13.54±0.22          & \textbf{29.50±0.62} \\ \hline
\end{tabular}}
\end{table}


\begin{table}[t]
\caption{The best F1-Score under ROCO protocol.}
\label{tab:main3}
\resizebox{\linewidth}{!}{
\begin{tabular}{ccccccccc}
\hline
\textbf{MNIST}     & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
0                  & 36.52±0.52 & 41.35±0.62          & 41.15±1.76 & 63.97±6.58          & 44.85±1.38          & 36.63±0.59 & 38.73±0.42          & \textbf{80.16±0.39} \\
1                  & 20.39±0.00 & 20.41±0.03          & 20.41±0.02 & 20.92±0.04          & 20.42±0.05          & 20.39±0.00 & 20.39±0             & \textbf{22.78±0.06} \\
2                  & 48.47±0.78 & 56.51±1.07          & 54.72±3.89 & 75.93±0.45          & 65.39±4.26          & 49.24±1.17 & 49.85±0.48          & \textbf{78.40±0.94} \\
3                  & 22.98±0.04 & 25.70±2.78          & 24.99±1.15 & 65.36±0.28          & 38.26±3.51          & 23.91±0.14 & 23.95±0.18          & \textbf{65.52±0.84} \\
4                  & 20.40±0.11 & 24.18±0.15          & 21.77±1.25 & \textbf{59.84±0.03} & 33.11±2.57          & 23.85±3.06 & 20.33±0.14          & 52.42±0.70          \\
5                  & 25.83±0.26 & 30.16±1.11          & 28.12±2.68 & \textbf{70.32±0.29} & 35.61±3.29          & 26.72±0.46 & 25.79±0.57          & 67.24±0.75          \\
6                  & 47.46±0.77 & 39.98±0.80          & 51.55±3.40 & 62.80±0.27          & 56.01±1.21          & 47.64±2.18 & 48.38±0.9           & \textbf{66.39±0.48} \\
7                  & 22.78±0.75 & 21.00±0.86          & 24.01±1.23 & 31.85±1.18          & 21.73±1.32          & 27.69±1.39 & 22.3±0.51           & \textbf{42.97±0.32} \\
8                  & 24.98±0.15 & 30.74±0.44          & 28.37±2.63 & 68.02±0.52          & 51.88±1.52          & 29.96±1.36 & 25.42±0.04          & \textbf{70.27±0.94} \\
9                  & 19.77±0.05 & 19.60±0.94          & 20.07±0.13 & 27.65±0.08          & 21.38±1.02          & 20.68±0.3  & 19.51±0.08          & \textbf{36.26±0.31} \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 28.96±0.34 & 30.96±0.88          & 31.52±1.81 & 54.67±0.97          & 38.86±2.01          & 30.67±1.07 & 29.46±0.33          & \textbf{58.24±0.57} \\ \hline
\textbf{Fashion}   & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
T-shirt            & 20.80±0.12 & 18.63±0.33          & 21.21±0.47 & 20.77±0.04          & 20.21±0.37          & 20.95±0.7  & 20.91±0.06          & \textbf{21.58±0.01} \\
Trouser            & 28.67±0.96 & \textbf{49.32±2.22} & 30.74±1.75 & 42.95±0.27          & 50.60±0.92          & 28.45±0.95 & 25.29±0.67          & 42.33±0.94          \\
Pullover           & 18.81±0.14 & 18.18±0.00          & 19.30±0.19 & 21.36±0.04          & 20.24±0.09          & 18.95±0.12 & 18.87±0.09          & \textbf{21.41±0.02} \\
Dress              & 23.18±0.39 & 24.71±1.20          & 24.31±1.04 & \textbf{27.72±0.34} & 25.79±0.63          & 24.10±0.72 & 24.65±0.41          & 27.40±0.34          \\
Coat               & 20.33±0.01 & 18.57±0.08          & 20.79±0.09 & 20.91±0.04          & 20.81±0.47          & 20.29±0.07 & 20.76±0.05          & \textbf{21.54±0.09} \\
Sandal             & 55.15±1.43 & \textbf{56.79±0.65} & 51.09±3.00 & 45.45±0.81          & 47.93±0.44          & 53.68±2.73 & 53.59±0.81          & 42.98±0.51          \\
Shirt              & 19.05±0.08 & 18.21±0.00          & 18.87±0.08 & \textbf{19.72±0.03} & 19.45±0.18          & 19.18±0.06 & 19.20±0.06          & 19.65±0.02          \\
Sneaker            & 26.77±0.19 & \textbf{29.03±1.76} & 25.52±1.11 & 25.16±0.02          & 24.31±0.65          & 25.46±1.18 & 25.85±0.12          & 25.04±0.26          \\
Bag                & 70.06±0.78 & 52.57±0.45          & 71.56±1.23 & 63.09±0.68          & 65.35±1.10          & 71.45±0.17 & \textbf{74.18±0.42} & 66.85±1.46          \\
Ankle boot         & 38.82±0.83 & 39.53±0.32          & 40.62±0.96 & 32.62±0.27          & 36.98±0.75          & 39.36±0.65 & \textbf{43.88±0.83} & 37.85±0.92          \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 32.16±0.49 & 32.55±0.70          & 32.4±0.99  & 31.97±0.25          & \textbf{33.17±0.56} & 32.19±0.73 & 32.72±0.35          & 32.66±0.46          \\ \hline
\textbf{Kuzushiji} & AE         & SAE                 & DAE        & VAE                 & MemAE               & MNAD       & SSPCAB              & Ours                \\
U+304A             & 28.78±0.06 & 28.17±1.92          & 28.39±0.55 & 37.54±0.39          & 37.54±0.39          & 29.01±0.23 & 28.91±0.07          & \textbf{43.97±0.87} \\
U+304D             & 19.26±0.07 & 19.41±0.08          & 19.03±0.13 & 25.24±0.17          & 25.24±0.17          & 19.25±0.06 & 19.18±0.02          & \textbf{27.17±0.17} \\
U+3059             & 18.40±0.03 & 18.88±0.06          & 18.44±0.06 & 21.09±0.24          & 21.09±0.24          & 18.35±0.03 & 18.37±0.03          & \textbf{21.99±0.02} \\
U+3064             & 26.73±0.12 & 29.35±0.70          & 27.07±0.09 & 36.21±0.11          & 36.21±0.11          & 27.63±0.07 & 27.27±0.05          & \textbf{39.50±0.86} \\
U+306A             & 21.44±0.07 & 22.72±0.28          & 22.47±0.43 & 42.02±0.96          & 42.02±0.96          & 21.08±0.76 & 21.65±0.03          & \textbf{49.20±0.12} \\
U+306F             & 18.18±0.00 & 18.19±0.00          & 18.19±0.00 & 20.10±0.11          & 20.10±0.11          & 18.18±0.00 & 18.18±0.00          & \textbf{22.45±0.40} \\
U+307E             & 18.18±0.00 & 18.19±0.00          & 18.22±0.02 & 24.78±0.23          & 24.78±0.23          & 18.20±0.01 & 18.21±0.02          & \textbf{27.61±0.15} \\
U+3084             & 36.14±0.01 & 36.89±1.14          & 36.68±0.67 & 54.85±1.17          & 54.85±1.17          & 36.73±0.69 & 36.46±0.02          & \textbf{57.92±0.19} \\
U+308C             & 22.60±0.01 & 21.85±0.27          & 21.84±0.23 & 26.44±0.10          & 26.44±0.10          & 23.60±0.23 & 22.10±0.34          & \textbf{30.29±1.05} \\
U+3092             & 24.22±0.24 & 22.87±0.50          & 24.26±0.30 & 35.46±0.07          & 35.46±0.07          & 24.26±0.01 & 24.04±0.13          & \textbf{39.02±0.63} \\
\rowcolor[HTML]{EFEFEF} 
AVG                & 23.39±0.06 & 23.65±0.5           & 23.46±0.25 & 32.37±0.35          & 32.37±0.35          & 23.63±0.21 & 23.44±0.07          & \textbf{35.91±0.45} \\ \hline
\end{tabular}}
\end{table}


\subsection{Experimental Setup}
\label{sec: Setup}

\paragraph{Datasets.}

The following three public benchmarks were used for the experiments in this paper, namely MNIST\citep{deng2012mnist}, Fashion\citep{xiao2017fashion}, and Kuzushiji\citep{DBLP:journals/corr/abs-1812-01718}. In order to make the dataset conform to the UAD setting and highlight OGP, they were all preprocessed under UNSCND settings (\cref{sec: OGP Problem}), i.e., remove-one-class-out (ROCO) protocol as described below. For all classes in the dataset, we select one class to be removed from the training set and label that class as positive in the test set. For the remaining classes, we removed their label information from the training set and labeled them uniformly as negative in the test set. Note that the ROCO preprocessing is different from the USCND settings (\cref{sec: OGP Problem}) of previous research \citep{gong2019memorizing,park2020learning,ristea2022self}, but rather the UNSCND condition mentioned in \cref{sec: OGP Problem}, which is in line with the latest experimental setup recommendation from paper\citep{DBLP:journals/corr/abs-1908-04388}. Also, this is consistent with the purpose of highlighting the OGP in UAD, i.e., the UNSCND condition of $\{\xi=9,Y=\emptyset\}$.


\paragraph{Evaluation.}
AUROC is commonly used as an assessment criterion for anomaly detection, however, this is overly optimistic on unbalanced data\citep{movahedi2020limitations}. We add results on the AUPRC and F1-Score evaluation criteria to examine the full range of model performance. F1-Score results use the best one after traversal threshold. Vanilla Autoencoder(AE)\citep{kramer1991nonlinear}, Sparse Autoencoder(SAE)\citep{ng2011sparse}, Denoising Autoencoder(DAE)\citep{vincent2008extracting}, Variational Autoencoder(VAE)\citep{kingma2013auto}, MemAE\citep{gong2019memorizing}, MNAD\citep{park2020learning}, and SSPCAB\citep{ristea2022self} were used as comparative baselines.

\paragraph{Implementation.}
To facilitate the description, we make the following notation convention. FC(a,b) denotes the Linear-BN-LeakyReLU block, where a and b are the number of input and output channels. We use the same autoencoder backbone in all experiments, i.e., FC1(784,512), FC2(512,256), FC3(256,128), FC4(128,256), FC5(256,512), FC6(512,784). Note that since MNAD requires a residual structure at the bottleneck, the first layer of the MNAD decoder is twice as large as the other models, namely: FC4(128+128,256). The memory capacity of the three datasets MNIST, Fashion, and Kuzushiji is 8, 10, and 10 in order, considering that the latter two have more complex data patterns. The experiments used a batch size of 256, an optimizer Adam with a learning rate of 1e-3, multi-round memory with rounds of 8, and an early stop mechanism with the patience of 10. The validation set split ratio was 0.1 and the split random seeds were fixed with 2022. The experimental results were averaged over three runs. All comparison experiments follow the principle of absolute fair comparison(PAFC), i.e., all experiments strictly use the same training set, test set, validation set, and backbone.

% Please add the following required packages to your document preamble:
% \usepackage{multirow}
% Please add the following required packages to your document preamble:
% \usepackage{multirow}


\subsection{Results}
\label{sec: Results}
We designed a total of 30 experiments using 3 public benchmarks and conducted 8 parallel comparisons at 3 different evaluation metrics based on PAFC. Our average results achieve the leading performance as shown in \cref{tab:main,tab:main2,tab:main3}. The best performance is marked in bold.

% \begin{figure}[t]
%   \centering
%   \includegraphics[width=0.96\linewidth]{pictures/fig6.png}
%   \caption{From top to bottom are samples of the MNIST,Fashion-MNIST, and CIFAR10 dataset.}
%   \label{fig:dataset}
% \end{figure}


% Please add the following required packages to your document preamble:
% \usepackage{booktabs}
% \begin{table}[t]
% \caption{The average results on AUROC.}
% \label{tab:avg}
% \centering
% \begin{tabular}{@{}cccc@{}}
% \toprule
% \textbf{} & \textbf{MNIST}      & \textbf{Fashion}    & \textbf{Kuzushiji}  \\ \midrule
% AE        & 63.20±0.52          & 66.92±0.51          & 55.90±0.13          \\
% VAE       & 85.12±0.42          & 70.61±0.24          & 72.57±0.29          \\
% MemAE     & 73.33±2.20          & 69.66±0.80          & 66.09±0.52          \\
% MNAD      & 66.77±1.53          & 66.93±1.23          & 56.36±0.35          \\
% SSPCAB    & 63.56±0.41          & 66.75±0.42          & 56.07±0.19          \\ \midrule
% Ours      & \textbf{87.26±0.55} & \textbf{71.12±0.29} & \textbf{76.50±0.42} \\ \bottomrule
% \end{tabular}
% \end{table}

\subsection{Analysis}
\label{sec: Analysis}
\paragraph{The Distinguishability of Memory}
We normalized the reconstruction errors of the model on normal and anomaly data separately for better comparison and depicted their KDE distribution as shown in \cref{fig:show,fig:diff}. It can be seen that the vanilla autoencoder (AE) suffers from OGP when faced with the dual challenge of the test set and training set, while the AE equipped with memory can better distinguish between normal and anomaly. The ultimate goal of anomaly detection is to score the abnormalities of a sample, which comes directly from reconstruction errors. Therefore a distinguishable model has a sharper ability to score abnormalities.

\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/diff.png}
  \caption{The distinguishability of memory.}
  \label{fig:diff}
\end{figure}

\paragraph{The User-friendliness of Memory}
Notice that VAE also has good performance, but two more points need to be highlighted. First, the results of VAE come from a fine grid search and careful tuning of the parameters, while the memorizer just uses the default parameters without any deliberate tuning. Second, as described in the paragraph above the contributions section(\cref{sec:intro}), VAE models face posterior collapse problems if it is not well-tuned, as shown in \cref{fig:collp}, whereas Memory has no such concerns.

\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/collp.png}
  \caption{Posterior collapse of VAE.}
  \label{fig:collp}
\end{figure}

\paragraph{The Tightness of Memory}
As described in \cref{sec:intro}, the model can both generalize well to the normal instance and fail to generalize anomalies under ideal assumptions, which require the model to learn tight bounds on the data patterns. As shown in \cref{fig:show}, the model was never exposed to the number 2 during training in the MNIST ROCO-2 experiment, but AE still reconstructed it well. This phenomenon indicates that AE does not learn the tight boundaries of patterns, but only one-sidedly learns the generalized constant mapping. In contrast, the AE that used the memory mechanism memorized the pattern in the training phase and followed the known patterns for the reconstruction of the unknown category in the testing phase.

\paragraph{The Diversity of Memory}
In order to generalize data with limited memory capacity and learn patterns with tight boundaries, AE equipped with memory extract invariant features under the same class of data, i.e., Personalization-ignored Common Pattern Extraction (PCPE) process as shown in \cref{fig:show}. PCPE helps to learn the common features of the same pattern and ignore the semantic redundant features such as: starting position, stopping position, etc., so as to better establish the tight boundaries of the pattern. However, PCPE does not mean that memory let the model lose the intra- and inter-pattern diversity of the reconstruction as shown in \cref{fig:diver}. We generated the new representation by generating random numbers in the range $[0,1]$ to simulate the memory combination coefficients, and after going through the decoder we can see that the diversity between patterns is guaranteed.

\begin{table}[t]
\caption{The ablation results.}
\label{tab:ablation}
\centering
\resizebox{\linewidth}{!}{
\begin{tabular}{@{}ccccccc@{}}
\toprule
\textbf{Add}              & \textbf{Softmax}          & \textbf{Round}            & \textbf{Sharing}          & \textbf{AUC} & \textbf{AP} & \textbf{F1} \\ \midrule
                          & \checkmark & \checkmark & \checkmark & 52.61        & 9.80        & 21.07       \\
\checkmark &                           & \checkmark & \checkmark & 89.67        & 46.87       & 49.25       \\
\checkmark & \checkmark &                           & \checkmark & 88.98        & 46.07       & 48.29       \\
\checkmark & \checkmark & \checkmark &                           & 88.82        & 45.55       & 47.43       \\ \midrule
\checkmark & \checkmark & \checkmark & \checkmark & 89.88        & 48.04       & 49.58       \\ \bottomrule
\end{tabular}}
\end{table}

\begin{figure}[t]
  \centering
  \includegraphics[width=0.96\linewidth]{pictures/diver.png}
  \caption{The diversity of memory.}
  \label{fig:diver}
\end{figure}

\subsection{Ablation Study}
\label{sec: Ablation}
To investigate the role of each component of the memory mechanism, we conducted qualitative ablation experiments to explore the effects of the add operation, the softmax operation, the multi-round memory, and the sharing mapping on the model performance, respectively. Ablation experiments were carried out in a randomly selected Fashion under ROCO-6. The ablation results in \cref{tab:ablation} illustrate that all four components mentioned above play an effective role in the memory mechanism to varying degrees.
% Please add the following required packages to your document preamble:
% \usepackage{booktabs}




% \subsection{Limitations and Open Problem}
% It can be noticed that the model performance decreases as the complexity of the dataset pattern increase, which mainly stems from the complexity endogenous to the data as shown in \cref{fig:dataset}. The handwritten figures are line grayscale images, and the patterns are easier to extract for the same class of figures, and the differences are more obvious for different classes of figures. Most of the models can get good performance scores. However, Fashion-MNIST data is daily wearing grayscale image data, with smaller differences between some categories and more complex common patterns within the same category. The CIFAR10 dataset is RGB color images with backgrounds, and each category has extremely complex and difficult-to-generalize common patterns, which brings out the OGP to the extreme. The single image of CIFAR10 contains rich information, and the semantic patterns extracted by the model in the unlabeled case are only at the low level, such as edges, corners, textures, etc., which results in all of the methods not achieving high AUROC even for the best-performing ones. In other words, CIFAR10 has a large number of common features with semantic redundancy, such as sky, hair, leaves, etc., which greatly blur the pattern boundaries under a label-free scenario. This situation may be improved if weak supervision or other agent tasks with penalty terms are introduced, which is an open problem for future work.

\subsection{Sensitivity Analysis}
\label{sec: Sensitivity}
The memory mechanism involves two hyperparameters, the number of memory rounds $R$ and the memory capacity $N$. We simply fixed $R=8$ in all experiments and $N=10$ in all experiments except for the MNIST capacity $N=8$. The original intention of this setting was to expand the memory capacity because we thought that Fashion and Kuzushiji have more complex data patterns compared to MNIST. To understand in detail the effect of different parameters on model performance, we randomly selected MNIST ROCO-4 experiments for sensitivity analysis as shown in \cref{sec: Sensitivity}. We did eight sets of experiments in the range of $[2,16]$ at intervals of $2$ for memory capacity $N$ and memory round number $R$. Their AUROC scores (\cref{fig:Sensitivity}) are represented by AUC(N) [Blue] and AUC(R) [Orange], respectively. It is easy to find that the memory capacity decreases the model performance when it is too small ($N=2$), and the model performance changes relatedly as the capacity increases and reaches the optimum at a particular capacity ($N=14$). In contrast, changes in the number of rounds $R$ have less impact on AUROC, and the model performance remains stable.


\begin{figure}[t]
  \centering
  \includegraphics[width=0.70\linewidth]{pictures/sensity.png}
  \caption{The sensitivity of memory.}
  \label{fig:Sensitivity}
\end{figure}

\section{Conclusions}
\label{sec: Conclusions}
We proposed a memory mechanism for UAD to address the dual challenges of label-free multi-pattern and semantic-level anomalies, which can be plug-and-play as a module for existing models without adding penalty terms. The Memorizer model equipped with multi-round memory can effectively alleviate the OGP in UAD and allow the models to report the unknowns truthfully. 

\section{Future Works}
\label{sec: Future Works}
The OGP proposed in this paper has significant implications for the study of real industrial production environment deployment in the future. Existing UAD methods need to train multiple models separately for multiple patterns in practical applications, e.g., cup anomaly detection model for cups, nail anomaly detection model for nails, and box anomaly detection model for boxes, which is a model flooding dilemma. The formulation of the OGP clarifies the model flooding dilemma and opens up a new research direction by proposing the UNSCND condition. In future work, the theorization and application of memory mechanisms are worthy of continued in-depth research. Further proof and derivation of the existence and approximation principles of tight bounds for unlabeled multi-pattern data are needed on the theoretical side. In terms of applications, the combination of memory mechanisms with continuous learning, domain generalization, and generative networks can be explored, which are all anticipated works.


% \begin{contributions} % will be removed in pdf for initial submission 
% 					  % (without ‘accepted’ option in \documentclass)
%                       % so you can already fill it to test with the
%                       % ‘accepted’ class option
%     Briefly list author contributions. 
%     This is a nice way of making clear who did what and to give proper credit.
%     This section is optional.

%     H.~Q.~Bovik conceived the idea and wrote the paper.
%     Coauthor One created the code.
%     Coauthor Two created the figures.
% \end{contributions}

\begin{acknowledgements} % will be removed in pdf for initial submission,
						 % (without ‘accepted’ option in \documentclass)
                         % so you can already fill it to test with the
                         % ‘accepted’ class option
     This work is supported by the National Key Research and Development Program of China No.2018YFC2002603.
\end{acknowledgements}

% References
\bibliography{uai2023-template}
\end{document}
