\subsection{Preliminaries}
Given a corpus ${\cal D}=\{(\x_i, c_i, \y_i)\}_{i=1}^N$, where $N$ is the number of message-response pairs, $\x_i=x_{i1}\,x_{i2}\,\ldots\,x_{i|{\x_i}|}$ is a message with $|{\x_i}|$ characters or words, and $c_i$ denotes the attributes associated with the response $\y_i=y_{i1}\,y_{i2}\,\ldots\,y_{i|{\y_i}|}$, the objective is to learn the conditional probability $p(\y|\x, c)$ from the corpus. Here, the attribute sequence $c=l_1, \ldots, l_K$ enforces the attribute $l_i$ at the $i$-th stage, drawn from $K$ pre-defined aspects, e.g., an emotion such as happy or sad~\cite{DBLP:conf/naacl/JiaoYKL19}, or a tone such as Declarative, Interrogative, or Imperative~\cite{DBLP:conf/acl/HuangKGx18}. Once $p(\y|\x, c)$ has been learned, given a message $\x$ and a specific attribute $c$, we generate a response $\y$ accordingly.

\section{Preliminaries}
Before delving into the details, we first define the tasks of RTE and ZeroRTE. Next, we explore whether the two agents can provide accurate and reasonable feedback in the zero-shot setting. To keep the notation consistent throughout the paper, we list the important symbols in Table~\ref{tab:notation} in the appendix.
\subsection{Task Definition}
\paragraph{RTE} Given a dataset $\mathcal{D} = \{(s_i, t_i)\}_{i=1}^{|\mathcal{D}|}$, where $s_i\in\mathcal{S}$ is the $i$-th input sentence and $t_i\in \mathcal{T}$ is the corresponding output triplet, Relation Triplet Extraction (RTE) aims to extract a relation triplet $t \in \mathcal{T}$ of the form $t = (e^{head}, e^{tail}, r)$ from a sentence $s \in \mathcal{S}$. Here, the head entity $e^{head}$ and the tail entity $e^{tail}$ are token spans or word sequences referring to real-world entities, and the relation $r$ belongs to $\mathcal{R}$, a predefined set of relations between the head and tail entities.
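
As a purely illustrative example (the sentence and relation label below are hypothetical and not drawn from any dataset used in this paper), consider $s = $ ``Marie Curie was born in Warsaw.'' Under the definition above, a plausible target triplet would be
\[
t = (e^{head}, e^{tail}, r) = (\mbox{Marie Curie},\ \mbox{Warsaw},\ \mbox{place of birth}),
\]
where both entities are token spans of $s$ and $r \in \mathcal{R}$.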

\paragraph{ZeroRTE} The objective of ZeroRTE~\citep{chia-etal-2022-relationprompt} is to leverage the knowledge from the seen dataset $\mathcal{D}^s$ and generalize to the unseen dataset $\mathcal{D}^u$. Here, $\mathcal{D}^s$ and $\mathcal{D}^u$ are the training and test datasets, respectively, both derived from the original full dataset $\mathcal{D}$. The relation sets for training and testing are denoted as $\mathcal{R}^s = \{r^s_1, r^s_2, \ldots, r^s_n\}$ and $\mathcal{R}^u = \{r^u_1, r^u_2, \ldots, r^u_m\}$, where $n = |\mathcal{R}^s|$ and $m = |\mathcal{R}^u|$ are their respective sizes. Note that ZeroRTE does have training data $\mathcal{D}^s$; ``zero-shot'' refers to the fact that the relation sets for training and testing are disjoint, i.e., $\mathcal{R}^s \cap \mathcal{R}^u = \emptyset$.
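
To make the disjointness condition concrete, consider a small hypothetical split (the relation labels are assumptions chosen for illustration only, not the actual splits used in our experiments):
\[
\mathcal{R}^s = \{\mbox{place of birth},\ \mbox{employer}\}, \qquad
\mathcal{R}^u = \{\mbox{composer}\}, \qquad
\mathcal{R}^s \cap \mathcal{R}^u = \emptyset,
\]
so that $n = 2$ and $m = 1$. In this sketch, every triplet in $\mathcal{D}^s$ carries a relation from $\mathcal{R}^s$, while at test time the model must extract triplets whose relation is $\mbox{composer}$ without having seen any training sentence labeled with it.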

