\section{Data description}

% This section provides all the details about the data, in particular:
% \begin{itemize}
%     \item The origin of data. If you start from existing datasets, provide all the details and references.
%     \item If you annotate data, provide all details about the guidelines, the number and expertise of annotators and the annotation quality.
% \end{itemize}

% \subsection{Origin of data}

% \subsection{Annotation details \textit{(if needed)}}

% \subsection{Data format}
% Provides all details about the data format. Examples of data can be helpful.

% \subsection{Example of prompts used for zero or/and few shots}

% \subsection{Detailed data statistics}
% Provide all statistical information about your data: the number of examples,
% the training/test split (if it exists), the number of examples for each language (if the dataset is multi-lingual), ...

%\subsection{Origin of data}
The \pro dataset\footnote{The full dataset can be downloaded at \url{https://huggingface.co/datasets/emensa/proverbIT}.}  is composed of $100$ multi-choice questions, each regarding the completion of a specific Italian proverb. To create the dataset, we started from an initial set of $200$ common Italian proverbs~\cite{200_proverbi} from which we selected $100$ of the most commonly used. This process was carried out by three of the authors, which are all native Italian speakers. Each proverb was then manually split into its \textit{beginning} and its \textit{ending}, with the point of division determined to maintain the proverb's semantic coherence in the initial part while allowing for a clear, unambiguous completion.
For each proverb, four distinct incorrect alternative endings were manually created, leveraging the following constructive rationale: 
\begin{itemize}\label{options_list}
    \item \textbf{A} is an ending that has similar sounds to the original continuation, often with an absurd/nonsensical meaning. 
    \item \textbf{B} is a non assonant literal synonym of the original ending. 
    \item \textbf{C} is the inverse of the original proverb ending, trying to maintain the assonance when possible.
    \item \textbf{D} is a tautological/trivial ending of the proverb, with no assonance.
\end{itemize}
% 
For sake of clarity we provide an example in English for each of the aforementioned continuations. Completions for the proverb \sans{Actions speak... louder than words} could be:
\begin{itemize}[label={}]
    \item A) \sans{prouder than swords}
    \item B) \sans{at higher volume compared to speech}
    \item C) \sans{quieter than words}
    \item D) \sans{when they do}
\end{itemize}
As this example shows, the synonym ending is not built on the figurative meaning of the proverb, but it is the literal synonym of the original ending (e.g., \sans{at higher volume compared to speech} rather than \sans{beyond what words can say}).
% % 
This design was adopted to ensure that models cannot simply rely on surface-level syntactic patterns but must engage in deeper semantic and contextual reasoning to identify the absence of the correct completion. 



\subsection{Data format}
The datasetis organized in a comma-separated values format. Each line contains one \textit{complete proverb}, its \textit{beginning} and \textit{ending} splitted, and the four handcrafted incorrectalternatives.



\subsection{Prompts}
% Given each proverb in \pro, we can then fill a simple prompt template that can be submitted to the models:

Two different prompts were devised for the \textit{completion} and \textit{multi-choice} tasks. For the completion baseline, we adopted a simple prompt that requires the model to directly complete a proverb given its beginning:
\begin{styledtext}{Completion Prompt Template (translated)}%
\small
Complete the proverb exactly:\\

[\textit{Proverb beginning}]...\\

Reply with the ending only, do not add further comments.
\end{styledtext}

On the multi-choice prompt we specify that the proverb must be completed \textit{exactly}. Since all provided endings are incorrect, we expect models to always answer \sans{E) None of the other answers}: 
\begin{styledtext}{Multi-Choice Prompt Template (translated)}% \label{tab:correct_none_prompt}
\small
Complete the proverb exactly by choosing from the following options (which have no typing errors) indicating only the letter.\\

    [\textit{Proverb beginning}]...\\
    A) ...[\textit{Assonant ending]}\\
    B) ...[\textit{Synonym ending}]\\
    C) ...[\textit{Inverse ending}]\\
    D) ...[\textit{Trivial ending}]\\
    E) None of the other answers\\
    
    Do not add comments, the possible answers are only A, B, C, D, E.
\end{styledtext}

For sake of clarity we provide an Italian example [with translation] from the actual dataset.

\begin{styledtext}{Example of proverb from the dataset}% 
A buon intenditor,... [To a wise man] \\
A) ...foche canore [singing seals] \\
B) ...zero chiacchiere [zero chatter] \\
C) ...molte parole [many words] \\ 
D) ...è chiaro tutto [everything is clear]\\ 
E) Nessuna delle altre risposte [None of the other answers]
\end{styledtext}


%\subsection{Detailed data statistics}
%\pro dataset is completely Italian and a zero-shot prompting strategy was employed for all tasks, making it unnecessary to split the dataset into training and test sets.