\section{Related Work}

\subsection{Large Language Models in Mental Health}

Large language models (LLMs) have increasingly been applied to mental health-related tasks, from symptom detection to conversational support.
A comprehensive review by~\cite{Guo2024MentalHealthLLM} surveyed the contributions of LLMs to mental health applications and identified key challenges in reliability, privacy, and interpretability.
For instance,\,\cite{Sezgin2023PPDStudy} evaluated the clinical accuracy of ChatGPT and Bard in answering postpartum depression (PPD) questions, demonstrating that while LLMs can outperform traditional search engines in correctness, they often lack affective sensitivity.
Similarly, a review of LLMs in psychiatry emphasized their potential to assist in therapy and patient education but noted their limited performance on nuanced emotional reasoning~\cite{Omar2024PsychiatryLLM}.

In reproductive psychiatry,\,\cite{AlSaad2025ReproductivePsychiatry} explored multimodal LLMs that integrate textual and physiological data to support diagnosis and patient communication, while~\cite{GarciaMendez2025RealTimePPD} proposed a real-time, explainable LLM-based framework for PPD detection that combines NLP and machine learning models.
Collectively, these studies reveal that while LLMs show promise in psychological support, they are predominantly designed for \textbf{clinical screening or prediction}, with limited attention to cultural and linguistic adaptation, an essential factor for building empathy and trust in maternal mental health scenarios.

\subsection{Fairness and Inclusion in Cross-Lingual NLP}

Fairness in NLP has long been associated with addressing \textbf{low-resource and cross-lingual disparities}.
Recent efforts in Tibetan include the creation of the TIB-STC corpus~\cite{SunShine2025TIBSTC}, the T-LLaMA model~\cite{Lv2025TLLaMA}, and the TiBERT pre-trained model~\cite{liu2022tibert}, which collectively enhance representation for this under-resourced linguistic community.
Complementary advances, such as prompt-based Tibetan text classification~\cite{Wang2023PromptTibetan} and the CUTE multilingual dataset~\cite{Zhuang2025CUTE}, further demonstrate the growing ecosystem for Tibetan NLP.

Parallel developments in \textbf{Moroccan Arabic (Darija)} include the MADAR corpus~\cite{Bouamor2018MADAR}, the Darija Open Dataset v2~\cite{Belkadi2024DarijaOpenV2}, DarijaBERT~\cite{Gaanoun2024DarijaBERT}, and the LoResLM / Atlas-Chat framework~\cite{Shang2024AtlasChat}, which focus on bridging dialectal variations across Arabic-speaking regions.
In \textbf{African language NLP}, resources such as Masakha-NER~\cite{Adelani2021MasakhaNER}, the Lacuna Fund Swahili corpus~\cite{Lacuna2023SwahiliCorpus}, the Kenya NLP Survey~\cite{Mwangi2024KenyaNLP}, and Lanfrica~\cite{Lanfrica2025Repository,Emezue2020LanfricaPaper} have been instrumental in improving linguistic coverage and model fairness across underrepresented communities.

% Despite these substantial advances, most cross-lingual fairness studies focus on \textit{interlanguage} disparities, leaving the \textit{intra-language} diversity problem (variation within a single linguistic system) largely unaddressed.

Despite these advances, most fairness work is evaluated on cross-language gaps or on standardised language varieties, and less frequently examines \textbf{within-language} pragmatic and cultural variation in high-stakes dialogue. 
% Importantly, many technical ideas developed for \textbf{cross-lingual} inclusion can potentially transfer to \textbf{intra-lingual} settings. 
In this paper, we focus on a practical \textbf{within-language} deployment setting: culturally grounded prompting for major Sinitic dialect communities in postpartum support dialogues, where shared written Chinese coexists with region-specific pragmatics and idiomatic expression.

\subsection{Chinese Dialects and Intra-Linguistic Diversity}

Chinese presents a particularly rich case of \textbf{intra-linguistic cultural diversity}, as its dialects differ not only phonetically but also in pragmatics, emotional expression, and social norms.
For example,\,\cite{Xu2021DialectDiscrimination} demonstrated the challenges of dialect discrimination under low-resource conditions, while~\cite{Zhang2022DialectTTS} proposed a dialect-to-speech generation framework that exposed the burden of adapting standard Mandarin models to dialectal text.
Furthermore,\,\cite{Jiang2024PhonemicAnnotation} introduced a phonemic annotation method for Chinese dialects, highlighting the need for structured linguistic resources.
A comparative analysis by~\cite{Xu2025DialectSpeech} of LLM and self-supervised models for dialectal speech recognition revealed performance degradation as dialectal variance increases.

However, these studies focus primarily on \textbf{linguistic modeling} rather than \textbf{socio-cultural alignment}.
In emotionally sensitive contexts such as maternal counseling, where implicit empathy, local metaphors, and communicative norms play central roles, such internal diversity directly affects user trust and well-being.
Addressing this gap requires models that integrate \textbf{cultural adaptation and linguistic variation}, forming the motivation for our proposed \textbf{\AbbrName (\AbbrFullName)}, which extends LLM fairness research from \textit{cross-lingual} to \textit{intra-linguistic} inclusivity.

Our work is analogous but focuses on intra-linguistic low-resource dialects (Chinese regional varieties) for postpartum support, rather than cross-lingual gaps.

\subsection{Co-Design}

Co-design extends participatory design by foregrounding the ``collective creativity'' of designers and non-designers as equal partners, rather than treating users as passive sources of requirements or testers at the end of development~\cite{Vial2022HumanCenteredDMH}.
In digital mental health, recent reviews conclude that involving service users in design tends to improve acceptability, usability, and engagement, yet co-design is often only partially implemented, under-reported, and constrained by resources and power imbalances, especially with youth and clinically vulnerable groups~\cite{Kilfoy2024UmbrellaCoDesign,Porche2022DigitalCoDesignYouth}.
For chatbots and digital conversational tools, participatory studies further show that user and clinician involvement is critical to align expectations around empathy, transparency, and conversational boundaries, but most deployed systems remain largely expert-driven with limited end-user input~\cite{Houben2023PDForWhom,Casu2024AICbtScopingReview,Hoffman2024AIAttitudesMH}.

In AI, participatory and community-engaged frameworks argue that co-design should reach beyond the interface to problem formulation, data practices, and governance, warning that superficial consultation can reproduce existing harms~\cite{Birhane2022PowerToThePeople}.
Practitioner guidance similarly emphasises sustained partnerships with affected communities when deploying high-stakes AI systems, but current work largely offers high-level process models and rarely treats intra-linguistic cultural diversity (e.g., dialectal variation within one language) as a primary design dimension~\cite{PAI2025GuidanceInclusiveAI}.
More recently, LLM-based multi-agent systems have been proposed as co-design partners that support human ideation and critique, but mainly for generic creative or engineering tasks without culturally grounded roles in sensitive health settings~\cite{Tseng2025LLMMultiAgentCoDesign}.
In contrast, CAMA operationalises co-design as an internal multi-agent governance layer: Psychologist, Linguist, Teacher, Mother, and AI Researcher agents act as always-on ``virtual stakeholders'' that approximate a continuous co-design loop at inference time, complementing, rather than replacing, conventional co-design with postpartum women and clinicians, and explicitly centring intra-linguistic cultural diversity as a core axis of system behaviour.






