{
  "title": "Realistic Simulation of Social Media Users",
  "turns": [
    {
      "turn": 1,
      "user": "From now on, you're my AI research topic guide assistant. Remember that you’re someone who has published many papers at NLP and data mining conferences like WWW, WSDM, KDD, SIGIR, CIKM, ACL, EMNLP, and NAACL. What I want to do is to build a very realistic SNS user simulator that can measure \"how similarly it behaves to real users,\" and propose a framework that properly evaluates it!\n\nSo far, the idea I’ve considered is a YouTube video-watching simulation task as follows\n\nTask: The LLM takes the user’s profile, video-watching goals, and past video/action sequences as input, and generates a new video/action sequence\n\nEvaluation: Real users directly rate on a 1~5 scale how well (1) the video selections and (2) various actions (likes, comments, etc.) generated by the simulator match their own tendencies, and we train on these ratings to build a model that automates the overall evaluation process\n\nWhat other domains or tasks could there be beyond this? Please recommend as many diverse ones as possible",
      "assistant": "Hello! Drawing on years of publishing at venues like WWW, KDD, and ACL, I’ll recommend additional domains and tasks to enrich your proposed research topic.\n\nYour proposed study on a **'highly realistic SNS user simulator and evaluation framework'** is fascinating and timely. In particular, an evaluation framework that directly measures how much an LLM agent’s behavior resembles that of real users through their ratings, and then uses those ratings to train the model, would be a key contribution (Contribution).\n\nThe YouTube viewing simulation is a great starting point. From there, you can explore other domains where user behavior is richer and more complex.\n\n* * *\n\nRecommended domains and tasks\n------------\n\n### 1\\. E-commerce (E-commerce) purchase journey simulation 🛍️\n\nBeyond the simple act of buying, simulate the **end-to-end exploration and decision-making process** leading up to a purchase. This can be very useful for offline evaluation of recommender systems or testing new UI/UX.\n\n*   **Task:** Given a user profile (demographics, past purchases) and a shopping goal (e.g., \"buy a friend's birthday gift,\" \"find new running shoes\"), the LLM generates a sequence of actions leading to **searching, product clicks, add/remove from cart, reading reviews, and final purchase**.\n*   **Core behaviors:**\n    *   **Query generation and refinement:** A search that becomes more specific such as \"sneakers\" -> \"Nike running shoes\" -> \"Nike Pegasus 41\"\n    *   **Product comparison and cart use:** Putting multiple items in the cart to compare, or wishlisting for later purchase\n    *   **Review browsing:** Selectively reading positive/negative reviews and having them influence the decision\n*   **Evaluation:** Ask real users to rate on a 1~5 scale, \"How well does this process reflect how I search and buy?\" In addition, whether the simulation **ultimately purchases a product that matches the shopping goal** is an important metric.\n\n### 2\\. Social media feed interaction and content creation simulation ✍️\n\nBeyond passive consumption, simulate **actively creating content and interacting with other users**. This can be used for studying misinformation spread, modeling online opinion formation, and social bot detection.\n\n*   **Task:** Given a user profile (political leaning, interests, following list) and the current feed (timeline), the LLM either **(1) writes a new post (tweet, post)** or **(2) performs actions like 'like,' 'share,' and 'comment' on other posts in the feed**.\n*   **Core behaviors:**\n    *   **Content creation:** Generate text/image captions that reflect opinions or reactions aligned with the profile on a given issue or trend\n    *   **Interaction selection:** Decide which posts to react to among many in the feed\n    *   **Commenting:** Produce various types of comments—support, rebuttal, questions—matched to the post\n*   **Evaluation:** Show generated posts or comments to real users and have them rate \"How similar is this to your thoughts or typical behavior?\" Further, assess realism by comparing against a model that predicts how many **'likes' or 'shares' the content would be expected to get** on real social media. (Turing Test style)\n\n### 3\\. Online community/forum discussion participation simulation 🗣️\n\nBeyond simple information consumption or socializing, simulate **forming logical opinions and debating with others** on a specific topic.\n\n*   **Task:** Given a discussion thread (Thread) from a specific online community (e.g., Reddit, a domestic topic forum) and the user’s persona (e.g., 'expert', 'beginner', 'critical participant'), the LLM generates **new opinions or questions** appropriate to the context and participates in the conversation.\n*   **Core behaviors:**\n    *   **Claim formation and justification:** Generate logical arguments aligned with the persona and sometimes cite external information\n    *   **Opinion change:** Change the initial stance in response to persuasive arguments from others\n    *   **Use of community 'memes (Meme)' or slang:** Appropriately using expressions and culture unique to that community\n*   **Evaluation:** Community users assess whether the generated comment \"appropriately contributes to the flow of discussion\" and \"consistently maintains the given persona.\" You can also validate realism by **predicting the expected upvote/downvote counts** for the comment.\n\n* * *\n\nThese domains demand a deeper level of user modeling that goes beyond 'what they view' to \\*\\*'why they act that way,' 'how they interact'\\*\\*. Therefore, while evaluating the simulator’s realism is more challenging, it’s also an opportunity to greatly increase the novelty and contribution of the work.\n\nI believe your proposed evaluation framework is a powerful methodology that can be applied consistently across all of the above domains. Wishing you great success with the research!"
    }
  ]
}