[
    {
        "aspect": "Sequence of Events",
        "introduction": "Evaluates the ability to understand and interpret a series of actions or events that occur over a period of time within an image.",
        "guidance": "Create a scene featuring a sequence of events happening in a logical, progressive order. Choose a primary setting where multiple stages of an activity or narrative can be visually displayed, such as a park where children are playing. Arrange the elements so that different stages of the actions are clear and distinguishable; for instance, show a child running, about to jump, and finally mid-air landing. Ensure the character or subjects are consistent across different stages to emphasize a clear progression. Use visual markers like motion lines or position shifts to indicate the passage of time within the scene. Maintain a balance by not overcrowding the image; focus on one to three primary actions to avoid confusion. Pay attention to lighting and shadow consistency to indicate continuous time flow. Avoid disjointed elements that don't contribute to a cohesive narrative, ensuring all parts of the sequence are contextually linked and clear in their transition."
    },
    {
        "aspect": "Predictive Analysis",
        "introduction": "Assesses the capability to predict future actions or outcomes based on current visual information.",
        "guidance": "To incorporate predictive analysis into an image generation prompt, center the scene around a key moment that hints at an impending action or outcome. Ensure the primary elements illustrate a clear cause-and-effect relationship. For instance, depict a character poised to throw a ball, with contextual clues such as body posture, trajectory lines, or an expectant catcher. Balance dynamic poses with environmental elements that reinforce the anticipation, like a tensed crowd or flying dust. Avoid clutter that can obscure the predictive elements, focusing instead on clean lines of action that guide the viewer's eye naturally towards the anticipated future event."
    },
    {
        "aspect": "Cause and Effect",
        "introduction": "Tests the understanding of causal relationships between different elements in an image, such as the result of certain actions.",
        "guidance": "When generating an image prompt incorporating Cause and Effect, clearly describe both the cause and its effect within the scene. Start with the action or event (the cause) and specify the expected outcome (the effect). Ensure that the visual representation of the cause is prominently displayed in the image, with the effect logically and visibly following from it. For instance, if the cause is \"a person kicking a ball,\" the effect could be \"the ball flying through the air toward a goal.\" Balance the elements by ensuring the cause and effect are both in focus, with the causal action more centrally placed and the effect depicted in a way that naturally follows. To avoid common pitfalls, ensure that the connection between cause and effect is intuitively clear and unambiguous, avoiding overly complex or abstract relationships that may confuse the viewer. Use visual cues and context to strengthen the causal link, such as motion blur lines for actions or expressions of surprise/reaction in characters affected by the action."
    },
    {
        "aspect": "Event Progression",
        "introduction": "Investigates how well the model can infer the stages or steps of an ongoing action or process depicted in the image.",
        "guidance": "To represent the aspect of Event Progression visually, create a prompt that clearly illustrates a sequence of stages within a singular frame. Begin by defining the event or process that is taking place. Specify distinct elements or subjects that should visually depict different phases of the action. For example, if the event is \"a flower blossoming,\" ensure elements are included to show the flower at various stages\u2014from bud to full bloom. Balance these elements so that each stage is visually distinct yet flows logically from one to the next. Use positional and size cues to indicate progression, such as placing earlier stages at the forefront or lower part of the image and later stages higher or further back. Avoid cluttering the image with too many stages; instead, select key moments that clearly show the progression. Ensure the background remains consistent to tie the stages together cohesively, and use gentle transitions to emphasize the continuity of the process."
    },
    {
        "aspect": "Temporal Context",
        "introduction": "Measures the ability to place elements within a correct temporal framework, understanding the past, present, or future context of an image.",
        "guidance": "To effectively represent temporal context in your image prompt, ensure you specify the time period you want to depict\u2014such as past, present, or future. Clearly describe the distinct visual elements appropriate for that era, including clothing styles, technology, architectural features, and any relevant cultural markers. For the past, focus on historical accuracy with vintage items and old-fashioned settings; for the present, use contemporary elements; and for the future, incorporate futuristic designs, advanced technology, or imaginative landscapes. Make sure to consistently apply these elements throughout the image to avoid anachronisms or conflicting details. Balance the elements so that the temporal context is immediately recognizable and central to the scene, without overshadowing the main subjects or actions. For common mistakes, avoid mixing elements from different eras unless the prompt is explicitly intended to create a contrast or juxtaposition for artistic purposes."
    },
    {
        "aspect": "Duration Understanding",
        "introduction": "Evaluates the comprehension of time duration and how long an action or event might take place based on visual clues.",
        "guidance": "When creating a prompt that incorporates duration understanding, specify the type of event or action and include visual elements that clearly signal the length of time involved. For instance, if depicting a race, include start and finish lines, motion blur for speed, and maybe a clock or stopwatch to emphasize time passing. If illustrating a long-duration event like a marathon, show runners at different stages with varying expressions of fatigue, a changing landscape, and perhaps an evolving time of day from dawn to dusk indicated by the position of the sun. Include cues like shadows growing longer. Be careful to ensure that all elements work cohesively to convey a clear sense of time progression; avoid conflicting signals, such as both sunny and rainy weather, which might confuse the duration concept. Balance foreground and background elements to reinforce the sense of a timeline: individuals or objects in the foreground can indicate immediate action, while those in the background provide context for past or future stages of the event."
    },
    {
        "aspect": "Object Orientation",
        "introduction": "Assesses the ability to comprehend the orientation and placement of objects in space.",
        "guidance": "Describe the specific orientation of objects within the scene. Specify the angle at which each object is positioned (e.g., tilted, upside down, lying flat, upright). Indicate spatial relationships and directions, such as an object facing left, right, towards the viewer, or away from the viewer. Clearly define the placement of each object in relation to others to ensure a coherent composition, such as \"a cat lying on its side next to an upright vase facing the viewer.\" Balance the scene by ensuring that objects with different orientations do not conflict or overlap unnaturally. Avoid ambiguous descriptions that leave the orientation or placement unclear, as this can result in illogical or inconsistent imagery."
    },
    {
        "aspect": "Depth Perception",
        "introduction": "Tests the understanding of spatial depth and the relative distances between objects.",
        "guidance": "Create a prompt that includes a series of objects placed at varying distances from the viewer to demonstrate spatial depth. Specify a foreground, midground, and background, with clear descriptions of at least one object in each. For example, depict a large tree with detailed bark texture in the foreground, a smaller bench and flowers in the midground, and a distant, hazy mountain range in the background. Ensure the objects decrease in size and detail as they recede into the background to enhance the perception of depth. Use descriptive terms that emphasize perspective, such as \"close-up,\" \"middle distance,\" and \"far away.\" Avoid placing all elements in a single plane and ensure the foreground objects partially obscure parts of the midground and background to reinforce the layered spatial arrangement."
    },
    {
        "aspect": "Spatial Relationships",
        "introduction": "Measures the ability to understand how different elements in an image relate to each other spatially.",
        "guidance": "Ensure that the prompt specifies the relative positioning and distances between key elements in the scene. Indicate which elements should be placed in close proximity, which should be spaced apart, and the overall arrangement within the frame. For example, if creating a prompt for a cityscape, specify that tall skyscrapers should be grouped together in the center, with smaller buildings and streets radiating outwards, pedestrians walking closely along sidewalks, and cars parked at a distance. Emphasize the spatial hierarchy, such as larger, prominent objects in the foreground and smaller, less detailed objects receding into the background. To maintain balance, avoid overcrowding one area of the image while leaving other parts too sparse. Ensure each element maintains a logical spatial relationship, making sure that overlapping objects follow realistic occlusion rules (e.g., a tree should not obscure an entire building if it's meant to be positioned behind it). This consideration of spatial relationships will ensure a well-organized and coherent composition."
    },
    {
        "aspect": "Geometric Inference",
        "introduction": "Evaluates the capacity to infer geometric properties and shapes from visual information.",
        "guidance": "Ensure the prompt explicitly describes a scene where geometric shapes are a central element. Specify the types of geometric shapes to be included (e.g., circles, triangles, squares) and their arrangement in relation to one another. Emphasize the importance of clear geometric boundaries and consistent perspective. For example, \"a landscape with a large central triangle overlapping two smaller circles on either side, all situated within a square frame.\" Balance complexity by delineating one or two prominent geometric relationships and avoid overcrowding the image with too many shapes. Include descriptors for color differentiation to enhance shape distinction, such as \"a blue triangle against a background of red circles.\" Ensure the prompt avoids ambiguity by being precise about the spatial arrangement and size ratios of the shapes, such as \"a small circle at the top left corner of the square, one-third the size of the triangle.\""
    },
    {
        "aspect": "Positional Awareness",
        "introduction": "Assesses the understanding of the positional context of objects within the image frame.",
        "guidance": "When incorporating positional awareness into an image generation prompt, specify the exact locations of key objects within the image frame. For instance, indicate whether an object should be in the center, near the edges, at the top, bottom, left, or right areas of the image. Ensure clear instructions that describe where each primary and secondary object should be positioned relative to one another, such as \"place a large tree in the center of the image, with a small bench to the right of the tree, and a dog sitting on the bench facing forward.\" To balance elements within the image, consider the visual weight of objects by specifying scaling or alignment, such as \"align three evenly spaced lamp posts along the bottom edge of the image frame.\" Avoid vague terms like \"near\" or \"close to,\" and instead, use precise locations to create a coherent and well-integrated result. For example, \"position a mountain range along the bottom third of the image, with the sun rising above the mountain peaks just slightly off-center to the left.\" This clarity helps prevent common pitfalls like cluttered or unbalanced images."
    },
    {
        "aspect": "Pathfinding",
        "introduction": "Measures the ability to discern possible paths or movement routes for entities within the image.",
        "guidance": "To effectively incorporate the aspect of Pathfinding into an image generation prompt, instruct the AI to create an image that features a clear, discernible pathway such as a winding road, a forest trail, or a series of stepping stones across a stream. The path should integrate seamlessly with the environment, whether it is a cityscape, rural area, or a fantastical setting. Include visible landmarks like signposts, bridges, or arches that guide entities along the path. Ensure the pathway is navigable and visibly connects different parts of the scene, such as starting from the foreground and receding into the background or branching toward notable points of interest. Indicate the presence of entities, like people, animals, or vehicles, using the path to illustrate its function. Maintain a balance between the path and surrounding elements, avoiding excessive clutter that could obscure the route. Emphasize varying path materials and textures (e.g., cobblestones, dirt trails) to enhance visual clarity and interest."
    },
    {
        "aspect": "Symbolic Interpretation",
        "introduction": "Tests the ability to understand and interpret symbolic elements or representations within an image.",
        "guidance": "When crafting the image generation prompt, first decide on a central theme or concept that will be symbolically represented. Choose well-known symbols that align with the chosen theme, such as a dove to represent peace, a broken chain for freedom, or scales for justice. Clearly specify these symbols and how they should be visually integrated into the scene. For example, \"A dove with an olive branch in its beak perched atop a globe, symbolizing world peace.\" Ensure the symbols are prominent and easily identifiable within the composition. Arrange the elements to create a balanced scene, placing the symbolic elements in key positions, such as the center or focal points of the image. Avoid overcrowding the image with too many symbols, which may dilute their impact and lead to misinterpretation. Maintain a clear and coherent visual narrative to guide the viewer's understanding of the symbolic representation. For instance, \"A balanced scale with a heart and brain on either side, set against a backdrop of a serene landscape, symbolizing balance between emotion and intellect.\""
    },
    {
        "aspect": "Metaphorical Understanding",
        "introduction": "Evaluates comprehension of metaphorical or abstract relationships depicted visually.",
        "guidance": "Create an image that visually embodies a metaphorical concept. Choose a primary metaphor, such as \"time is a thief,\" and depict it through specific elements that illustrate the abstract idea\u2014depict a clock with hands transforming into a pair of hands that subtly snatch away objects or moments. Ensure the primary symbolic elements are clearly identifiable, like the clock representing time and the hands representing theft. Balance the scene by placing these symbolic elements in a contextual setting that reinforces the metaphor, such as an environment where moments or important items are visibly being taken or disappearing. Avoid overly literal interpretations or cluttering the image with unrelated elements to maintain focus on the metaphor."
    },
    {
        "aspect": "Logical Deduction",
        "introduction": "Assesses the capacity to make logical inferences based on abstract visual cues.",
        "guidance": "Create a prompt where the image contains multiple abstract objects or symbols in various juxtaposed positions that suggest a relationship or sequence. Ensure there are elements that naturally lead the observer to draw connections or conclusions, such as a series of gears leading to a light bulb, or a flow of water ending in a planted seed. Balance these elements so that there is a clear narrative progression or cause-and-effect relationship, prompting the viewer to deduce one element based on another. Avoid making the visual metaphors too obscure or disconnected, as this can lead to confusion rather than logical inference."
    },
    {
        "aspect": "Conceptual Blending",
        "introduction": "Measures the ability to combine different abstract elements to form a coherent understanding.",
        "guidance": "Create a prompt that asks for an image featuring two or more distinct abstract elements integrated seamlessly. Ensure that each element retains its unique characteristics while contributing to an overarching theme. For instance, you could describe a landscape where organic shapes blend with geometric figures, such as a forest where trees have fractal patterns merging into angular pathways. Pay attention to color harmony and transitions to ensure a coherent blend. Avoid clutter by limiting the number of abstract elements and focusing on their meaningful integration. Describe the spatial arrangement clearly, specifying how each abstract element interacts with the others and the environment to form a unified visual experience."
    },
    {
        "aspect": "Hypothetical Scenarios",
        "introduction": "Evaluates the capability to imagine and reason through hypothetical situations based on abstract visuals.",
        "guidance": "Design a prompt that sets up a fantastical or imaginative situation, clearly describing both the unusual aspects and how the elements might logically interact within this setting. For instance, prompt the generation of an image showing a city floating in the sky, where buildings rest on large platforms suspended by enormous hot air balloons. In the foreground, depict citizens walking on suspended walkways with safety railings. The background should include a vast sky with distant floating cities and the faint outline of the earth's curvature below. Ensure the image balances the surreal elements with logical visual cues such as shadows cast by the platforms, plausible size ratios between the components, and a coherent light source. Avoid disjointed elements that do not blend naturally into the hypothetical scenario, such as random objects that defy the described physical laws or an inconsistent perspective."
    },
    {
        "aspect": "Thematic Analysis",
        "introduction": "Tests the understanding of broader themes or motifs illustrated in an image.",
        "guidance": "When incorporating thematic analysis into an image generation prompt, specify the central theme or motif clearly. Describe how this theme should be visually represented through key elements, characters, settings, and color schemes. For example, if the theme is \"transformation,\" instruct the AI to depict a scene showing the metamorphosis of a butterfly with stages of its life cycle represented in different parts of the image. Ensure the layout balances these elements to highlight the progression and cohesion of the theme. Use specific symbols or visual metaphors related to the theme to deepen the narrative. Avoid vague descriptions or mixing unrelated motifs, as this can lead to a confusing or disjointed image."
    },
    {
        "aspect": "Emotion Recognition",
        "introduction": "Assesses the ability to identify and interpret emotions expressed by subjects in an image.",
        "guidance": "Craft the prompt to explicitly describe the emotions of the subjects in the image. Specify the number of subjects and detail the exact expressions or feelings they should display, such as joy, sadness, anger, surprise, or fear. Ensure the prompt includes contextual elements that explain why the subjects are feeling these emotions, such as setting, interactions, or significant events. Balance the emotional expressions among the subjects to create a coherent overall scene, and avoid vague descriptors like \"looking emotional.\" Instead, opt for precise language, like \"a child laughing with wide eyes and a big smile, an elderly person with tears in their eyes and a downturned mouth, and a couple arguing with furrowed brows and clenched fists.\" This will help the AI generate an image where the emotions are clear and interpretable."
    },
    {
        "aspect": "Social Interactions",
        "introduction": "Evaluates understanding of interactions and relationships between people captured visually.",
        "guidance": "Describe the setting where the interaction takes place, specifying whether it is public or private, formal or informal. Identify the people involved, providing details about their appearance, clothing, and any objects they may be using or interacting with. Highlight the nature of the interaction, emphasizing emotions, body language, and facial expressions to convey feelings such as friendliness, conflict, collaboration, or affection. Note the positions and gestures of the individuals to illustrate the dynamics between them, such as proximity, eye contact, or physical touch. Ensure a balance between the characters and their environment, making sure neither overwhelms the other. Avoid vague descriptors and ensure each element of the interaction is visually distinct to prevent ambiguity in the relationships portrayed."
    },
    {
        "aspect": "Intent and Motivation",
        "introduction": "Measures the capacity to infer the intent or motivation behind actions depicted in an image.",
        "guidance": "When crafting a prompt to depict intent and motivation, ensure that the actions of the subjects in the image clearly indicate an underlying purpose or goal. Specify the actions being performed, the expressions on faces, and the context that clarifies the intent. For example, instead of just saying \"a person running,\" specify \"a determined athlete running towards the finish line, sweat on their brow, and a look of intense focus on their face, with a cheering crowd in the background.\" Balance the elements by not overloading the scene; keep the main subject's intent clear through body language and contextual clues. Avoid vague or ambiguous actions that do not convey a clear goal or purpose."
    },
    {
        "aspect": "Cultural Context",
        "introduction": "Assesses the ability to understand cultural references and context within a visual scene.",
        "guidance": "When creating a prompt that focuses on cultural context, explicitly specify the cultural elements to be included in the image. Describe specific cultural symbols, practices, attire, architecture, or artifacts that should be present. Ensure the image encapsulates a particular cultural moment or scene\u2014for instance, a traditional Japanese tea ceremony in a Zen garden, complete with kimonos, tea utensils, tatami mats, and cherry blossoms. Balance the elements by having clear focal points that highlight the cultural aspects prominently, such as the tea master in the center performing the ceremony, with the detailed garden setting in the background. Avoid mixing symbols from different cultures to prevent anachronisms or misrepresentations; stay true to the specific cultural context indicated."
    },
    {
        "aspect": "Group Dynamics",
        "introduction": "Evaluates comprehension of the dynamics between multiple people or entities in an image.",
        "guidance": "To effectively depict group dynamics in an image generation prompt, specify the number of individuals or entities and their interactions. Describe the specific roles, activities, and expressions of each member to reflect their relationships and social dynamics. Ensure each individual or entity's placement and actions are balanced to convey a coherent scene. For instance, describe one person leading a conversation while others exhibit reactions like listening, nodding, or disagreeing body language. Avoid ambiguous or overly general terms that do not clearly define each participant's role and interaction, ensuring clarity in their collaborative or conflicting dynamics."
    },
    {
        "aspect": "Social Norms",
        "introduction": "Tests the understanding of social norms and behaviors as depicted visually.",
        "guidance": "The prompt should depict a scenario where individuals are engaged in interactions or behaviors that clearly reflect established social norms. Choose a relevant social situation, such as a formal dinner, a business meeting, a classroom setting, or a public transportation scene. Specify details like attire appropriate to the setting, body language, and facial expressions that convey respect, politeness, or other social etiquette. Ensure the scene includes multiple characters to illustrate social dynamics, including elements like personal space, hierarchical behavior, and gestures that indicate listening or agreement. For balance, highlight key actions or postures distinguishable to the social setting's norms, such as shaking hands or raising a hand to speak. Avoid overly generic or ambiguous depictions that could misrepresent social norms, and ensure the setting and characters\u2019 interactions are contextually rich and culturally consistent."
    }
]