[
    {
        "aspect": "Foreground vs. Background",
        "introduction": "This aspect assesses the model's ability to distinguish between objects that are in the foreground and those that are in the background of an image.",
        "guidance": "When creating a prompt, clearly specify the main subjects or objects to be placed in the foreground of the image, ensuring they are described with distinct and vivid details. For instance, \"a majestic tiger resting on a rock with lush, detailed fur and piercing eyes\" should unmistakably appear in the foreground. Next, describe the elements to be included in the background, ensuring they provide context without overpowering the primary subject. For example, \"a dense jungle with tall trees and a misty atmosphere, slightly blurred to show depth.\" Specify the spatial relationship between foreground and background elements to manage their visual prominence. Use descriptive terms for perspective, such as \"in the distance,\" \"behind,\" or \"surrounding\" to clearly delineate the separation. To avoid common pitfalls, ensure that the foreground elements are more detailed, sharp, and vibrant compared to the background elements, which should retain softer, muted, or less defined characteristics. This contrast will help in achieving a clear differentiation, maintaining a cohesive and realistic image."
    },
    {
        "aspect": "Relative Positioning",
        "introduction": "This evaluates the model's skill in identifying the relative positions of objects, such as one object being above, below, to the left, or to the right of another.",
        "guidance": "Ensure to specify the exact locations of the primary objects in relation to each other. For example, if generating an image of a cat and a ball, the prompt should clearly state, \"a cat sitting to the left of a ball.\" Avoid ambiguity by using precise prepositions and spatial indicators. To maintain balance, arrange the elements so they interact naturally within the scene, such as \"a tree to the right of a house, with a bird perched above the tree.\" Ensure that objects are neither too close nor too distant from each other to maintain visual harmony. Be mindful to avoid placing objects in positions that would logically or physically contradict one another, like \"a car above a tree.\""
    },
    {
        "aspect": "Centrality",
        "introduction": "This aspect judges the model's understanding of whether an object is placed centrally within the image or skewed to the periphery.",
        "guidance": "In the prompt, specify the main subject of the image and clearly state that it should be placed in the center. Emphasize that the subject should dominate the central portion of the image, ensuring that peripheral elements, if any, do not distract from the main focus. For clarity, use phrases such as \"centrally positioned,\" \"central focus,\" or \"main subject in the middle.\" To avoid common pitfalls, indicate that the background and surrounding elements should frame, not clutter, the central subject, maintaining a clear and unobstructed view. Example prompt: \"A centrally positioned golden retriever puppy, sitting in the middle of a grassy field, with trees softly framing the edges. The main focus is the puppy in the center, with peripheral elements that enhance but do not dominate the scene.\""
    },
    {
        "aspect": "Proximity",
        "introduction": "This measures the model\u2019s ability to assess how close or far apart objects are from one another.",
        "guidance": "When creating an image generation prompt focusing on proximity, specify the exact distance between each object. Describe one or two primary objects and their relative positions, emphasizing whether they are close together, overlapping, or far apart. Use descriptive language to make the spatial relationships clear, such as \"a tree and a bench placed side by side with no gap,\" or \"a mountain in the background with a cabin in the foreground, several meters apart.\" Ensure details about any intermediate objects and their distances to the main subjects are also included, maintaining a balance that avoids crowding or excessive spacing. Explicitly mention any necessary spatial hierarchy, such as \"a cat napping next to a dog, just inches away,\" to guide the model toward appropriate proximity representation. Avoid vague terms like \"near\" or \"far\" without quantification, and clarify the visual context to prevent any ambiguity."
    },
    {
        "aspect": "Overlapping Objects",
        "introduction": "This tests the model's capacity to recognize instances where objects overlap, indicating which objects are partially obscured and which are fully visible.",
        "guidance": "Ensure that the prompt specifies at least three distinct objects where one object partially obscures another. For example, describe a scenario where a book lies partially covering a smartphone, and a coffee cup is placed such that only part of the book is visible beneath it. Balance the elements by clearly stating the frontmost and backmost objects, as well as the positions and angles of overlap. Use descriptive positioning cues like \"a book in the center of the table, slightly angled with the top right corner partially covering a smartphone underneath, and a coffee cup overlapping the book on the left side.\" Avoid vagueness in the placement and overlap descriptions to prevent misunderstandings. Specify the objects' colors, sizes, and textures for better differentiation and to ensure clarity in the visual representation."
    },
    {
        "aspect": "Symmetry and Alignment",
        "introduction": "This aspect evaluates the model's ability to detect symmetrical arrangements and alignment of objects, determining if they are evenly spaced or aligned along a particular axis.",
        "guidance": "Create a scene featuring a central focal point, such as a fountain, statue, or building, positioned exactly in the middle of the composition. Ensure that the objects on either side of this central point are mirrored and evenly spaced. For example, if depicting a garden, place identical flowerbeds on both sides in perfect symmetry. When including additional elements like trees, pathways, or benches, they should be evenly aligned along horizontal and vertical axes. Pay attention to details in object sizes and shapes to ensure they are balanced and proportional. Avoid having elements that disrupt the symmetry, such as objects skewed to one side, uneven spacing, or varied sizes that break the uniformity of the scene."
    },
    {
        "aspect": "Depth Cues",
        "introduction": "This aspect measures the model's ability to use visual cues such as shadows, perspective, and occlusion to perceive depth in an image.",
        "guidance": ""
    },
    {
        "aspect": "Depth Layers",
        "introduction": "This evaluates the model's comprehension of how objects are distributed across different depth layers in a scene.",
        "guidance": "In your prompt, clearly specify at least three distinct depth layers: foreground, middle ground, and background. Describe the primary objects or elements that should appear in each layer, and indicate how they interact spatially. For example, place a prominent, detailed subject such as a person or an animal in the foreground, ensuring it captures the viewer's attention. In the middle ground, include secondary elements like trees, buildings, or other structures that provide context and add depth. For the background, describe distant scenery such as mountains, sky, or a horizon line to establish the farthest point in the scene. Ensure the transitions between these layers depict a natural progression in scale and detail. Additionally, consider lighting and atmospheric effects such as fog or light diffusion to enhance the sense of depth. Avoid overcrowding any single layer to prevent a cluttered appearance, and maintain a coherent theme throughout the scene."
    },
    {
        "aspect": "Relative Size Interpretation",
        "introduction": "This aspect assesses the model\u2019s understanding of how the size of objects can indicate their distance from the viewer, interpreting smaller objects as further away.",
        "guidance": "To effectively incorporate Relative Size Interpretation into your prompt, clearly define a major object that will be shown larger in the foreground and another similar or related object that will appear smaller in the background to imply distance. For instance, describe a large tree prominently in the foreground and another smaller tree in the background. Specify spatial relationships and positions such as \"a large tree in the foreground on the left side, and a smaller tree in the background on the right side.\" Additionally, include elements that naturally scale in size, like \"a wide river flowing towards a distant, narrow stream,\" ensuring that closer elements are larger and more detailed, while distant elements appear smaller and less defined. Avoid having objects of inconsistent sizes that do not correlate with their intended depth to prevent the image from looking unnatural. Emphasize perspective and size variation to faithfully represent depth and distance within the scene."
    },
    {
        "aspect": "Focal Points",
        "introduction": "This tests the model\u2019s ability to identify the primary focal point in an image that helps establish the depth of the scene.",
        "guidance": "To incorporate focal points in your prompt, clearly define the primary subject that should be the main point of interest, ensuring it is centrally positioned or highlighted prominently. Describe the primary subject in detail, including its size, color, texture, and any distinct features that make it stand out. Surrounding elements should be included to support the main focal point, without overshadowing it; these can include secondary subjects, background scenery, or any objects that add context to the scene. Emphasize the visual hierarchy by specifying the scale and positioning of secondary elements, ensuring they naturally lead the viewer\u2019s eye towards the main focal point. Use language that suggests depth, such as \u201cin the foreground\u201d or \u201cin the background,\u201d to create a sense of spatial dimension. Avoid overly complex descriptions that might divert attention away from the primary focal point, and ensure all elements are cohesively integrated to support a clear and compelling visual narrative."
    },
    {
        "aspect": "Perspective Lines",
        "introduction": "This aspect evaluates the model's recognition of perspective lines and how they converge to indicate depth and distance.",
        "guidance": "In the foreground, include a detailed focal point such as a person standing on a cobblestone street. Extend perspective lines from the base of the foreground object, converging towards a distinct vanishing point situated in the background. Ensure a sense of diminishing size and increasing detail blur as objects recede into the distance. Populate the background with architecture or natural elements that align with the perspective lines, clearly demonstrating depth. Consider adding shadows and light from a specific direction to enhance the three-dimensional feel. Avoid placing elements that disrupt or obscure the perspective lines, keeping the converging lines continuous and unbroken from foreground to background."
    },
    {
        "aspect": "Depth Consistency",
        "introduction": "This measures how well the model maintains consistent depth perception throughout the scene, recognizing any discrepancies that may indicate a lack of depth understanding.",
        "guidance": "Instruct the AI to create a scene with clear foreground, middle ground, and background elements, each progressively diminishing in scale to simulate depth. Include specific items such as a tree in the foreground with vivid, sharp details, a group of smaller trees and a path in the middle ground, and a distant mountain range in the background with muted colors and less detail. Ensure that the lighting across the scene reinforces the depth, with shadows and highlights suggesting the spatial relationships between objects. Avoid placing objects in a way that disrupts the natural perspective, like a large, detailed item in the background or a hazy, indistinct object in the foreground. Use overlapping elements, such as a branch from the foreground tree crossing in front of a middle ground path, to further enhance the perception of depth."
    },
    {
        "aspect": "Touching Objects",
        "introduction": "This assesses the model\u2019s ability to recognize when objects are in contact with each other.",
        "guidance": "To effectively represent touching objects visually, specify the objects and the context in which they are in contact. Clearly identify the surfaces or points of contact between the objects. For instance, if generating an image of a cat and a dog, describe the cat gently patting the dog's head with its paw. Ensure the prompt specifies the interaction that implies contact, such as \"A toddler's hand holding an adult's finger,\" or \"A book resting on the edge of a table.\" Balance the scene by positioning the objects naturally, avoiding overcrowded compositions that may obscure the touching point. To prevent misinterpretations, avoid ambiguous language and ensure the description of the touch is clear and specific, emphasizing the interaction details and the physical connection where the objects meet."
    },
    {
        "aspect": "Object Support",
        "introduction": "This aspect evaluates the model's understanding of objects supporting each other, such as a book resting on a table.",
        "guidance": "When creating an image prompt involving object support, specify the primary object being supported and the supporting surface or object. Ensure that the supported object is clearly depicted as resting firmly on the supporting surface, emphasizing the contact points. For example, describe a scene like \"a wooden table with a thick, hardcover book resting on its surface.\" Explicitly mention the orientation and position, such as \"the book is lying flat with its spine facing up.\" Balance the elements by detailing all physical properties relevant to their interaction, like \"the table has a polished surface that reflects light subtly, and the book's weight slightly compresses the table surface.\" Avoid ambiguity by not placing floating objects or gaps between the supported and supporting items. Additionally, provide context to enhance realism, like \"add a steaming cup of coffee beside the book on the table, with its shadow reflecting under the light source.\""
    },
    {
        "aspect": "Enclosure",
        "introduction": "This tests the model's ability to identify situations where one object encloses or surrounds another, such as a frame around a picture.",
        "guidance": "Begin by defining the primary object that will be enclosed, ensuring it's central to the composition. Clearly describe the enclosing object, specifying its shape, texture, and how it surrounds the primary object. Maintain a balanced relationship between the enclosed and enclosing elements, ensuring neither one overwhelms the other visually. Highlight details that emphasize the enclosure, such as a garden with a decorative fence or a cathedral with stained glass windows encircling an altar. Avoid ambiguity by using specific descriptors for positioning and interaction, like \"a lush garden enclosed by a white picket fence\" or \"a golden picture frame surrounding a classic portrait.\" Ensure that the enclosure is visually distinct yet harmoniously integrated with the central subject."
    },
    {
        "aspect": "Containment",
        "introduction": "This measures the model\u2019s capability to recognize when an item is contained within another, like a toy inside a box.",
        "guidance": "To represent containment, structure the prompt to clearly depict an item enclosed within another distinct object. Specify both the containing item and the enclosed item, including their relative positions and interactions. For instance, describe a scene where a small, colorful toy is situated inside an open, wooden box. Ensure the toy is visibly within the confines of the box, with part of the box\u2019s interior and exterior visible for context. Balance the elements by detailing the box's position in the image's composition, such as being centrally located or placed to one side. Avoid ambiguities by explicitly stating that the toy is entirely or partially inside the box to prevent misinterpretation. Consider including descriptors for textures, colors, and materials to enhance visual clarity and context. For example, \"A small, red toy car positioned inside a large, open, brown wooden box, with the box\u2019s edges and interior clearly visible.\""
    },
    {
        "aspect": "Dynamic Interaction",
        "introduction": "This aspect evaluates if the model can detect and interpret interactions between objects that imply movement or activity, such as a ball being thrown.",
        "guidance": "The focus should be on creating a scene where objects are engaged in clear, active interactions. One object should appear in the process of affecting another. For example, a prompt could depict a person caught mid-throw with a ball visibly leaving their hand, or a cat pouncing on a toy mouse in mid-air. Ensure that any movement is captured vividly, with appropriate body language or motion lines that indicate direction and speed. Balance the scene by centering the primary interaction while adding background elements that provide context but do not overshadow the main activity. To avoid misinterpretations, clearly define the roles of each object in the interaction and specify the action taking place, such as \"A dog is jumping to catch a frisbee in a park,\" rather than vague descriptions. Use action verbs and descriptive language to convey the dynamic nature of the scene, ensuring the generated image effectively portrays active movement and interaction."
    },
    {
        "aspect": "Stack and Balance",
        "introduction": "This assesses the model\u2019s capability to understand the spatial arrangement in stacks and balanced structures, like a pile of books.",
        "guidance": "Create an image where multiple objects are carefully arranged in a stacked formation, demonstrating balance and stability. For example, depict a pile of books, with each book slightly offset in position, but maintaining an overall stable structure. Ensure that the stack includes a variety of book sizes, colors, and orientations, visually illustrating the concept of balance. The background should be minimalist to keep the focus on the stack, with perhaps a simple desk or floor surface beneath it. Avoid including too many other elements that could distract from the main stack and balance focus. Aim for coherent lighting and shadow effects to enhance the perception of depth and stability in the image."
    },
    {
        "aspect": "Object Tilt",
        "introduction": "This aspect evaluates the model's ability to recognize if an object is tilted or straight within an image.",
        "guidance": "Describe an object in the image and specify the angle or degree of its tilt relative to a straightforward, upright position. Ensure you clearly state how the tilt should be oriented (e.g., tilted to the left, right, forward, or backward). Balance the scene by providing context and surrounding elements that highlight the tilt, such as an upright reference point or horizon. For example, if generating an image of a tilted bicycle, specify \"a bicycle tilted 30 degrees to the right against a straight lamppost and horizontal road.\" Avoid vague descriptions like \"slightly tilted\" and instead give precise angles or directional cues to ensure clarity and accuracy in object representation."
    },
    {
        "aspect": "Rotation Angles",
        "introduction": "This tests the model's ability to determine the rotational angle of an object relative to a standard reference.",
        "guidance": ""
    },
    {
        "aspect": "Mirror Imaging",
        "introduction": "This aspect measures the model\u2019s competency in identifying symmetrical objects and their mirror images.",
        "guidance": "Create an image that features a clear, central object with a perfectly mirrored counterpart. Place the original object and its mirror image directly opposite each other, separated by a distinct axis of symmetry\u2014this could be a vertical line, a water surface, or a reflective material like a mirror. Ensure both versions of the object are identical in size, shape, and detail, maintaining precise symmetry. Incorporate a background that is simple and non-distracting to highlight the symmetry between the primary objects. Avoid overly complex or cluttered scenes that may obscure the clarity of the mirror imaging aspect."
    },
    {
        "aspect": "Object Flipping",
        "introduction": "This assesses the model's capability to recognize objects that have been flipped horizontally or vertically.",
        "guidance": "Specify the main object of the image and indicate that it should appear both in its normal orientation and in a flipped state. Describe the object\u2019s original orientation and the flipping direction (horizontal or vertical). Ensure a clear visual distinction between the two versions of the object within a balanced composition. For example, place the original object on the left side of the image and the flipped object on the right side, or vice versa. Avoid overlapping the objects to prevent visual confusion. Use complementary but contrasting backgrounds to further differentiate between the original and flipped versions of the object, while maintaining a uniform setting to keep coherence in the overall image."
    },
    {
        "aspect": "Orientation Consistency",
        "introduction": "This evaluates if the model can consistently maintain the orientation of objects across different scenes or perspectives.",
        "guidance": "To ensure orientation consistency, describe the main object and its fixed orientation. Specify additional scenes or perspectives in which this object appears, ensuring that its orientation remains unchanged. For example, if the primary object is a car, state that the car is always facing to the right, whether it is in a city street scene, a countryside setting, or a parking lot. Balance the composition by clearly defining the surrounding environments and supplementary elements, ensuring they do not obscure or alter the primary object's orientation. To avoid common pitfalls, refrain from using ambiguous direction terms that are susceptible to different interpretations, and consistently reinforce the orientation direction across all scene descriptions."
    },
    {
        "aspect": "Axis Alignment",
        "introduction": "This tests the model's ability to determine if objects are aligned along specific axes, such as vertical, horizontal, or diagonal alignment.",
        "guidance": "Specify the objects or subjects to be included in the image and explicitly detail their alignment along the chosen axis. For instance, if utilizing a vertical alignment, describe one or more objects stacked vertically or positioned in a straight upright line. If the alignment is horizontal, indicate that objects should be placed side by side, forming a linear row. For diagonal alignment, direct the model to arrange objects from one corner of the image to the opposite corner, creating an angled line. Ensure that the prompt includes clear reference points, such as \"aligned with the vertical center of the image\" or \"forming a diagonal line from the top-left to the bottom-right corner.\" The balance should maintain the relative positioning of each object according to the specified axis, without overlapping or disproportionate spacing. Avoid ambiguous terms that may confuse the spatial orientation, and use definitive language to articulate the axis and placement of the objects precisely. Example prompt: \"A row of blue butterflies, horizontally aligned from left to right, with each butterfly evenly spaced, against a green meadow backdrop.\""
    }
]