{"overall_metrics": {"CHAIRs": 0.38877755511022044, "CHAIRi": 0.11204293861120429, "Recall": 0.5629562043795621, "num_hallucinated_caps": 194.0, "num_caps": 499.0, "hallucinated_word_count": 334.0, "coco_word_count": 2981.0, "length_response": 69.76753507014028, "hallucinated_caps_ls": [144305, 54870, 95908, 324008, 234433, 39053, 128119, 401027, 253630, 317320, 161227, 25812, 195645, 32625, 18688, 481596, 67833, 272880, 254368, 231534, 209347, 13550, 2154, 274931, 104326, 300784, 409346, 522262, 344942, 300124, 490491, 12669, 133146, 261097, 185327, 348654, 556478, 451943, 268854, 495513, 529256, 37907, 506401, 278365, 432213, 201555, 148977, 399058, 183571, 325836, 437910, 467887, 174402, 352734, 295656, 557998, 401010, 313928, 490284, 346940, 472610, 388237, 289640, 59710, 527497, 61959, 229659, 473118, 416104, 5934, 131138, 258209, 285820, 453731, 418692, 120776, 276719, 354559, 468005, 378831, 375769, 20650, 152499, 147762, 425447, 145020, 301837, 240681, 178753, 47781, 211604, 298067, 563938, 47940, 526414, 273132, 272728, 282553, 188136, 191691, 99874, 557447, 221350, 280930, 134223, 217593, 477016, 301266, 558784, 539951, 143479, 230983, 564629, 53431, 53465, 381031, 9628, 254161, 163518, 423984, 118124, 288729, 30667, 114108, 366111, 70815, 88269, 536154, 374061, 158806, 368335, 416384, 531327, 451449, 1268, 151159, 436814, 275611, 478380, 110449, 45550, 205193, 272566, 170099, 14940, 408449, 354540, 124236, 14285, 82488, 226802, 425569, 28842, 45435, 35827, 283118, 429598, 490711, 277051, 304984, 122467, 518287, 352900, 227438, 202658, 18150, 92107, 491000, 474499, 358086, 239445, 105782, 487990, 138834, 111409, 397268, 127659, 305000, 44136, 154846, 567254, 543231, 495357, 523175, 42889, 124601, 581749, 227960, 286503, 315740, 475043, 456784, 99053, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "There are a few computer devices displayed in the image. A laptop, placed near the center of the scene, is sitting on a desk or table. A keyboard can be seen to the left side of the laptop, while a headset is situated further to the right. A mouse is also placed near the laptop, possibly connected to it through a computer mouse cord. ", "mscoco_hallucinated_words": [["mouse", "mouse"], ["mouse", "mouse"]], "mscoco_gt_words": ["keyboard", "laptop", "dining table"], "hallucination_idxs": [54, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 164091, "caption": "In the image, there are two dolls sitting in beds. A large doll is placed in the center, while a smaller doll sits in the foreground, next to the larger one. The scene takes place inside a bedroom or other room with furniture. There are multiple beds and cribs visible, but the focus appears to be on the two dolls.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 308003, "caption": " The image features a green park bench with a cracked and worn surface, resting under a dull sky. The bench takes up most of the scene, occupying the majority of the space within the image. The dull sky creates a damp and slightly gloomy atmosphere, adding an element of contrast to the green color of the bench. The cracked surface of the bench adds an element of age to the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": " The image features a sandwich and a salad on a plate. There is a knife placed near the plate as well. The knife is not being used in the scene, it just happens to be on the same surface as the sandwich and the salad. The knife is on the left side of the plate, slightly above and to the left of the sandwich, while the salad is positioned towards the right side. The sandwich appears to be a cheeseburger, and the salad consists of lettuce and tomatoes. The entire scene is set on a table, likely in a restaurant or a dining area.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "fork", "knife"], "hallucination_idxs": [104], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 95908, "caption": "A computer desk with multiple devices is showcased in the image. A keyboard and mouse are located on the desk, with the keyboard towards the left side and the mouse towards the center. A silver coffee pot with a star on it is also placed on the desk, sitting close to the mouse. The computer's monitor seems to be partially covered by a mouse pad, which is further towards the right side of the desk. There are no other notable items or individuals in the scene.", "mscoco_hallucinated_words": [["monitor", "tv"]], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07692307692307693, "Recall": 0.8}}, {"image_id": 329614, "caption": "The scene features a yellow school bus driving down a street surrounded by churches. There are several churches visible in the image, with at least two churches at the sides of the road, and another one situated further down. The street has some traffic from other vehicles like cars, with at least four cars visible in the image. The yellow school bus is moving in the opposite direction of these cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 324008, "caption": " The image showcases a tray of food that includes various items. In the center of the tray is a banana sitting next to several bottles and a container. There are also two bagels and two cups of food, likely yogurt, next to the banana and bottles. The food seems to be ready to be served and enjoyed.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["container", "bowl"], ["bagel", "donut"], ["bottle", "bottle"]], "mscoco_gt_words": ["cup", "dining table", "train", "orange", "airplane", "person", "banana"], "hallucination_idxs": [25, 28, 34, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.2857142857142857}}, {"image_id": 52700, "caption": " The image depicts a large dining table that is covered with numerous dishes of cake and cupcakes. There are numerous cake slices placed on the table, covering multiple forks and spoons. The people in the scene are standing around the table, some closer to the edges and others closer to the center. They seem to be curiously looking at the desserts displayed on the table, as the occasion might be a gathering or celebration.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 501311, "caption": " The image captures a baseball game in progress, with several players spread throughout the scene. The main focus of the image is a pitcher on the mound, throwing a baseball in an intentional manner, possibly attempting to retire an opposing player.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": " The image shows a group of people riding on the back of two elephants, with one of them carrying a person. The elephants have elaborate decorations on their backs, adding to the festive atmosphere. There are a few other people visible in the scene, either walking or standing. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "clock", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": "A bathroom is showcased in the image, featuring a white toilet situated near a window. A sink can be seen nearby. The bathroom also has a shower, with a hand shower head near the toilet. A tub can be seen in the scene, but it appears to be empty. Overall, the bathroom is clean and well-maintained.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toothbrush", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 196341, "caption": "The image depicts a group of people standing and walking around an enormous clock in the middle of a room. The clock is quite large, with both numbers and letters prominent on its face. There are about 10 people visible in the scene, some walking in various directions while others are standing. The clock, along with the daylight from the windows, gives the space a bright atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": " In the image, there is a left turn sign placed near a street sign and several signs below it. The signs indicate a location where the street intersects with a parking area, as there is a pole nearby with multiple bicycle symbols around it. There are multiple vehicles in the scene, including a truck, a bus, and a couple of cars. In addition, there are several pedestrians walking in the area.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 455610, "caption": " In the image, there is a red motorcycle parked on the sidewalk near a building. Two people are in the scene, standing next to the motorcycle, and one person is walking toward them, possibly taking a closer look. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 39053, "caption": " The image shows a table with several pieces of food placed on it, including a bowl of eggs and a bowl of pita bread, surrounded by various cups and knives. The food appears to be a part of a meal or an appetizer, and it is spread out across the table.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "cell phone", "apple"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.42857142857142855}}, {"image_id": 493862, "caption": "In the image, there is a man standing on a grassy field holding a baseball, likely waiting for a pitch. The field is filled with grass, extending to the horizon. There are a few other people scattered around the field at various distances, likely engaged in other activities or observing the man playing baseball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tennis racket", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 269873, "caption": "The image depicts a busy street scene with a group of people, some walking and some standing, crossing over the street from one side to the other. The people are walking on a street with a trolley passing by. There are various ties in the crowd, some of which appear to be tied around the waists of some of the individuals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 524656, "caption": "The image showcases a small kitchen with a microwave, a toaster oven, and a refrigerator. The microwave is placed near a window, while the toaster oven is located closer to the kitchen's center. The refrigerator is positioned at the side of the kitchen. There are several bowls and cups placed on the counter, as well as a bottle on the far left of the kitchen. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toaster", "cup", "knife", "oven", "refrigerator", "bowl", "spoon", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6363636363636364}}, {"image_id": 369547, "caption": "The image features a black and white dog sitting on a rocky surface, with a bird in the sky above. The bird seems to be circling over the dog, potentially chasing it. There is another bird nearby, but it is closer to the ground. Both dogs and birds can be seen in the picture, with their distinct features standing out. Additionally, the beach is scattered with small pebbles, giving the environment a rocky appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "dog", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": "A man with glasses is leaning against an open refrigerator, looking into it. He is in a kitchen filled with various food and bottles. There are at least three bottles visible in the scene, including a bottle closer to the man, another located further right, and a third one found in the top-left corner. Additional bottles may be scattered throughout the kitchen. The man's location and actions suggest he may be preparing a meal or searching for something to eat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bowl", "bottle", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 128119, "caption": "The scene features a bus with two bicycles attached to the front of it. The bus is white and parked on a street with a few cars on the side. There are also a few people standing near the bus and another person on the sidewalk, possibly observing the situation.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["bicycle", "bus", "handbag", "cell phone", "backpack", "person"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 401027, "caption": " The image features a sandwich placed on a table, which appears to be cut in half. The sandwich is accompanied by two small bowls filled with dips, one of which appears to be yellow in color. The table also includes a couple of cellophane-covered containers.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["container", "bowl"]], "mscoco_gt_words": ["cup", "knife", "dining table", "sandwich", "fork"], "hallucination_idxs": [25, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 253630, "caption": "In the image, there is a living room filled with furniture. The centerpiece of the room is a black coffee table with a glass top and a small square-shaped footstool placed in front of it. The black coffee table serves as a focal point in the room. There are two people sitting in the room, one closer to the front of the image and another towards the right side of the room. There is a television placed towards the back of the living room, providing entertainment for the occupants.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["potted plant", "chair", "tv", "dining table"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 326021, "caption": "A woman is sitting behind a table with a teddy bear on it. The teddy bear is placed on the left side of the table, taking up most of the space. The woman is seated next to the teddy bear, posing for the camera while the scene captures her smiling presence.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317320, "caption": "In the image, there are several people present, including a man riding a bicycle down the street with yellow stripes, two other men, and a woman. The man on the bicycle is holding a backpack and is surrounded by other people going about their day.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 507171, "caption": "The image features a person riding a wave on a surfboard in the water. The man is in the center of the scene, riding the wave in the foreground. Behind him, a bridge can be seen in the background. The scene is set on a body of water, with waves breaking in the distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 161227, "caption": "The image features a couch with a remote control sitting on top of it, with a laptop on a nearby table. There are also a cell phone, a bottle, and a clock in the scene. The cell phone is sitting on the left side of the couch, and the bottle is on the right side, near the remote. The clock is positioned towards the middle of the room.", "mscoco_hallucinated_words": [["table", "dining table"], ["bottle", "bottle"], ["clock", "clock"], ["bottle", "bottle"], ["clock", "clock"]], "mscoco_gt_words": ["remote", "keyboard", "laptop", "cell phone", "chair", "couch"], "hallucination_idxs": [21, 31, 35, 55, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4166666666666667, "Recall": 0.6666666666666666}}, {"image_id": 25812, "caption": " The image shows a person snowboarding down a snow-covered slope while facing towards the right side of the image. There is a snowboard and skis on the ground near the person, indicating that they may have stopped skiing before or during their downhill journey. The slope is surrounded by trees, and there are several chairs visible nearby, possibly belonging to skiers taking a break from skiing.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 267537, "caption": "The image depicts a group of four women sitting on a bench, each wearing an umbrella to shield them from the rain. They are holding hands, possibly as a symbol of unity or support. All of the women are sitting on the bench and holding umbrellas, creating a cozy and intimate scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 195645, "caption": " In the image, there are several people gathered around a table, with a wine glass in the middle. The people appear to be engaged in conversation as they dine together. Among the group, there's a woman wearing a blue dress and a man wearing a tie, along with other individuals.", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["handbag", "bench", "dining table", "bottle", "wine glass", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 155671, "caption": "The scene is an empty, sunny street on a beautiful day. There are a few cars present, one at the end of the street, one in the middle, and another near the top. There is a traffic light located near the top as well, indicating the presence of an intersection or crossroad. Apart from the cars, there are a few trucks, including one at the bottom of the scene, one in the middle, and another towards the top left. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412894, "caption": "According to the picture, it is a busy street scene with multiple people walking, shopping, and standing near numerous signs. There are also a few traffic lights located near the street, one at a corner and two near the center of the street. Among the signs and traffic lights, there is a sign for subway on one side and a couple of traffic lights on the other side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "handbag", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 32625, "caption": " In the image, there is a table with various food items placed on it. One plate holds a portion of food that appears to be a mixture of broccoli and carrots. Another bowl is situated closer to the front edge of the table. The food items appear to be dishes for lunch, served with a spoon on the left side of the dish. There is a chair towards the right side of the table, with two additional chairs positioned further back. A variety of books are scattered throughout the image, including a few placed on the table and others scattered around the room.", "mscoco_hallucinated_words": [["broccoli", "broccoli"], ["carrot", "carrot"], ["spoon", "spoon"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "fork", "book"], "hallucination_idxs": [30, 32, 60, 72, 84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.45454545454545453, "Recall": 0.6}}, {"image_id": 18688, "caption": "There is a baseball game happening in a room, captured on a television screen. In the middle of the image, a baseball player is standing at the plate, holding a bat, and another player is positioned behind him. A few other players are dispersed throughout the room, including one in the upper left and two in the upper right of the image.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 343534, "caption": "According to the image, there is a small doll dressed in a knitted sweater standing next to a brown teddy bear. The doll is positioned on the left side of the image, and the teddy bear is positioned on the right side of the image. The scene appears to be a home setting or a living room, as there is a chair visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "potted plant", "dog", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 370657, "caption": "In the image, a group of people is riding on a motorcycle, with several adults and at least two children. They are wearing masks, possibly due to an outbreak of COVID-19 in the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": "In the image, a man is standing at the top of the bleachers in a sports stadium, eating a hot dog in the stands with a crowd of people around him. Many other people are visible in the background, watching and socializing during the baseball game. The stadium is full of chairs, with some chairs being closer to the man in the stands while others are further away. There are various food items on the hot dog stand, such as ketchup bottles, relish, and mustard.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [89], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 460347, "caption": "This image depicts a white bus driving down a road surrounded by a busy parking lot. The bus is located in the middle of the lane, while several cars are following closely behind, driving along the road in various positions. There are several drivers on the road, each one responsible for their vehicle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 398753, "caption": " The image captures a baseball game taking place, with a large crowd gathered around the field, possibly watching the action on the field or waiting to play themselves. The players are dispersed throughout the field, with some players standing near the first and third bases, a few closer to the batter, and others closer to the infield.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 67833, "caption": "The image shows a large pizza placed on a table in the center of a room, surrounded by various food items, including two bottles of wine, and two spoons. There are also two cups on the table. The table is in a dining room setting, indicating that the pizza and the other food items are being served as part of a meal.", "mscoco_hallucinated_words": [["spoon", "spoon"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "wine glass", "person"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4444444444444444}}, {"image_id": 272880, "caption": "In the image, there is a piece of pizza on a plate with the word \"mystic pizza\" written on it. There is a cup of coffee nearby. The pizza is positioned towards the right-hand side of the image. In the background, there is a fork resting near the plate. The fork might have been used to eat the pizza, but it's unclear. The plate and the fork are placed on a dining table, which appears to be wooden. Several other objects, such as a cell phone, are scattered around the table.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [98], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.6666666666666666}}, {"image_id": 254368, "caption": " The image depicts a man in a white robe sitting in bed, holding a slice of pizza as he eats. He is surrounded by various items such as dishes, bottles, and a handbag, which suggests a casual, relaxing setting. There are two cups in the picture, one placed close to the bed and the other on the right side of the frame. A toilet can be seen in the lower right corner, which further emphasizes the casual atmosphere.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["handbag", "handbag"], ["cup", "cup"], ["toilet", "toilet"]], "mscoco_gt_words": ["pizza", "bed", "person"], "hallucination_idxs": [32, 36, 49, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 1.0}}, {"image_id": 193108, "caption": " In the image, there are multiple baseball players, including a batter standing at the plate, swinging a baseball bat, and a group of onlookers, likely cheering them on. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 459141, "caption": " According to the picture, a person is riding a bicycle with a straw hat or a hat of some sort. There are also a few other people in the scene, but they are not riding bicycles. The bicycle is situated in the middle of the scene, surrounded by the other people. One of the people is closer to the left side of the image, while the other is closer to the right side. The other person is riding a bike as well, but it is not visible in the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image depicts an airport scene with an airplane parked on an airport tarmac and various people, vehicles, and items involved in the logistics of the airport. On the tarmac, there are multiple trucks, several cars, and a couple of carts, all likely transporting luggage and other airport supplies. A few passengers can be seen in the scene, possibly waiting for their luggage to be loaded onto the plane.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["truck", "suitcase", "handbag", "airplane", "person"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.8}}, {"image_id": 227031, "caption": "A man and woman are standing together, likely at a wedding or celebration, as they are getting married or commemorating their union. They are both wearing ties, and the man is cutting the cake, while the woman assists him in the process. A large cake is being shared between the couple, placed on the table in front of them. Other people can be seen in the background, likely members of the wedding party or attendees.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "tie", "potted plant", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 209347, "caption": " In the scene, there is a woman wearing headphones while sitting on the bench of a subway train, looking at her cell phone. She is surrounded by other people on the train. One person is sitting across from her, while two others are closer to the sides of the train. A couple of additional individuals can be seen standing in the background, with one closer to the middle of the train and another towards the back.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["handbag", "tie", "train", "cell phone", "person"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 523292, "caption": " A cat is standing on top of a laptop sitting on a coffee table, looking at a piece of paper placed on the table in front of it. The laptop is positioned near the front right corner of the table, while the paper is placed near the center. The cat's attention is directed towards the paper, as if it is trying to read something.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "dining table", "laptop", "cat", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 12375, "caption": " The image displays a white plate, possibly a serving plate, filled with a variety of food items, including meat, potatoes, carrots, and pickles. The plate appears to be positioned on a dining table or a tabletop. It is an interesting dish, combining different types of food in a single meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 13550, "caption": "According to the picture, there is a woman standing next to an elephant. She seems to be petting the elephant, possibly feeding it, as people might do with domestic animals like dogs or cats. There is a bottle nearby, situated close to the right side of the woman.", "mscoco_hallucinated_words": [["dog", "dog"], ["cat", "cat"], ["bottle", "bottle"]], "mscoco_gt_words": ["person", "handbag", "elephant"], "hallucination_idxs": [35, 37, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": " The image captures a man skateboarding in an urban setting, appearing to perform a trick while riding a skateboard along a sidewalk or path. He is riding on the top of the skateboard, and it appears that he is going to leap over something or perhaps performing a trick.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": "The image depicts a man standing in a corridor or passageway while checking his cell phone. He is dressed in a tie, possibly indicating a formal or business setting. There are three other people nearby, and they are not standing very close to the man in the center of the image. The corridor appears to have a few arches, adding visual interest to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209925, "caption": "The image depicts a baseball player in the middle of the field, ready to pitch a baseball. He is standing on the mound, holding a baseball, and he is in the act of throwing a pitch. There are other players scattered throughout the scene, some closer to the mound and others further away in various parts of the field. The scene captures an exciting moment in a baseball game, with the players preparing for the next play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 2154, "caption": " The image shows a shopping cart in the middle of a street, placed next to a park bench. There are multiple people in various locations nearby, standing, walking, and sitting on the bench. There are two trees on the street, one closer to the shopping cart, and the other a bit further away. A couple of chairs can also be seen in the scene, one near the bench and another in the background.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["car", "umbrella", "bench", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 274931, "caption": "The image captures a black and white scene of a street where several cars are parked along the side of the road. The cars are parked near some tires and there are some buildings in the background. The image gives a vintage feel, as it could be an old photograph. There are a few people standing near the cars, and the street is relatively empty otherwise.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "The image features a very clean and well-organized kitchen. It has two dining tables located near each other in the room. On the table, there is a vase and multiple cups. In the background, two windows are situated on opposite walls, giving the room a light and open atmosphere. There are two clocks in the kitchen, one closer to the dining tables and the other on the wall near the right dining table. The kitchen also has hardwood floors throughout the room, emphasizing its cleanliness and style.", "mscoco_hallucinated_words": [["vase", "vase"], ["cup", "cup"]], "mscoco_gt_words": ["clock", "knife", "oven", "dining table", "potted plant", "sink", "microwave", "chair"], "hallucination_idxs": [30, 33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.25}}, {"image_id": 233888, "caption": "According to the picture, a young child, likely around three years old, is playing a video game that features a baseball game. The child is standing in front of a large television set and holding a baseball bat, ready to play the video game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "potted plant", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 213905, "caption": "The image features two women in black and white walking on a tennis court. They are holding tennis rackets in their hands, possibly playing against each other or chatting before a match. There are several other people present in the scene, but they appear to be scattered throughout the tennis court and not all standing in one particular spot.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "tennis racket", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 277479, "caption": " The image features a little girl standing in front of an open refrigerator, holding her baby doll. There are a few people standing nearby the little girl, including one person closer to the left, another person towards the center, and a third person on the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 300784, "caption": " The image shows a group of children sitting on a couch or bench together, enjoying the presence of a cat sitting next to them. There are four children, with two kids seated on either side of the cat, while the other two are sitting at the front of the couch. A chair can be seen in the background, as well as other furniture in the room.", "mscoco_hallucinated_words": [["cat", "cat"], ["cat", "cat"], ["chair", "chair"]], "mscoco_gt_words": ["person", "bench", "dog", "couch"], "hallucination_idxs": [20, 40, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image depicts a table with three pieces of food on it. The center piece is a cake-like dessert topped with garnishes, including cherries and berries. There are also two smaller pastries, one on the left and the other on the right side. The desserts are placed on a white surface, possibly a table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 169169, "caption": "According to the picture, a large green tree is located near a sign on the sidewalk, with the words \"springing in the city\" above it. There are several people on the street, with some standing near the tree and others further away from it. The scene is lively, with multiple people walking around and interacting with one another. The tree and sign are situated in the middle of the sidewalk, with the signs hanging above it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "potted plant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 409346, "caption": " In the image, there are two women standing next to each other, attending a party. They are wearing matching outfits, which gives them a sense of unity. Near the left side of the dining table, there are several plates filled with food. The items on the table consist of numerous sandwiches, several bottles, a cake, and several bowls and cups. Some of these food items appear to be spread out across the table.", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "spoon", "cake", "person"], "hallucination_idxs": [57, 60, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.8}}, {"image_id": 522262, "caption": " (image description) In a picture, several bicycles are parked in a row along a fence next to a sidewalk. The bikes are various sizes and are spaced out, creating an organized yet slightly chaotic scene. There are several benches and a couple of trash cans located near the bicycles, with one bench positioned closer to the sidewalk and the other farther back towards the bicycles.", "mscoco_hallucinated_words": [["bench", "bench"], ["bench", "bench"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [44, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 79014, "caption": "The image features a man with glasses and a beard, who has a yellow ball placed in front of his nose. The man appears to be peering through a glass surface, which is probably a mirror, to see the yellow ball better. There are no other visible individuals in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "The image features a small toy robot with a blank expression sitting on top of a laptop, which is partially visible in the picture. There is another laptop or computer screen on the right side of the scene, likely belonging to a person or a character in a cartoon or animation.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": "According to the picture, there are three old, dirty toilets sitting on the steps of a building. They appear to be in a pile, with the two upper toilets leaning against the steps, while the lower toilet sits further down. All of the toilets appear to be dirty and worn, adding to the overall image of a disheveled scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "bowl", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 465601, "caption": " According to the picture, there are a number of people standing outside wearing raincoats, holding umbrellas and walking in the rain. Some of the people are wearing ties and one is wearing a green hat. They appear to be in a line or on a patio, with an open space between them. There are also a few more people in the background, walking or standing nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 300124, "caption": "The image shows a wooden chair with a small television mounted on top. The TV is positioned on a dining table-like surface, sitting on the chair's cushion. The TV has a built-in mircowave, adding a unique and possibly unconventional aspect to the seating area. The television appears to be relatively large in comparison to the chair.", "mscoco_hallucinated_words": [["television", "tv"], ["tv", "tv"], ["tv", "tv"], ["television", "tv"]], "mscoco_gt_words": ["microwave", "couch", "chair", "dining table"], "hallucination_idxs": [9, 15, 32, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.25}}, {"image_id": 46919, "caption": "According to the image, there is a man hiking on a snow-covered mountain, traversing the side of a hill. He appears to be wearing snowshoes and has a backpack. There is another person in the background, but it is unclear if they are hiking or simply standing on the slope. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 434022, "caption": "This scene captures a long train traveling on a train track with several coal cars in the background. The train consists of two engines and several other cars with varying positions and widths. The engines can be seen near the center of the train and some cars are placed behind them. The coal cars are also positioned near the back of the train. Overall, the train appears to be moving down the track while carrying some valuable goods.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 490491, "caption": " According to the picture, a small, rocky road is the main focus of the scene. It is situated in front of a tall building with various signs nearby. A stop sign is present near the street, indicating the end of the road. Additionally, a couple of traffic lights can be seen above the street, with one located closer to the left side and another on the right. A few people are visible in the scene, positioned in different areas. A sign with the words \"41a\" is prominently displayed in front of the street.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 175878, "caption": "According to the picture, four people are standing outside next to each other, forming a group. The people are all looking down and seem to be engaged in conversation. One man in the middle is particularly prominent, while another man is on his left and a woman is on his right. There is a cell phone, possibly belonging to one of the people in the group, located near the woman.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 12669, "caption": "The image features a room with several people in it, some standing and others sitting on chairs. A woman in a striped shirt is holding a wine glass and appears to be enjoying a wine tasting event. In the room, there are also various items on display, including vases, sculptures, and other decorative objects.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["wine glass", "handbag", "vase", "person"], "hallucination_idxs": [17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 192496, "caption": "The image showcases a blue and white train parked on the tracks at a train station. The train is positioned across the entire scene with two main sections: one is near the front and the other is towards the middle of the train. A person is present in the scene, standing next to the train near the front section, or closer to the middle section. The train appears to be stationary as it is parked on the tracks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 133146, "caption": "According to the picture, there is a large, nicely decorated living room with a fireplace, a couch, a chair, and a dining table. The room also contains various potted plants placed throughout, adding a touch of nature to the interior. Some of the potted plants are located on the table and close to the fireplace.", "mscoco_hallucinated_words": [["table", "dining table"], ["potted plant", "potted plant"], ["potted plant", "potted plant"], ["table", "dining table"]], "mscoco_gt_words": ["tv", "vase", "book", "chair", "couch"], "hallucination_idxs": [27, 34, 51, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.4}}, {"image_id": 53744, "caption": "In the image, there is a cutting board filled with a variety of food items, including sliced tomatoes and lettuce. The knife used to cut the vegetables is prominently visible in the image, along with the food spread out on the board. A sandwich is also visible in the background, possibly contributing to the meal being prepared.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375902, "caption": " The image features a person wearing ski gear, possibly skiing down the snow-covered mountains, holding a pair of skis together. There are two people on the slopes nearby who appear to be watching or observing the skiing action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": "In the image, a man is talking on his cell phone, possibly discussing something with the person he is communicating with. A few other people are in the background, one in the left corner and another two in the right corner, but they seem to be less involved in the conversation. The image captures the moment of the man interacting with others in the crowd.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bird", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 52435, "caption": "According to the image, there is a large group of people on a sandy beach, some sitting and others standing around. A surfboard is visible on the beach, suggesting a water activity, with multiple people in the water. There is also a small hill on the beach, providing a contrasting environment to the sandy shore. The beach is surrounded by trees, adding a natural element to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "umbrella", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301977, "caption": " The image depicts a brown cow standing in front of a store or market, possibly browsing for food or drinks. The cow is positioned close to the left side of the image, right outside the store front. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 261097, "caption": " The image features a tennis player walking along a sports court, with many spectators surrounding the player. The player, wearing a black shirt and white shorts, holds a tennis racket, possibly looking off to the side or for a specific ball.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["chair", "tennis racket", "bench", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 227204, "caption": " In the picture, three women are sitting at a table with a cell phone, cereal, and bananas. They appear to be chatting and drinking, perhaps on their lunch break. The table is filled with food and utensils, including cups, bowls, and spoons. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cell phone", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7777777777777778}}, {"image_id": 353981, "caption": " The image is a black and white photograph of an airplane with multiple people on the tarmac next to it. The plane is sitting on the ground and several people are standing around it. The aircraft appears to be a large plane with multiple engines, as indicated by the multiple propellers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 185327, "caption": " In the image, a large group of elephants is walking down a street, attracting the attention of both people and vehicles nearby. The elephants are walking from the left side of the scene towards the right, spreading out as they pass by various businesses and shops.", "mscoco_hallucinated_words": [["elephant", "elephant"], ["elephant", "elephant"]], "mscoco_gt_words": ["horse", "bus", "cow", "car", "person"], "hallucination_idxs": [8, 26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.2}}, {"image_id": 348654, "caption": "There is an iPhone and a small television sitting on a desk in front of a keyboard, with an additional monitor visible. The iPhone is placed on a black-covered smartphone holder near the TV, while the TV is positioned to the left and behind the monitor. A computer screen appears on the right side of the image, displaying a desktop view of an operating system. Overall, the scene seems to feature a combination of a TV, a computer, and a cell phone, creating a typical setup for both entertainment and work purposes.", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 1.0}}, {"image_id": 482432, "caption": " The image features a bathroom scene with a sink, a toothbrush, and a tube of Aveeno skin care product sitting next to it. The bathroom is spacious and well-lit, providing a sense of cleanliness and comfort. The sink is positioned towards the middle of the image, and the toothbrush is visible next to it. The Aveeno tube is also in the background, prominently placed beside the sink. The various bathroom items give a sense of the everyday activities that take place in a bathroom setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "sink", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": "In the image, there is a slice of pizza placed on a white plate. The slice of pizza is situated at the middle of the plate and has several small pieces, giving the appearance of an individual-sized pizza. Next to the pizza, there is a fork, indicating that the food is being served or enjoyed. On the other side of the table, there is another fork, as well as a wine glass with red wine in it, suggesting the consumption of a beverage alongside the pizza. The scene is set against a background, with the pizza and glass prominently featured.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "pizza", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 144723, "caption": " The image features a black and white dog in the middle of a field, leaping into the air to catch a red frisbee with its mouth. The dog is the center of attention, and it appears to be enjoying the action. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "frisbee", "car", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "The image shows a person holding a cellphone in their hand. The cellphone appears to be a flip-style phone with a black and grey color scheme. The person's hand is positioned in front of the phone, which takes up a significant portion of the image. The surrounding area is mostly empty, indicating that there might be other people in the scene or the person holding the phone is standing in an open space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": " The image shows a book and a video game controller sitting on a wooden table. The book appears to contain information about a video game, and the game controller is possibly for the Nintendo Wii game system. There is also a white remote nearby, which is likely associated with the book and the video game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "remote", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 203110, "caption": "The image depicts a large orange trolley traveling down the track, likely in a city or suburban setting. There are several people in the scene, some standing near the tracks and others further away. The trolley appears to be moving on a pathway, possibly along a street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bus", "train", "orange", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 556478, "caption": " In the image, there are two pizzas neatly placed on a stove top. One pizza is on the left side of the pan, and the other pizza is slightly higher on the right side of the pan. The stove is a gas oven, likely used for cooking and baking. The kitchen appliances, including a bowl, bottle, and two cups, are scattered around the area but do not interfere with the pizzas.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["pizza", "cup", "oven", "bottle"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 103815, "caption": "The image captures a moment during a baseball game where a man in a baseball cap is on the mound, holding a baseball glove as he prepares to throw a ball. Another man, with a baseball bat in hand, is swinging at a baseball pitched by the pitcher. There are other people in the background, possibly playing baseball or watching the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451943, "caption": " The image features a neatly made bed in a bedroom with the top sheets pulled back and white blankets visible. The bed is situated close to a window, allowing ample light into the room. On the bed, a single pillow is placed towards the center, while a cup is placed near the foot of the bed, next to a small stool.", "mscoco_hallucinated_words": [["cup", "cup"], ["stool", "chair"]], "mscoco_gt_words": ["backpack", "suitcase", "bed", "dining table"], "hallucination_idxs": [52, 66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 268854, "caption": " There is a sandy beach where two boats and two cars are parked next to each other. A woman carrying a basket of bananas on her head is walking along the beach, possibly toward the water. Other people are scattered around the area, and they appear to be enjoying the time together at the shore.", "mscoco_hallucinated_words": [["car", "car"], ["banana", "banana"]], "mscoco_gt_words": ["handbag", "person", "boat"], "hallucination_idxs": [10, 24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 34464, "caption": "According to the picture, a group of people are walking across a sandy beach, carrying a large board. The board appears to be red, and it is being carried by three individuals: one person is carrying the front, another person is carrying the middle section, and a third person is carrying the back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 495513, "caption": " In the image, a man and a woman are standing next to each other and posing for a photo. The man is holding a bottle and a beer, while the woman is holding a hot dog. There are other people in the background, but they are not as close to the main subjects as the man and woman.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": "According to the picture, there are two dogs lying on a red bed in a room. The larger dog is on the left side of the bed, while the smaller dog is closer to the right side. The bed is in the middle of the room, surrounded by a variety of objects, including a couple of chairs and a vase.", "mscoco_hallucinated_words": [["chair", "chair"], ["vase", "vase"]], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [62, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": "In a room with a television, a cat is sitting on top of a media center, such as a TV stand or shelf. The cat is placed on the left side of the media center and is positioned in front of the TV, with its attention directed towards the screen. The room also has a bookshelf located near the center of the scene, possibly providing books or other items for the cat to explore.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The image captures a small town square featuring various public transportation vehicles such as cars, buses, and a truck parked in different parts of the area. The town square has a statue prominently displayed in the center, surrounded by trees and paved roads. In the scene, there are several traffic lights placed at various intersections, with two traffic lights at the upper left and right corners, and another two at the lower left and right corners.", "mscoco_hallucinated_words": [["bus", "bus"]], "mscoco_gt_words": ["truck", "traffic light", "cat", "car", "person"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 406611, "caption": " The image depicts a busy ski resort, with many people standing around and skis visible. Snow covers the ground, as everyone seems to be dressed and ready for skiing activities. Some people are wearing brightly colored ski jackets, making them easier to spot in the crowd. Skis are scattered throughout the area, with a set of skis at the left side of the image, more towards the center, and another pair of skis on the right side. People can be seen standing in various positions, with some in the background and others closer to the foreground. Overall, it is a bustling atmosphere at the ski resort, with numerous skiers and snow enthusiasts gathered together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": "The image shows a white truck with a satellite dish parked on the side of a road. The truck appears to be stationary, with several satellite dishes on its roof, creating a bit of interest for passersby. In the vicinity of the truck, several people can be seen, each walking or standing around the vehicle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "remote", "tv", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "According to the picture, the scene shows a man and a woman sitting outside at a table, enjoying a meal together. They are dining on a slice of pizza, which is placed on a plate on the table. The woman is holding a fork, likely used for cutting and serving the pizza. A bottle can be spotted near the table, and a cup is placed on the right side. A bottle is also present at the back of the image. The setting appears to be relaxed and casual, with the pair enjoying their meal together.", "mscoco_hallucinated_words": [["fork", "fork"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["pizza", "cup", "dining table", "cell phone", "chair", "person"], "hallucination_idxs": [48, 60, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 69757, "caption": "The image features three people, a man, a woman, and another person, sitting around a dining table with chairs on each side. The woman appears to be cutting a cake that is placed in the center of the table. There are two cups on the table, one closer to the center of the table and the other one towards the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "cake", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 432213, "caption": "In the image, there is a plate with a sandwich and a serving of french fries. There is another sandwich and french fries at the table. The sandwiches and fries appear to be the main dishes of the meal. A number of cups and bottles can be found on the table, likely containing drinks or condiments. The table is set at a dining area in a restaurant or other food service establishment.", "mscoco_hallucinated_words": [["cup", "cup"], ["bottle", "bottle"]], "mscoco_gt_words": ["knife", "dining table", "bowl", "sandwich", "wine glass", "person"], "hallucination_idxs": [46, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 368367, "caption": "According to the picture, a woman is shown talking on a cell phone, holding it close to her face and smiling as she communicates. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "There is a sand-filled street at the center of the scene with traffic lights above it, creating a unique view. The street extends from the right side of the scene to the left side, and some of the sand has been pushed aside or rearranged. Two signs can be seen, one on the left side and the other on the right, indicating direction and informing drivers.", "mscoco_hallucinated_words": [["driver", "person"]], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 92131, "caption": " The scene is set in front of a parked bus, with a large green bus with a foreign writing prominently on its front, facing the viewer. A number of people are gathered around the bus, with some approaching from the left side, others from the right side, and some standing near the back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "handbag"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": " The image shows a woman laying on top of a suitcase, smiling. The woman is the main subject, while the suitcase is the background. The suitcase is prominently displayed in the scene. The woman is posing, likely enjoying the moment, and the suitcase can be seen behind her.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "The scene takes place in a baseball game, likely a professional or semi-professional match. There are several players scattered around the field, including a baseball player up to bat, a catcher, and an umpire, who are the main focus of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 160183, "caption": "The image shows a red and white fire truck parked outside of a building, likely in front of a garage or parking lot. There is also a car parked near the fire truck, with another vehicle located further away in the image. Two people are visible, one on the left side of the fire truck and another on the right side, standing close to the truck. They appear to be inspecting the fire truck or simply enjoying the sight of it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 555456, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 235595, "caption": "In the image, several motorcycles are parked along the sidewalk next to a street, forming a row of motorcycles in the same row. One of the motorcycles is on the left side of the image, with another one in the middle, followed by another one on the right side of the image. There are people in the scene, with some standing near the motorcycles and others a bit further away. The people in the picture are positioned on both sides of the row of motorcycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "motorcycle", "potted plant", "car", "parking meter", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 489909, "caption": "The image displays various fruits, including an orange, a banana, and two or three more fruits in the background. These fruits are placed in a basket, creating a visually appealing display of fresh produce. The orange is located in the center of the basket, while the banana is positioned next to it. The other fruits appear to be placed towards the right side of the basket, creating a pleasing combination of fruits for a healthy and delicious snack.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": "According to the picture, a large vase with a single flower on top is filled with a bouquet of colorful flowers, likely poinsettias, which are red and pink. The bouquet is composed of multiple flowers throughout the vase, covering the entire interior. The flowers are arranged in an eye-catching pattern, making the arrangement visually appealing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": " The scene shows several people in a living room, with a woman standing in the center, holding a video game controller. There are a few additional individuals in the room, situated closer to the edges. They are likely enjoying a video game together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "tv", "potted plant", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 409138, "caption": "The image features a city street with several vehicles parked along the side and the bus, which has the words \"image content,\" written on it. One car is located in front of the bus, while two other cars are nearby. Additionally, there are two vans located further down the street. The bus is parked next to the sidewalk and a person is visible walking behind it. The scene appears to be near a parking area where the bus has stopped, potentially for passenger pick-up or drop-off.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411840, "caption": "In the image, two women tennis players are walking across the court, each holding a tennis racquet in their hands. The women are walking from left to right, leaving a space between them as they proceed. There are no other people or objects present in the scene, except for a backpack, which is located towards the left side of the image. The entire scene takes place on a tennis court, emphasizing the sport.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "backpack", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": " The image shows a slice of pizza with cheese and onions, prominently displayed on the table. The pizza has been cut into several pieces, each with a slightly different size and shape. The scene features a significant amount of cheese on top of the slice, as well as a variety of green onions throughout the pizza.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 148977, "caption": " In the image, there is a teddy bear with headphones on its ears sitting on a desk or table. The headphones are on top of its head, covering its ears, which is a unique and interesting aspect of the scene. The teddy bear appears to be placed near other objects on the table, which might include books and a cellphone. The books are located on the left and right sides of the table, while the cellphone is situated further to the right. The teddy bear, books, and other objects occupy the scene, creating an interesting visual display.", "mscoco_hallucinated_words": [["cellphone", "cell phone"], ["cellphone", "cell phone"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear", "dining table", "bottle", "book"], "hallucination_idxs": [65, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 399058, "caption": " The scene depicts an elephant walking down a road in front of a truck, with a man riding the elephant. The man can be seen seated on the elephant's back, enjoying the unique experience of riding the massive animal. The truck is traveling in the same direction as the elephant, behind it. There are also two cars visible, one closer to the truck and another one further down the road. A person is present, but it's unclear if they are riding on the elephant or just walking alongside it. The atmosphere is relaxed and possibly even festive, with the two people enjoying their time on the elephant and truck.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["truck", "bicycle", "elephant", "orange", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.06666666666666667, "Recall": 0.6}}, {"image_id": 183571, "caption": " According to the picture, a man and a dog are on a small red boat, with the man standing on a hose near the left side and the dog on the right side of the boat. There are two bottles on the boat, with one placed near the man and the other on the right side. The scene also depicts several other small boats scattered around the main boat.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 1.0}}, {"image_id": 325836, "caption": "The image showcases a motorcycle with a purple and orange paint job, standing out among several other motorcycles and a car. The motorcycle is on the left side of the picture, while other motorcycles are located throughout the scene. A red car is positioned near the center of the picture, and a few other people can be seen in the background, possibly observing the collection of motorcycles.", "mscoco_hallucinated_words": [["orange", "orange"], ["car", "car"], ["car", "car"]], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [9, 21, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 437910, "caption": "The image captures a small bathroom with a toilet, located at the far right of the room. The toilet is positioned near the door, creating an inconvenient and cramped space. The bathroom also has a sink in the corner, to the left of the toilet. There are a few books scattered around the room, indicating a possible storage area or a reading spot. The presence of newspapers may add to the room's clutter and make it feel cramped. Overall, it is a small, compact bathroom with limited space for personal hygiene and convenience.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": "The image captures a black and white scene with a group of birds swarming around a man sitting on a boat at a dock. The birds are spread out over the water, with some closer to the man's boat while others are further away. The view is a bit old and vintage, providing an old-fashioned perspective. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 467887, "caption": " The image features a man, who appears to be in his mid-50s, sitting at a table using a laptop. He is wearing a red shirt and has glasses on. The table has a chair in the foreground and a potted plant located close to the laptop.", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["dining table", "laptop", "mouse", "chair", "person", "couch"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 174402, "caption": " The image features a tennis player in a blue shirt holding a tennis racket on the tennis court. A sports ball is also present in the image, possibly the one the player is attempting to serve or return. Another person is visible in the scene, possibly a teammate or a spectator, with a backpack nearby. The sports ball is situated on the far right side of the image.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 352734, "caption": "The image features a young boy sitting at a dining table eating a piece of chocolate cake with a fork. There is another person in the room, sitting in a chair near the child. The cake appears to be a focal point in the scene, with several pieces of chocolate cake spread across the table. The child uses a spoon to feed himself the delicious dessert.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 1.0}}, {"image_id": 346716, "caption": "In this scene, a double-decker bus is driving down a city street surrounded by traffic. There are several other buses behind the lead bus, and some are parked, likely during their normal operating schedule. The street is quiet except for the buses, which stand out as they make their way down the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": " The image captures a snowy mountain scene, with two people standing on top of the snowy slope, observing the surroundings. One person appears to be wearing skis, while the other person is not wearing any. There are several sets of skis placed near each person, suggesting that they may be skiers. The sky is overcast, adding a bit of tension to the atmosphere. The mountain landscape features various mountains in the background, creating an awe-inspiring view.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295656, "caption": "According to the picture, the scene features several people dressed in fancy attire, eating dinner together. There are three men and one woman, with each person holding food on a plate. Some of the people are smiling, likely enjoying the dinner party. One person is wearing a tie and another is wearing a bow tie. In the background, there are other people sitting at a dining table, possibly enjoying their meals as well. ", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["pizza", "clock", "tv", "tie", "car", "chair", "person"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.2857142857142857}}, {"image_id": 129886, "caption": " The image features a group of skiers gathered at a ski slope with a ski lift in the background. Several pairs of skis are visible, including ones with two people each, on the snowy slope.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, a young man is sitting at a table with a bottle in front of him, likely focusing on something he is holding in his hand. The bottle seems to contain a liquid, such as an alcoholic beverage. There is a pineapple situated close to the bottle, and a few other cups can be found scattered around the area, as well as several pots on the table.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 401010, "caption": "There is a white bed situated in the middle of the image, with a black chair next to it. Near the bed, there is a desk with a laptop placed on top of it. A person can be seen sitting in front of the desk, possibly working on a laptop or computer.", "mscoco_hallucinated_words": [["desk", "dining table"], ["laptop", "laptop"], ["person", "person"], ["desk", "dining table"], ["laptop", "laptop"], ["computer", "laptop"]], "mscoco_gt_words": ["sink", "chair", "bed", "tv"], "hallucination_idxs": [28, 31, 39, 48, 54, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.5}}, {"image_id": 567562, "caption": "According to the picture, there is a woman sitting at a dining table with her two children. They are all enjoying a meal together, with pizzas, slices of pizza, and wine placed on the table. A book is also located in the room, possibly for entertainment or reference while they eat. There are several chairs in the room, with one located near the table and the other positioned in different areas of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "dining table", "bowl", "spoon", "bottle", "fork", "broccoli", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.45454545454545453}}, {"image_id": 313928, "caption": "A person is standing on a patio overlooking the beach, enjoying a day outdoors. The patio features a table with chairs and a large bottle of drink on the table, as well as a yellow surfboard leaning against a nearby wall. The person appears to be looking out at the water and enjoying the view from the patio.", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["surfboard", "dining table", "bottle", "wine glass", "chair"], "hallucination_idxs": [1, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.8}}, {"image_id": 490284, "caption": "In the image, a dog's head is seen resting on a person's arm, looking up and possibly begging for attention. There are other people present in the image, but none of them are touching the dog. The person the dog is resting on is seated on the couch, with a hand placed on the dog's head. A chair is also visible in the background, along with a clock situated on the right side of the frame.", "mscoco_hallucinated_words": [["chair", "chair"], ["clock", "clock"]], "mscoco_gt_words": ["couch", "person", "dog"], "hallucination_idxs": [67, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 346940, "caption": " There is a stove in the image with a kettle resting on top. The kettle has its spout at the right side of the stove. A hot water pot and a teapot can also be seen on the counter near the stove, and a bottle is placed in the background. Steam is rising from the pot on the stove, adding a steamy, cooking atmosphere to the scene. ", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 222782, "caption": "The image depicts a woman walking down the street in front of a gas station, with the sign for the store visible in the background. The gas station's sign is in black and white, indicating the time period. The woman is walking on a sidewalk with a large gap on the sidewalk. A few other pedestrians can be seen walking in the same direction as the woman.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 255934, "caption": " The image features an adult elephant and a baby elephant walking next to each other in a fenced area. The baby elephant is following closely behind the larger one, creating a striking scene. There are two people visible in the scene as well, standing in the background on the left and right sides.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 278627, "caption": "According to the image, there are various pictures of people enjoying food at a restaurant. There is a group of people, including a man and woman, eating different types of food. Some dishes include shrimp and vegetables. Other pictures show fruits and vegetables served at the restaurant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "potted plant", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 400851, "caption": "According to the picture, there is a group of people gathered on the beach in front of a lake. Several individuals are sitting and lounging on a bench near the water, while others are standing or walking in the background. There are multiple tents set up nearby, possibly providing shade and protection from the sun.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "umbrella", "chair", "backpack", "boat", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 472610, "caption": "In the image, a woman is standing next to a dog, both outdoors in a field. There are multiple cows in the background of the scene, some of them positioned closer to the camera. There are also several cars scattered throughout the image, with some parked near the field and others further in the background.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["cow", "person", "dog"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 78565, "caption": " The image portrays a group of people enjoying a day at the beach. There are several small sailboats scattered around in the water, creating a pleasant and playful atmosphere for the people. A dog is also present in the scene, possibly chasing after one of the sailboats or simply enjoying the beach experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388237, "caption": "According to the picture, a young man is riding a skateboard down a street or alleyway, in the middle of an outdoor gathering or event. Several other people can be seen around him, some of whom are enjoying the gathering or standing in different spots, including some close to the skateboard rider. There are several backpacks scattered throughout the scene, possibly belonging to some of the people in attendance.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["skateboard", "chair", "person", "bottle"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 289640, "caption": "According to the picture, there are several people sitting around tables in a room. A man with glasses is sitting in a chair with an orange frisbee over his face, and another person is close by. Additional people can be seen in the background, either standing or sitting, and some of them seem to be holding cups or bottles. There are also three forks placed on the table in front of them.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["fork", "fork"]], "mscoco_gt_words": ["cup", "dining table", "frisbee", "orange", "wine glass", "chair", "person"], "hallucination_idxs": [64, 70], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8571428571428571}}, {"image_id": 122602, "caption": " The image displays a table with a variety of crafts and materials spread across its surface. A pair of scissors is located on the left side of the table, while a bottle of glue is situated in the middle. To the right of the bottle, there are various papers and a few cardboard pieces arranged on the table. The overall scene resembles a creative workspace with assorted tools and materials laid out and ready for use.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 24104, "caption": "The image shows a desktop setup consisting of a television, two monitors, a book, and other assorted items. The TV is located on the left side of the setup and displays a program, with a second monitor situated on the right side. There are three keyboards placed on the desk, one in the middle and two on either side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 183709, "caption": "In the image, there are multiple people standing on the sidewalk, possibly waiting to cross the street. The scene features a crosswalk, which is a designated area for pedestrians to safely cross a road. Among the pedestrians, there are a few individuals near the front, one in the middle, and others towards the back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 59710, "caption": "According to the picture, a small, black and white refrigerator is open and visible in the room. Inside, there is a bottle on the shelf, indicating a collection of beverages. Furthermore, there is a door in the background, which might lead to another part of the house or storage area. The room itself appears to be a bedroom or living space, as there is a bed visible, positioned near the wall on the right side. The overall atmosphere of the scene seems casual and friendly, with a focus on the refrigerator and the mini-fridge as important features of the interior design.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bed", "bed"]], "mscoco_gt_words": ["chair", "refrigerator", "dining table"], "hallucination_idxs": [25, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "According to the picture, there is a little boy wearing a baseball glove and holding a baseball in a baseball field. He is standing close to the fence that surrounds the field, with the ball in his hand and a baseball glove on his hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 314729, "caption": "The image depicts a vintage photo of a baseball player in a white uniform, standing and holding a baseball bat. The player has a serious expression, and it appears that he may be posing for the photograph. The background of the photo is largely white. The photo has been preserved over time, maintaining its original style, and provides a glimpse into baseball's past.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": " According to the picture, there is a large desk with two computer monitors at its center. The monitors are positioned side by side, displaying various images. Next to the monitors, there is a keyboard and a mouse, which have likely been used for work. On the right side of the desk, a wireless keyboard and cell phone are visible.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 411564, "caption": "The image features a man sitting on a table in front of a window, using a cell phone. He is sitting on the table with a bottle near him, possibly enjoying the outdoors or the view from the window. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "dining table", "bottle", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 320350, "caption": "The image features a man and a woman standing next to each other on their skis, both wearing snow gear. There are three dogs with them \u2013 one is slightly in front of the couple, while the other two dogs are slightly behind them on the right side. The snow-covered landscape seems to be a flat ground or a slope for the skiers and their canine companions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": "The image features a baseball player at bat, preparing to swing a baseball bat at a baseball ball on the field. The player stands near the home base, as seen in the top-left corner of the picture, and is likely waiting for the right pitch to hit. Several other people, likely teammates or opponents, are present in the scene, spread throughout the field and behind the players.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bench", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 285302, "caption": "In the image, a young boy is sitting at a dining table, surrounded by various crafting materials. He is holding some paper and appears to be cutting it with a pair of scissors. On the table, there is a crayon close to the boy, along with a couple of other crayons scattered in different places around the table. The boy is using the scissors to cut the paper and create a craft.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "chair", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 61959, "caption": "The image depicts a large bus driving down a street in a city. The bus has a colorful advertisement on its side, which might be a form of public transportation. There are several cars parked on the road near the bus, possibly making it difficult for the bus to pass through the area. A few people are visible in the scene, and there is a person standing further away from the bus on the side of the street. Overall, the scene showcases the bus and various other modes of transportation along a busy city street.", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [60, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.25}}, {"image_id": 229659, "caption": "The image shows a laptop computer sitting on a table in a restaurant, with a keyboard, mouse, and monitor on the table. There are a few chairs placed around the table, and a man is seated at the table using the laptop. A few cups are scattered on the table as well, indicating that drinks were being consumed during the meal or other activities.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["mouse", "mouse"], ["cup", "cup"]], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [16, 18, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.23076923076923078, "Recall": 0.7142857142857143}}, {"image_id": 473118, "caption": "The image showcases a person riding a skateboard while doing tricks on the ramp, highlighting the fun and excitement of skateboarding. The skateboard is positioned on a ramp as the rider is skillfully riding it, demonstrating their abilities and techniques. In the scene, several vehicles can be seen, including a couple of bicycles nearby, as well as a car on the right side of the image. The presence of these additional elements adds to the busy and active atmosphere of the image.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 391222, "caption": "Two people are standing in the snow at the top of a slight hill, wearing skis and holding ski poles. They are positioned next to each other, and a tablet or a sign is placed nearby. The area is blanketed with snow and has a snowy landscape. There are several trees scattered around the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416104, "caption": "In the image, several people are sitting around tables underneath an umbrella. The people seem to be enjoying the outdoor atmosphere, possibly having a conversation or drinking beverages. A variety of seating options are visible, including several blue chairs placed around the tables, as well as a bench in the background.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["umbrella", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 511852, "caption": " The scene depicts a pair of skiers walking on a snowy slope. There are two men, both wearing skis and with backpacks, facing the camera, and they are hiking up the mountain side by side. One person is carrying skis on their back, while the other person has skis attached to their boots. They are both wearing backpacks and seem to be having a good time. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 59386, "caption": " The image portrays a group of giraffes gathered around a tree, feeding on leaves. There are four giraffes in the scene, with some standing close to the tree while others are further away. The tree appears to be the main source of food for these animals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "The image depicts an intersection in a city with various streetlights and a green traffic light with a smiley face on it. The intersection is bustling with cars of different types and sizes, both parked and moving around the city.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": " The image captures a man and his dog riding their bikes across a pedestrian crossing, with multiple traffic lights. The man rides his bike while walking, with the dog riding alongside him on the same bike. The crossing is empty of cars, but the traffic lights and the lights on the street are clearly visible.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "dog", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 265628, "caption": "The image is of a toilet, located near the corner of the room. A man is standing in front of the toilet, with his feet positioned close to the edge, wearing black shoes. The toilet is white, taking up a significant portion of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499594, "caption": " (Potential question) In the image, a piece of meat is served on a plate with broccoli, accompanied by a bottle of red wine and various sauces on the table. There are also some spices and a fork on the table. The meal appears to be an appetizing and flavorful dish.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bottle", "fork", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 5934, "caption": "The image depicts a baseball and a baseball glove in close proximity. The baseball is on the left side, sitting close to the glove, which is on the right side. This scene is in a dining area, likely a bench or a table, with the baseball and the glove sitting on top of it.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": " The image portrays a night scene with a blue river and a bridge over it. There are a few boats visible on the river, one being close to the middle of the bridge, and the other boats located near both ends of the bridge. In the foreground, there are people walking along the shore, with some of them standing near the boats. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": "In the image, there is a dog running with a frisbee in its mouth, with two other dogs in the scene. One dog is located towards the right side of the image, while the other dog is in the top left corner. They seem to be playing or interacting with the frisbee in the grassy area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bench", "dog", "frisbee", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 255483, "caption": "This image shows a person sitting at a table, likely writing or working on some sort of paper. There are multiple papers placed on the table in front of them. The person is holding a pen, likely a pencil or pen. They are working on the papers. It appears that the person is focused on the task, as they are sitting with their hands on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "person", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 131138, "caption": " The image shows a computer desk with a TV screen, keyboard, and mouse. On the desk, there is also a laptop placed nearby. The TV screen takes up a significant portion of the desk and dominates the area, while the laptop is positioned on the right side. A few cell phones are also visible, placed on the left side of the desk.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "spoon", "laptop", "potted plant", "mouse", "person"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 0.5555555555555556}}, {"image_id": 258209, "caption": "The image shows a man sitting on a bench near a building, probably in a city or urban area. The man is talking on a cell phone. The bench he's sitting on has two curved metal backs, one to the left and the other one to the right. Another bench can be seen in the background, closer to the street.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["bicycle", "book", "bench", "person"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 347453, "caption": "In the image, there are several red and white buses driving down the street. One red bus is closer to the front of the scene and one white bus is positioned further back. A number of cars are also present in the street, with three cars visible on the left side and another four cars on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 285820, "caption": "The image shows a well-equipped living room with a couch, desk, and a television, all sitting on a rug. A large TV occupies a significant portion of the room, positioned near the couch. The desk is next to a few small chairs and situated at the other end of the room, along with a cup of a cool drink on it. The couch is located at the center of the room, giving an overall comfortable feel. Various lamps can be seen placed around the space, adding a cozy ambiance to the scene.", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"]], "mscoco_gt_words": ["bench", "tv", "dining table", "bowl", "cell phone", "book", "couch"], "hallucination_idxs": [47, 61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.42857142857142855}}, {"image_id": 245664, "caption": "A desk is showcasing a collection of personal belongings and technology, including a computer monitor, a laptop, a keyboard, and various items spread out across the wooden surface. The computer monitor, located at the front of the desk, has a screen displaying images and documents. The laptop is placed towards the right side of the desk, close to a few books and papers, and a keyboard is placed nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "wine glass"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 453731, "caption": "According to the picture, a dog is lying on a wooden table in front of a building. The dog appears to be sleeping and is positioned on the table with its legs stretched out. The background includes a variety of beverages, with some being displayed in the front and others in the back. Some of the bottles are closer to the camera, while others are farther away. The dog is the main attraction in the image.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["cup", "dog", "bottle"], "hallucination_idxs": [12, 30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": "According to the picture, a group of people, including a woman and a man, are playing a video game together in a large room. They are all actively engaged in the activity, with one woman holding a Wii remote and a man nearby. They are standing next to each other, forming a group. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "refrigerator", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 253262, "caption": " The image depicts a small airplane, likely a private jet, parked on a tarmac or airport runway. Two men can be seen in the scene, standing near the front of the aircraft. They may be airport personnel, such as pilots or ground staff, or simply passengers who have just disembarked from the plane.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "suitcase", "tie", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 87052, "caption": "The image depicts a small kitchen, possibly in a residence, with a couple standing and sharing the space. The man is standing towards the back of the kitchen, while the woman is to the left, in the front. They appear to be attentively listening to the person speaking in the kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "oven", "refrigerator", "bowl", "bottle", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 357578, "caption": " The scene features a red fire truck parked on the street, partially visible. There are several cars parked on the street, including a white car in front of the fire truck, a white car behind it, and a black car further down the street. In addition, there are three other cars along the street, including two parked towards the center and the last one parked further towards the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " In the image, a man is sitting on a couch in a living room, holding a cell phone. The couch is positioned towards the left side of the room and the man is sitting slightly to the right. The living room is decorated with a fan, positioned above the couch. The room is lit by various lamps, including one located on a table and a few others placed around the room.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["remote", "cell phone", "book", "chair", "person", "couch"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 259345, "caption": "According to the picture, a man with a beard and mustache is sitting in a chair, holding a large pink umbrella, and wearing a long multi-colored dress. He is the main focus of the scene, occupying the central area. There are a couple of other chairs near the man, but they appear to be empty, as he is the only person in the image. The man is not interacting with any other chairs or objects in the scene, merely sitting there holding the umbrella.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 119709, "caption": "According to the picture, a woman is preparing donuts on a red table, surrounded by various other donuts of different shapes and sizes. The table is full of piled-up donuts, with some of them on the ground. Some of the donuts appear to be in a pile, while others are scattered around the table. The woman can be seen wearing an apron and glasses while she works on the delicious treats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "donut", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, two people are sitting on a bench in a subway station, both using their cell phones. The main man is checking something on his cell phone while the other man is sitting next to him. The bench appears to be a common seating area for people waiting in the subway station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": "In a busy urban environment, a small group of people is sitting on a balcony with a view of the street. Two large umbrellas, one with an artful metal sculpture on it, are on either side of the balcony. The people appear to be enjoying their surroundings and taking in the beautiful day outside. There is another umbrella further behind the group, and a bench can be spotted towards the right side of the image. Additionally, a few handbags can be seen on the table, one closer to the left and the other in the center of the group.", "mscoco_hallucinated_words": [["bench", "bench"], ["handbag", "handbag"], ["table", "dining table"]], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [71, 87, 93], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": "In a tennis court, a man is playing tennis, with a tennis racket in his hand, about to hit a ball. The ball is located in the center of the court, near where the player is swinging the racket. There are also several other people present, some of which can be seen standing along the edges of the court or near the man playing tennis.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "A woman is standing on the sidewalk next to a van, surrounded by various storefronts, including a business with the name \"Asian Food\" and several other shops. Another person is walking along the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "handbag", "traffic light", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 276719, "caption": " There is a scene with a small dog standing on the sidewalk in front of a blue car and two vintage cars, creating a unique and interesting intersection. The dog is positioned in the middle of the street, with two additional people walking nearby. The blue car appears to be an older model, while the other two cars have a more classic and vintage look. The people are spread throughout the scene, with some walking in different directions and others standing more close to the vehicles or the dog. There are also a few items on the ground nearby, such as a bottle and a book. Overall, the scene appears to be an engaging social gathering around the unique intersection.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["book", "book"]], "mscoco_gt_words": ["truck", "handbag", "motorcycle", "dog", "car", "chair", "person"], "hallucination_idxs": [111, 114], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": "In the scene, there is a large snow pile near a horse and a carriage. The horse, likely a horse-drawn carriage, is covered in snow, while the carriage is parked next to it. The snow appears to have piled up in front of the horse and carriage, possibly after being pushed or transported. Two people are present, one near the horse and the other further away, possibly watching the scene or waiting for the horse and carriage to move.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "snowboard", "car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 354559, "caption": "The image features a woman in a black dress, standing close to each other. The woman is holding a teddy bear, possibly giving the impression that she's trying to hug it or protect it. She looks pretty, with brown hair and brown eyes. In the scene, there are a couple of chairs positioned in the background, and the background itself is relatively dim.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": " In the image, there are two pieces of white cake arranged on a dining table. One piece is in the process of being cut, likely by a person holding a knife nearby. The other piece is slightly bigger and fully cut, with a portion of it missing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 468005, "caption": "The image features a large grey elephant walking down a street or sidewalk with several people around it. Some of the people appear to be interacting with the elephant, while others are watching it from a distance. There are also two umbrellas present in the scene, possibly for protection from the sun. Overall, the image showcases the animal and its environment, with the people around it contributing to the scene's atmosphere.", "mscoco_hallucinated_words": [["umbrella", "umbrella"]], "mscoco_gt_words": ["backpack", "person", "elephant"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 545594, "caption": " In the picture, a man wearing a tie-dye shirt is standing on a bus, holding onto a backpack. There are several people around him, with one person standing near the front and two others towards the back of the bus. Another person is sitting nearby. Several seats are available for the passengers, with two along the side and one near the middle of the bus. The scene appears to be a busy bus ride with many people onboard.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 219546, "caption": "The image features a table laden with various food items. The table is set for a party or celebration, and it contains bowls, dishes, and utensils filled with a variety of food, including meat and cheese.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "spoon", "bottle", "fork", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 386645, "caption": " The scene depicts a rickety building with its roof partially collapsed. A blue truck is parked in front of the building and several people can be seen standing near the truck, with some of them walking past it. The truck appears to be the centerpiece of the scene, surrounded by people.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 155131, "caption": " The image shows a woman in a white dress standing on a tennis court, leaning forward and preparing to serve a tennis ball. She is the main focus of the scene. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tie", "tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 188752, "caption": " The image depicts a man standing near two large bunches of bananas hanging from the ceiling at an outdoor market. There are many bananas in the bunches, with some bananas hanging individually and others clustered together. The bananas appear fresh and plump. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": " A group of people is sitting outside at a restaurant, enjoying a sunny day underneath a series of red umbrellas, providing shade and comfort. There are multiple tables and chairs set up for the diners, who are dressed in a variety of attire, including jackets, t-shirts, and dresses. The diners are seated in chairs near the tables and some people are holding drinks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "umbrella", "dining table", "bowl", "tie", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437720, "caption": "According to the image, a naked woman is sitting at a table, looking like she is working on something. The painting depicts a feminine figure sitting on a chair with red chairs in the background.  There is another chair in the room that appears to be a kitchen chair. The artwork suggests the scene takes place in a studio setting.  The painting seems to have been created in the mid-20th century.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 378831, "caption": "The image depicts a woman walking past a fire hydrant. She is carrying grocery bags and walking towards the right side of the picture, with a person also visible in the background near the fire hydrant. The scene is set in a parking lot or an open area, where the woman appears to be heading towards her car or a destination in the distance.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["fire hydrant", "handbag", "person"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 375503, "caption": "There is an image of a staircase in a home or building, possibly leading to an upper or lower level. The staircase features several wooden railings and banisters, creating an elegant and decorative style. There is a wooden desk with a clock on top of it in the room near the staircase. The clock may have been placed there as an ornamental element. There are a few pieces of furniture, including a wooden desk, and a doorway can be seen on the left side of the room. The room has a somewhat traditional or vintage style, with wooden decorations and a clock as the main focal point.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 330369, "caption": "The scene features a red and white boat in the water at a dock. The boat is large, occupying a significant portion of the image, and has a distinct shape with a mix of red and white colors. There are other boats nearby, positioned closer to the right side of the image, with one boat taking up a central position and another boat closer to the top of the picture. Overall, it's an interesting view of a vessel on the water in an ocean setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": " According to the picture, there is a man standing next to a small tree stump, throwing a red frisbee near the tree trunk. In the background, a couple of bicycles are visible, with one located near the left side and another on the right side. The scene appears to be set in a park or an outdoor area, and the frisbee appears to be flying in a circular motion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 32190, "caption": "The image captures a small toy horse, likely a collectible figurine, placed on the sidewalk in front of a street. The street appears to be in a city setting with various cars parked around the area. Some of the cars are closer to the sidewalk, while others are further away. The toy horse is situated near the middle of the sidewalk, creating an unusual scene in the city.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 535403, "caption": " The image features a bathroom with two sinks, one larger and one smaller, located in different areas of the room. The larger sink is on the left side of the image and is accompanied by a mirror above it. There is also a small sink located near the back of the room, but it can't be seen in the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "potted plant", "bottle", "toilet", "sink", "hair drier"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 449798, "caption": "The image features a home office set-up with various computer items placed on a table. There is a keyboard on the left side and a mouse placed near it. A laptop is also present on the table, partially covered with a mouse and various keychains. In the top center of the table, there's a small computer screen, with a keyboard and mouse close by. The table appears to be the only piece of furniture in the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "dining table", "laptop", "mouse", "cell phone", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375769, "caption": " The image depicts a young boy sitting at a dining table in a restaurant. He is wearing a helmet and has a hoodie on. The boy appears to be eating a doughnut while also drinking water. A bicycle is present in the scene, situated at the right-hand side of the table.", "mscoco_hallucinated_words": [["doughnut", "donut"]], "mscoco_gt_words": ["bicycle", "cup", "umbrella", "dining table", "hot dog", "car", "chair", "person"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.375}}, {"image_id": 426172, "caption": " The image depicts a dining area with a table featuring a wine rack and several books. One of the books is a large book about wine placed on top of the table, with its title visible. Another book is located near the left side of the table, and there are additional books scattered around the room. The table also has a broom placed closer to the center, possibly for cleaning purposes. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bottle", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 20650, "caption": " The image features a dining table with a couple of items on it. There is a large, orange plate placed on the table that is covered in broccoli and two slices of bread. The broccoli is not only on top of the sandwich but also served separately on the plate. There is also a fork placed alongside the dish on the table. In addition to the sandwich, a couple of cups are present on the table. One cup is placed closer to the sandwich, while the other cup is situated towards the left side of the dining table.", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "broccoli", "orange"], "hallucination_idxs": [76, 84, 95], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.8333333333333334}}, {"image_id": 138078, "caption": "In the image, there are two young baseball players standing on a field. The first player, wearing a red shirt and white pants, is in the foreground, holding a baseball glove. The second player, also in red and white, is more in the background and seems to be watching the first player. They both appear to be focused on the game and ready to make a play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 152499, "caption": "According to the picture, a group of people, including some children, are sitting in a pool next to a couple of surfboards, one of which appears to be on top of an air mattress. There is a person wearing a party hat, and another person in a purple shirt. The pool also has a couple of cups.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["surfboard", "umbrella", "dining table", "potted plant", "chair", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 180764, "caption": " The image depicts a man in the process of eating a pink glazed donut. He is the focal point of the scene, and a number of other people can be seen in the background, with some situated near him and others further away. The donut can be seen partially consumed as the man takes a bite. In the room, there are multiple bottles of water, one near the man and the other two located further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "donut", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 147762, "caption": "The image features a plate with a small dessert on it. The dessert is covered with a white substance and has a yellow center. A few knives are spread across the table, giving an idea of the contents of the plate or the meal being served. There is also a fork on the table that seems to be placed in the middle of the table.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["fork", "cake", "dining table"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 113860, "caption": "The image depicts a sailing boat on the river, moving along the water with some trees in the background. The boat appears to be quite large and is the main focus of the scene. There are a few people around the area, suggesting they may be part of the boat's journey or simply enjoying the environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460866, "caption": "The image features a variety of toothbrushes grouped together inside a cup or sink. Some of the toothbrushes are clustered together in the center, while others are scattered throughout the cup. A toothbrush tube can be seen with the bristles visible inside, and another toothbrush can be spotted on the right side. Overall, it's an interesting and colorful arrangement of toothbrushes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "dining table", "bowl", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 177015, "caption": "According to the picture, a man is sitting on a couch, holding a cat and using a laptop. There are a couple of other individuals in the room, with one person occupying a couch nearby and another person seated on a different couch. The setting is a living room or a lounge area with multiple chairs and couches, indicating the comfort and casual atmosphere of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "laptop", "cat", "apple", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "The image features a well-equipped kitchen with a refrigerator on one side and a microwave oven on the other. Two ovens are positioned on the counter, one above the refrigerator and the other below. There is a sink on the left side of the room, and a dishwasher can be found in a corner near the microwave.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "suitcase", "oven", "refrigerator", "bowl", "bottle", "sink", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4444444444444444}}, {"image_id": 47131, "caption": "The image depicts a group of men trying to get onto a train that has derailed. The men are scattered around the wreckage, with some trying to climb on top of the train cars. Some of the men are standing, while others are trying to climb up. There are also a few people looking at the scene from a distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425447, "caption": "According to the picture, a white truck is driving down the street, appearing to be watering a fire hydrant in the middle of a sidewalk. The fire hydrant is located at the center of the image, surrounded by trees and parked vehicles. Some of these vehicles include a white pickup truck, a blue pickup truck, and a car parked on the side of the street. The truck appears to be spraying the fire hydrant, likely to water the surrounding area.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["truck", "fire hydrant", "person"], "hallucination_idxs": [64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.6666666666666666}}, {"image_id": 295105, "caption": "The image features a man standing in front of a refrigerator in a kitchen. He is looking towards a person outside the frame, likely through a window or door. There is also a second person in the scene, standing to the left of the refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "bowl", "bottle", "apple", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 443259, "caption": "This is an image of a large open kitchen with a refrigerator and a dining area. The kitchen features a stainless steel refrigerator and a few cabinets. The dining area has a table in the middle of the room, with several fruits and vegetables displayed on it. There are also a few chairs in the dining area, placed around the table. The overall look of the space is clean and well organized.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "dining table", "bowl", "potted plant", "bottle", "orange", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 537861, "caption": " In the image, there is a yellow and gray train traveling along a train track. The train occupies the middle of the picture, while the left side is occupied by the station platform. There are a few people standing near the train and platform, with one person positioned closer to the train on the left side, another standing in the middle, and another person slightly further away on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 145020, "caption": "According to the image, there is a large stuffed duck with a yellow vest standing next to a toy car and many other toys in a parade setting. The duck is likely a Disney character like Donald Duck or Mickey Mouse, and there are other toys present in the scene. The toys are in various positions around the stuffed duck, with some close by and others further away.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["teddy bear", "bird", "mouse", "person"], "hallucination_idxs": [20], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 249969, "caption": "In the image, a plate of food is set on a table. The meal features multiple kinds of meat, such as chicken and beef, as well as vegetables like broccoli and carrots. The plate is adorned with a variety of sauces and is accompanied by various utensils. These include multiple forks, knives, and spoons, which are arranged around the plate in different sizes and positions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 146504, "caption": " In the image, there is a person sitting at a desk with their hands on a computer keyboard and mouse. The keyboard and mouse are placed on a table in front of the person. The keyboard is slightly to the left side of the mouse, as the person uses their left hand to type or control their computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 40037, "caption": "There is a large clock tower with a green roof near a building in a scene. The clock tower is tall and has a large clock on top of it, indicating a building with a clock face on top. The clock face is grey, which suggests that the clock tower has a stone or masonry appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 343218, "caption": " In the image, two men can be seen on an outdoor tennis court, playing and enjoying a game of tennis. One of the men is holding a tennis racket, likely in the middle of a match or warm-up, while the other man is positioned nearby, waiting to play. There is also a ball visible on the court, perhaps on its way to being swung by the man with the racket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 301837, "caption": "The image showcases a long and narrow kitchen with white cabinets. The countertops appear to be made of granite or another similar stone material. A sink is located in the center of the counter, which is surrounded by various bottles, utensils, and a few books.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["book", "book"]], "mscoco_gt_words": ["clock", "oven", "refrigerator", "apple", "sink"], "hallucination_idxs": [42, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.2}}, {"image_id": 309071, "caption": " The image features an older man bending over to remove the wheel of his bicycle from the paved area. He is standing next to a small potted plant and closer to the left side of the picture. The bicycle is leaning against a fence on the right side of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "potted plant", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 240681, "caption": "In the image, a person is standing next to a red motorcycle on the side of the road. The man is holding a cell phone and appears to be taking a photo of the motorcycle or possibly using it to take a call. The location seems to be an empty street, and there are no other people or vehicles present in the scene. The scene provides a glimpse of an everyday road and the hobby of photography.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 178753, "caption": "The image features a girl with braces smiling and holding a cellphone. She appears to be seated in a chair or a bench in an indoor environment. ", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 278890, "caption": " The image shows a person holding a cell phone, wearing a blue shirt. The person seems to be in the process of using the phone while it's being placed close to their face. The phone appears to be a central focus in the scene, surrounded by various other objects in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": "There is a room with a large bed and other furniture, including a table located near the bed. The bed is situated at the center of the room, and there are a few drawers underneath it. A television is placed to the left of the bed, near the wall, and a second television can be found in the room as well. The overall arrangement suggests a modern design with a focus on functional storage.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["tv", "vase", "book", "bed", "person"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.4}}, {"image_id": 542792, "caption": "According to the picture, a large wooden baseball bat is standing tall, leaning against a building in the foreground. The large bat is a prominent feature of the scene, taking up most of the area around it. The building is situated in the background, likely a multi-story building. The surrounding area is also noticeably empty, with no people or vehicles visible near the building or the bat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 293703, "caption": " A group of three men, presumably friends, is sitting at a table at a restaurant, enjoying a meal of pizza. The table has two large slices of pizza placed on it, one closer to the camera and another further away. Various bowls and cups are scattered around the table, likely containing beverages, salad, or other food items. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 318777, "caption": " The image features a baseball game happening in a stadium surrounded by a large crowd of spectators. There are several baseball players on the field, scattered throughout the scene. A baseball pitcher is positioned towards the center of the image, and several baseball gloves are visible in various locations around the field. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 14773, "caption": " The image features a colorful plate with a hot dog and fries, placed on a table dining. The hot dog is surrounded by the fries, giving the impression that it's a snack or a meal. The table is a dining table made of wood and metal, with several drinks placed around it, including two glasses on the left side and a bottle in the middle. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "fork", "hot dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 521863, "caption": "According to the picture, a young man riding a skateboard is skating through a crowd of people near a library. The skateboarder is skating down the sidewalk, making his way through the busy pedestrian area. The crowd is composed of various individuals, both standing and sitting, who seem to be enjoying their surroundings.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": "In the image, there is a group of four people sitting on motorcycles, one after another. They appear to be in the center of the street, with other motorcycles surrounding them. The riders are wearing helmets, ensuring their safety while on the road. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 211604, "caption": " In the image, there is a young woman lying on the floor, possibly due to exhaustion or illness. She is wearing a hat, a black and white shirt, and pants. A tennis racket is nearby her, resting on the floor in a horizontal position. The tennis racket might be used as a prop or symbol in the photo. There is also a bottle visible towards the right side of the image, which may be a sports drink or an item belonging to the young woman.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "The image features a close-up view of meat being served with vegetables and a glass of beer next to it. The meat appears to be chicken or some other kind of poultry dish. There is a dish of green sauce to the left of the meat and a knife on the table as well. A beer bottle and two bowls are also present in the picture, adding to the overall dining scene.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["bowl", "cup", "dining table", "bottle"], "hallucination_idxs": [50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 563938, "caption": " The scene depicts two people sitting on a couch, with one person wearing sunglasses and the other wearing a tie. Sitting in front of them is a woman, and a man can be seen to the left. Another chair can be seen further back. ", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["bench", "tie", "bottle", "person", "couch"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 461311, "caption": "The image depicts a crowded outdoor park filled with numerous people of all ages, some with kites in the sky. There are multiple kites in the air, with a clear blue sky as the backdrop. Several people are standing and walking around the park, enjoying the kite-flying festivities. The kites appear to be various sizes, with some closer to the camera and others farther away. It seems like the event is a public gathering where people come to fly kites for enjoyment or a celebration.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": "In the image, there is a person sitting on a bed, resting their legs on the edge of the bed. The person is holding a book and a mouse, with a computer keyboard visible in the background. On the bed, there is a map, a bottle, and some other books, adding to the scene.", "mscoco_hallucinated_words": [["mouse", "mouse"], ["keyboard", "keyboard"], ["bottle", "bottle"]], "mscoco_gt_words": ["book", "laptop", "bed", "person"], "hallucination_idxs": [31, 36, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 1.0}}, {"image_id": 261318, "caption": " There is a man sitting outside on a bench, talking on a cell phone. He is wearing a grey shirt and a red chair is located behind him. In the background, a couple of suitcases can be seen, one is closer to the man and another one is farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "bench", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 134768, "caption": "In a close-up view of a computer desk, a keyboard is visible, sitting near the computer screen. A pair of speakers is also situated on the desk, just to the left of the keyboard. A mouse can be seen partially covering one of the speakers. The overall setup suggests a combination of computer components and music equipment, indicating the potential for multimedia and entertainment activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "laptop", "mouse", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 526414, "caption": " The image shows a quaintly decorated sewing room with various household items, including a TV, books, and a computer. The room has a comfortable atmosphere, with a desk and a chair arranged next to a bookshelf. A sewing machine and a couch can be seen in the background, suggesting this is a well-equipped living space.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["remote", "cup", "tv", "dining table", "bowl", "laptop", "cat", "book", "scissors", "couch"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 273132, "caption": "The image shows an old woman, perhaps in her 70s, sitting on a park bench, wearing a hat and with a hand placed near her head. A suitcase is placed next to the woman, likely signifying her traveling or preparing for a trip. In the background, there are numerous people, some of whom are sitting on benches or patios. A tie can be spotted in the distance, and a cellphone, likely belonging to one of the people in the background, is visible as well.", "mscoco_hallucinated_words": [["suitcase", "suitcase"], ["tie", "tie"], ["cellphone", "cell phone"]], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [31, 69, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": "In the image, a man with a black jacket is taking a bite of a hot dog while laughing. There are multiple hot dogs visible surrounding the man. There are also a few people visible in the background at different distances.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": " According to the picture, a baseball player is standing on a grass field wearing a baseball glove and throwing the ball. The player is wearing a black and blue jersey and white pants, giving a casual and cool appearance. The baseball glove is also visible, held by the player.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 272728, "caption": " The image showcases a vase filled with different types of flowers and greens. The vase is prominently placed on a table, creating a nice display. The flowers include pink, orange, and purple, along with green leaves, giving a vibrant and colorful appearance to the arrangement. There are a total of 15 flowers in the vase, spread across different parts of the arrangement.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["vase", "potted plant", "chair", "dining table"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 509811, "caption": "In a room filled with many wooden tables, a woman is sitting at one of them, surrounded by numerous laptops that are spread out around her. There are at least 15 laptops placed on the tables in various locations, some on the desks themselves and others on the floor. The room is organized with rows of wooden tables lined up against the walls, creating a spacious workspace.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "apple", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 412407, "caption": "The image depicts a person holding an open cell phone up to their right hand. The cell phone is displaying a screen with red bars across the top, indicating the phone is active. In the scene, several other individuals are also present in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": " The scene features a woman holding a pink polka dot umbrella on a rainy day in an outdoor market. She is standing amidst a group of people gathered at the market, some of whom are holding their umbrellas to stay dry. The market is filled with vendors selling various products like vegetables and fruits, which can be seen in the image. Another woman can be seen on the right side of the scene, walking away from the main group of people.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 441363, "caption": "The image depicts a man playing tennis, in the middle of the tennis court, swinging a racket as he strikes a tennis ball with it. The surrounding area of the court is filled with other people, either watching the game or waiting their turn to play. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 331084, "caption": "The image shows a large, black dining table filled with various ingredients for making pizzas. There are several uncooked pizzas placed on the table, each with toppings such as onions and tomatoes. In addition to the pizzas, carrots, knives, and utensils are also present on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table", "bowl", "spoon", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 282553, "caption": "According to the image, there are two women standing near the road at dusk. One of them appears to be closer to the camera than the other. They are walking in a grassy area with a car parked off to the side. There is also a barrier in the image.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, there is a woman in a pink shirt, standing by a bench, adjusting her shoes. There is a bicycle placed behind the bench, partially visible in the scene. The bench seems to be a popular place for people to rest or set their belongings.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 3103, "caption": " | According to the picture, the image features a young man wearing a yellow shirt riding a skateboard and performing a trick on a large skate ramp. The skateboarder is in the middle of the trick, leaning into the skate ramp and showing off his skills. Several people are observing the trickster, with a few individuals scattered around the scene, possibly enjoying the performance or waiting for their turn to use the skate ramp.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "The scene takes place inside a kitchen, featuring various appliances and food on a countertop. The counter is filled with fruit, including apples, oranges, and pears, which are scattered throughout.", "mscoco_hallucinated_words": [["apple", "apple"]], "mscoco_gt_words": ["dining table", "bowl", "orange", "sink", "carrot", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.16666666666666666}}, {"image_id": 449675, "caption": "In the image, there is a white plate with a sandwich cut in half on it. A fork and knife are situated near the sandwich, ready for someone to eat it. The plate is placed on a dining table along with another dish or meal on the table as well. There are a few bottles and cups positioned around the table, adding to the dining experience. A knife is positioned close to the sandwich, likely to cut it further. This image could be taken in a restaurant setting or a personal meal being shared or consumed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "spoon", "bottle", "sandwich", "fork", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 191691, "caption": "The image depicts a man shaving his face in a bathroom. He is standing by the sink, shaving his face in front of a mirror, while another person is standing in the background. The bathroom is spacious and features a sink at the side, with a toothbrush and toothpaste visible. The man is getting ready for the day or simply grooming himself before a mirror.", "mscoco_hallucinated_words": [["sink", "sink"], ["sink", "sink"]], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [17, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 89378, "caption": " In the image, a man wearing a red suit is standing near a motorcycle, which is positioned in front of him. The motorcycle is next to a red building with large doors, giving the scene an attention-grabbing appearance. There is another motorcycle situated nearby, this one closer to the building and appearing slightly bigger than the main motorcycle. The scene presents a striking image of the man and the motorcycle, suggesting the possibility of a photo shoot or an interesting event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 99874, "caption": " In the image, a woman is sitting at a table while using her cell phone. She is holding her phone close to her ear, and appears to be in a conversation. The scene is set in a dining area, likely a restaurant or a cafe, with a dining table and chairs positioned around it.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["handbag", "dining table", "bottle", "cell phone", "cat", "person"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 437895, "caption": "There is an image of a bed with a colorful quilt spread over it. There is a lamp situated next to the bed, adding a touch of light to the scene. The quilt features different geometric patterns that add a dynamic, intricate design to the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": "The image features a man in a suit sitting at a desk, looking focused as he points at something on the table in front of him. There are multiple books or binders placed on the table, and one chair is positioned to the left of the desk.", "mscoco_hallucinated_words": [["book", "book"], ["chair", "chair"]], "mscoco_gt_words": ["knife", "dining table", "tie", "spoon", "fork", "person"], "hallucination_idxs": [31, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.3333333333333333}}, {"image_id": 159979, "caption": "The image shows a large group of people standing around a waterway filled with small boats. There are many umbrellas present in the scene, with one being much larger than the rest and covering a number of the boats. The umbrellas provide shade for the people gathered around the boats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 357, "caption": " A baseball game is in progress, with several players on the field. The baseball player on the left side of the image is holding a baseball bat, getting ready to swing. Several other players are scattered around the field, ready for their turn or in different positions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 538891, "caption": "The image features a woman playing tennis on a sports field. She is holding a tennis racket, prepared to serve the ball. The court is surrounded by people, with at least 12 other individuals present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "The image features a train traveling down the train tracks on a sunny day. The engine and cars are spread out across the track, with a blue and yellow train on the right side and several other train cars visible on the left. The train is traveling in the same direction as the rest of the tracks on the left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565778, "caption": "The image shows a blue passenger train on train tracks traveling along a section of track that extends into a tunnel. The train is moving from left to right, giving the impression of a train entering a tunnel. There are two additional cars on the track, one behind the main train and another further in front of it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 7125, "caption": " According to the picture, a group of people is riding bicycles along a path near a river or body of water. There are five distinct bicycles visible in the image, each with a person riding on it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "bottle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": " In the image, a man is holding his necktie, which has an intricate heart pattern. The tie is a red color, and the man appears to be putting on or adjusting it. There are a few chairs present in the scene, likely a workplace setting. Some of the chairs are positioned closer to the man and others are more spread out throughout the scene. A cup is also visible in the background, placed on a surface near the man's upper torso.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["handbag", "dining table", "tie", "chair", "person"], "hallucination_idxs": [73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 280930, "caption": "The image shows a little girl laughing in a kitchen. She is wearing a blue shirt and holding a pan with various food items inside. There are two ovens in the kitchen: one on the left and the other on the right side. The girl appears to be excited and amused about the food in the pan. Additionally, a knife is visible near the left side of the image.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["person", "oven", "refrigerator", "bottle"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 151081, "caption": "The image features a large green field with a single tractor in the center. There are two helicopters, one of which is flying above the tractor, while the other is following the tractor along the highway, creating an interesting and peculiar sight. The airplane is positioned higher than the tractor, flying closer to the sky.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": "The image features a woman in a yellow raincoat playing tennis on a tennis court. She is holding a tennis racket while standing on the court, with several tennis balls scattered around her. The court appears to be an outdoor location, as there is a fence surrounding the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": "The image features a plate of food containing a hot dog and french fries. The hot dog is situated at the center of the plate, surrounded by the french fries and various sauces. There are two ketchup bottles on the plate, one placed closer to the french fries and the other near the hot dog.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "hot dog", "broccoli", "dining table"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 198163, "caption": "In the image, there is a man sitting at a table, surrounded by several white pans filled with a variety of colors. The pans are arranged on the table and cover most of the space, with some closer to the man and others further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 3742, "caption": " A vase of a single rose is placed next to a pink bowl filled with apples on a window sill. The apples are arranged neatly in the bowl, giving the appearance of being both fresh and carefully arranged. The roses, in contrast, add a pop of color to the scene, making it both visually appealing and a bit whimsical.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "apple", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 217593, "caption": " The image features a young boy in a baseball uniform standing in a dirt area near two trees, likely a sandbox or play area. The boy is holding a baseball glove in his hand, ready to throw the ball during a game. There are several other people present in the scene, with a few of them standing further away from the boy.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["handbag", "chair", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 279078, "caption": " The image features a young man standing in front of a refrigerator, holding out an open bottle of beer and showcasing all the different bottles of beer inside the fridge. The bottles are displayed both on the left and right sides of the refrigerator, providing a view of a variety of different brands and types of beer. The man is standing with his hands out, highlighting the bottles, and drawing attention to the large collection of beer in the fridge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": " The image features a group of people gathered around a tennis court, likely watching a game or playing themselves. There are several tennis rackets on the court, including two at the far left side, one in the center, and three towards the right side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "sports ball", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 477016, "caption": "According to the picture, a young girl wearing a pink shirt is standing in front of a refrigerator that has a fish and other items displayed inside it. A woman, possibly her mother or a family member, is also present, along with other people in the background.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["handbag", "oven", "backpack", "potted plant", "sink", "person"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.16666666666666666}}, {"image_id": 301266, "caption": " In the scene, a group of people is gathered around a large dining table. There are two tables, one is located at the left side of the room and the other one at the right side. Various bowls and cups are spread out on the tables, as well as various utensils like spoons, forks, and knives.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["knife", "knife"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "book", "fork", "sink", "person"], "hallucination_idxs": [41, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.7142857142857143}}, {"image_id": 475813, "caption": "The scene features a child, likely between the ages of two and four, standing in a room with their arms outstretched, holding a remote control. The child appears to be playing or engaged in some sort of activity, such as testing the remote.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 558784, "caption": "According to the picture, there is a man standing next to a dugout area near a lake. The man is holding a baseball bat, indicating a possible intention to practice or play baseball. The scene features various trees in different positions throughout the grassy area, adding a sense of nature to the surroundings.", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["backpack", "frisbee", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": " The image depicts a hand holding two silverware utensils, namely two spoons, with their metallic surface and silver color contrasting the black background. The spoons appear to be in close proximity, perhaps being placed side by side or in the hand. The hand and spoons can be seen in a large kitchen sink or silverware sink.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "spoon", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 359164, "caption": "According to the image, there are two small dogs standing on the grass in the backyard. The dogs seem to be enjoying their time outside, likely exploring their surroundings or chasing each other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "chair", "dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 539951, "caption": "The image displays a kitchen with a refrigerator and sink placed in the middle of the space, surrounded by checkers. There is also a stove in the kitchen, as well as a few bottles, including one near the center of the room. The kitchen appears to be old and retro, with the appliances and refrigerator representing the 1950s and 60s era.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["sink", "clock", "oven", "refrigerator"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 143479, "caption": "According to the picture, a group of people is gathered together outdoors in a backyard setting. An old man is holding a tray with a large number of hot dogs, possibly being shared with others in the group. There are also several bottles scattered around the scene, indicating a picnic or backyard BBQ atmosphere.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": " The image depicts a vase with a variety of flowers inside. It is placed in front of a mirror, likely a decorative mirror. The vase is positioned in the center of the scene and has a few orange flowers mixed with various shades of white, purple, and yellow, giving it a bright and colorful appearance. The arrangement of flowers in the vase creates an interesting visual display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 238065, "caption": "According to the image, a person is skateboarding up the side of a ramp, with the skateboard and the person being on the left side. The skateboarder is wearing a blue hat, likely a baseball cap, as they are skillfully riding the skateboard in the skatepark. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, the bathroom contains a toilet and a sink, with a bottle of liquor placed on a shelf near the sink. In the background, there is a sign above the sink pointing towards the bathroom's name. Near the toilet, there is a book titled \"Magic Eye\".", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "book", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 322163, "caption": "The image portrays a group of people in a commercial kitchen, preparing various foods and drinks for a meal. The main person in the scene is a man in a chef's apron, standing in the middle of the kitchen, surrounded by others who are also preparing the food. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "pizza", "oven", "dining table", "bowl", "potted plant", "bottle", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.1111111111111111}}, {"image_id": 230983, "caption": "According to the picture, it captures a scene of a man on a skateboard performing a trick on a green container, which is partially blocked by another person. The skateboarder is in motion, creating an interesting visual contrast by leaping over the green object in the midst of the city. The background includes various buildings, with one located on the left side of the image, two on the right side, and another at the end of the sidewalk. The city setting further enhances the dynamic atmosphere of the scene.", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "A kitchen has wooden cabinets, [IDK], [IDK]. The image also shows a stove top and oven in the kitchen, possibly a combination appliance. There is a refrigerator in the kitchen and a sink next to the oven. The sink and oven are next to each other, which suggests that they might be installed together. There are two chairs in the kitchen as well.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["sink", "microwave", "oven", "refrigerator"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 53431, "caption": " In the image, there is a man sitting on a ledge next to a building, with a laptop on his lap. He is wearing a vest and two ties. The man is also sitting on a chair next to him, with a bottle and a cellphone on the chair.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cellphone", "cell phone"]], "mscoco_gt_words": ["laptop", "chair", "person"], "hallucination_idxs": [47, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 441840, "caption": "In the image, there are two people and a young boy in a baseball field. The boy is wielding a baseball bat, while the other two people are present in the scene as well. One of them appears to be in the background, standing near the edge of the field, and the other one is closer to the boy swinging the bat, possibly an adult supervising or coaching him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53465, "caption": "In the image, there is a cake with the words \"welcome to McDonald's\" written on it, sitting on a plate placed on a dining table. The table is covered with a tablecloth and surrounded by a few chairs, giving the scene a welcoming atmosphere. ", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "fork", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2857142857142857}}, {"image_id": 381031, "caption": "The image depicts a plate of food, consisting of two donuts and a sandwich. The donuts are placed towards the right side of the plate, while the sandwich is located closer to the center. There are also some beverages present on the table, including two bottles of juice and an orange towards the top left of the image. The juice bottles are positioned towards the left and right edges of the table, and the orange is closer to the center. ", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["sandwich", "sandwich"], ["orange", "orange"], ["orange", "orange"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "apple", "microwave", "donut"], "hallucination_idxs": [14, 30, 55, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.42857142857142855}}, {"image_id": 530520, "caption": " In the image, there are two men sitting on the grass in front of a building. One of them is using a laptop, while the other sits nearby, appearing to be engaged in conversation. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "motorcycle", "laptop", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 539555, "caption": "In the image, there are multiple people of various ages and heights, with some standing or walking around. There is a man actively playing with a frisbee, which is visible in the background, possibly flying through the air. The frisbee is also shown in the foreground, being held by the man in motion. Other individuals in the scene appear to be observing the action or chatting with each other.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": "According to the picture, there are three men standing next to a sidewalk near a train. One person is in front of the group, with two others standing further back, close to the train. A handbag can be seen on the sidewalk near the men. The train appears to be a mix of red and white, possibly a public transportation system. There are a few other people walking nearby, including one near the train and another close to the men on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 560439, "caption": " In the image, three snowshoes can be seen, with two men and one woman wearing them and walking down a snowy hill. The group is in the center of the scene, and their snowshoes can be seen throughout the image. The hill is flanked by evergreen trees on both sides, emphasizing the snowy landscape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "snowboard", "backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": " There are several people skiing down a snowy slope, with some standing in line near the path. Among the group of skiers, a man can be seen in the front, wearing a red jacket, while another person is standing slightly behind and to the right. Additionally, there is a person further back, wearing a blue coat, and another person wearing a jacket of an unknown color.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 9628, "caption": "The image captures an exciting scene on the water where a man is skydiving with a parachute. The man can be seen on the water, wearing a harness, as he leaps through the air with a kite, likely behind him. The parachute is also visible, spread out across the scene. In the background, another person can be seen sitting on a stand-up paddle board. The image presents an action-packed water sport, with the skydiving man and the parachute capturing the attention of the scene.", "mscoco_hallucinated_words": [["kite", "kite"]], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 330050, "caption": "According to the image, a man and a woman are standing on a tennis court, each holding a tennis racket. They are both smiling and appear to be enjoying themselves. There are no other people visible in the image, and no other items are prominently featured in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " According to the picture, the scene depicts a plaza or public square with a man standing on a red brick surface, looking up at a kite flying high in the sky. The kite flies off in the distance, near a tall building. Another person is also in the background, standing near the far edge of the plaza. A few other objects are visible, such as a cellphone located near the man and a small car parked in the middle of the plaza.", "mscoco_hallucinated_words": [["cellphone", "cell phone"], ["car", "car"]], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [74, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "According to the picture, there is a group of people standing and sitting around a table, interacting with one another. There are four people in the scene, with three of them positioned closer to the table while the remaining one is further away. The people are in various positions, with some standing, others seated, and one person slightly leaning towards the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "mouse", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 163518, "caption": "In the image, a woman is standing in a kitchen, working on a buffet or table filled with a variety of vegetables. There are numerous vegetable dishes on the table, such as lettuce, tomatoes, and broccoli. The woman appears to be preparing the food and arranging it on the table.", "mscoco_hallucinated_words": [["broccoli", "broccoli"]], "mscoco_gt_words": ["oven", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 156696, "caption": "The image is of a train station with a train parked on the right side and a person standing near the left side of the train. The train is located in a building, perhaps a train station, and a person is walking on the left side of the train. There are multiple people standing on the left side of the train, some closer to the front and others further away. One person is standing on the right side of the train, closer to the end.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "train", "handbag", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 525202, "caption": " In the image, a large pizza is placed on a metal pizza panner in front of a table. It appears to be a take-out pizza, as it has no visible toppings or cheese, but it has a distinct round shape. The table is dotted with utensils and other food, such as cups and an utensil, which suggests it might be a place where people eat and gather.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4444444444444444}}, {"image_id": 423984, "caption": " The picture features a plate full of food that appears to be pasta with vegetables, including carrots and broccoli. There are several pieces of carrots spread across the plate, some closer to the edges and others more towards the center. In addition, there are multiple broccoli florets, arranged both near the edges and further into the dish. The vegetables are accompanied by bread, which is likely a side dish for the pasta. The plate also includes some cups, likely used for serving the food.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [86], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 499486, "caption": "According to the picture, there are two cats sitting on a bed. One cat is black and white, while the other cat is grey and white. They appear to be relaxing and enjoying their time together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cat", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": " The image shows a young man sitting in a car wearing glasses and a purple tie. The man is also wearing a shirt, possibly dark-colored, under his button-down shirt. The car appears to be a passenger vehicle, and the man is sitting in the back seat of it, likely using it as a place to relax and take pictures. A bottle can be seen close to the man, and a cell phone is resting on the car's dashboard.", "mscoco_hallucinated_words": [["seat", "chair"], ["bottle", "bottle"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [50, 67, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 288729, "caption": "The image features a decorated cake with a small elephant inside, placed underneath a large red and yellow tipped tent, which looks like a circus tent. The cake appears to be about two feet high and covers a significant portion of the image. A toy teddy bear is sitting in front of the elephant in the cake, adding an extra touch to the decoration. There is also a small bottle visible at the top left corner of the image, perhaps representing a circus or carnival theme.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["teddy bear", "bear", "elephant", "cake", "person"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 177721, "caption": "According to the picture, a man is performing a trick on a skateboard, with a green shirt and blue jeans visible in the scene. He is balancing on the skateboard as he is skillfully riding it while in motion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "According to the picture, there are multiple people standing in different areas of the scene, including two men in the center of the image, and another man standing to the right. A man with a beard, glasses, and a black hat is standing near the other people, looking at something on his cell phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "horse", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 252625, "caption": "The image depicts a narrow, empty sidewalk street with several people and bicycles, creating a quaint town-like atmosphere. A bicyclist rides down the sidewalk on the left side of the street, while three additional people are on the street in different spots. One person can be seen walking at the top of the street. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 493610, "caption": "The image captures a scene of people sitting on the back of a truck, with a crowd of individuals enjoying a car ride. Some of the people are seated close to each other, while others are dispersed across the truck. The large group includes people of various ages and ethnicities. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 302789, "caption": "The image shows a room with a small bed and a fridge, possibly in a kitchen or hotel room. The bed is covered with a sheet and has a suitcase placed on top of it, suggesting that someone has just packed their belongings before leaving. The suitcase, which is placed near the bed's edge, has black handles and appears to be well-used. The fridge can be seen in the background, occupying part of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "refrigerator", "backpack", "bottle", "sink", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 30667, "caption": " Sitting in the middle of the scene is a large, brown dog. It appears to be comfortably lying down on the floor, with a person sitting nearby, on the right side. To the left of the dog, there are two sets of feet visible - one closer to the dog and the other farther away, indicating that multiple people are in the room or area. A book and a pair of shoes are located at the bottom left corner, further adding to the scene's ambiance.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 114108, "caption": "The image depicts a dining table with two pizzas placed on it, one with cheese and the other with cheese on it. There are multiple people around the table, and some of them are sitting on chairs while others stand. A bottle is placed near the pizzas, possibly a drink for the diners.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "fork", "chair", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 328110, "caption": "The image captures a scene with a woman sitting on a bench and holding a baby. Other people are around the woman, with some sitting on a bench or standing nearby. The environment features a mix of indoor and outdoor elements, such as a brick surface and a paved road, possibly indicating a street or a sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 366111, "caption": "This is a black and white image showcasing a street scene with several men riding bicycles. In the middle, there is a man in a suit wearing a bow tie while traveling on his bicycle. The bicycles and their riders are scattered along the street, making it appear as though they are riding down the city street.", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 532481, "caption": " The image captures a thrilling moment of a person parasailing in the ocean. There are two kites visible: one on the left side and another on the right, indicating a pair of people enjoying the outdoor activity together. The water has multiple waves, some visible in the foreground and others in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "surfboard", "kite", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 291634, "caption": " The scene features a row of motorcycles parked on the side of the street, with several bikes of various sizes and colors lined up side by side. Some motorcycles are positioned closer to the viewer, while others are further down the street. In addition to the motorcycles, there is a bicycle on the sidewalk nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463678, "caption": "The image captures a man standing on a tennis court, preparing to swing his racket and hit the tennis ball. The man is wearing white and can be seen in the center of the picture, holding the tennis racket in his hand. The court appears to be well-maintained, with the surface and net in good condition. In the background, there are a few other people around the court, likely waiting for their turn or observing the ongoing game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "This image depicts a dog being walked on a leash, which is hanging from a motorcycle or bicycle in the background. The scene is likely a street or sidewalk, as there are multiple other people nearby, some sitting and others standing. In the foreground, there are also several bicycles and another person, creating a diverse and lively scene. Overall, this image captures a moment of companionship and leisure while walking a dog on a leash.", "mscoco_hallucinated_words": [["motorcycle", "motorcycle"]], "mscoco_gt_words": ["bicycle", "person", "truck", "dog"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 367872, "caption": " The image captures a baseball game, with a batter swinging at an oncoming ball. The batter is in the middle of the action, while the team members gather around the field to watch the play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 353898, "caption": "The image features a man sitting on a stone bench surrounded by a large group of birds. The birds are a mix of doves and pigeons, with a few standing on the ground and many more in the air. The majority of the birds are scattered throughout the area, while some are closer to the man sitting on the bench.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "cell phone", "backpack", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": " The image features a meal consisting of a bowl of soup and a couple of grilled cheese sandwiches. There are three sandwiches, each placed on the plate with different knives, one in the center and two on either side. The bowl of soup is located at the left side of the plate, with a cup of soup sitting on the edge of the bowl. There are various food items in the dish, with two forks near the sandwiches, and a spoon placed near the bowl of soup. The overall scene appears to be a typical meal with soup and sandwiches.", "mscoco_hallucinated_words": [["knife", "knife"], ["cup", "cup"], ["fork", "fork"]], "mscoco_gt_words": ["sandwich", "spoon", "bowl"], "hallucination_idxs": [31, 59, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 463211, "caption": "The image features two little girls wearing backpacks, one on the left and the other on the right, in front of a school bus. The girls appear to be leaving the bus and walking towards the right side of the scene. In the background, there is a second school bus. There is also a pink backpack in the scene, which one of the children might have been carrying.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": "There is a large room with two chairs against the wall, one closer and one farther back. The furniture is adorned with decorative elements, and there are three vases in the room filled with flowers. The room itself is decorated with several framed pictures on the walls.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 98979, "caption": "The image showcases a microwave, sitting on top of a kitchen counter, with its oven door slightly open. The microwave features a silver color, and a small button is visible inside the oven's interior, close to the bottom of the door. It seems to be positioned in the kitchen for easy access and convenient use.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "knife", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 244455, "caption": " In the image, two men riding motorcycles are on the street, passing through the city or at a public event. They are both dressed in black and wearing helmets, and are riding on their motorcycles with one wearing a white shirt and the other wearing a black one. There are a few other people in the scene, likely attending the event or simply going about their day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 379443, "caption": "The image features a pair of orange scissors, arranged in a diagonal pattern. The scissors appear to be lying on the top of a table or desk. The orange scissors create a striking visual effect, drawing attention to the image. The scissors have been placed in a strategic location to make a fascinating scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 536154, "caption": "The image features a young boy sitting on a bed, surrounded by a massive collection of stuffed animals. The boy is the main subject in the scene, while the stuffed animals are arranged all around him in various sizes and shapes. The bed and room seem to be a small space that is filled with the many stuffed animals. ", "mscoco_hallucinated_words": [["bed", "bed"], ["bed", "bed"]], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [9, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 374061, "caption": "The scene is a black and white photo featuring a cat, which is lying down on a couch. It appears to be sleeping or resting next to several teddy bears. At the left side of the image, a person's hand is visible, perhaps belonging to the cat owner.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["teddy bear", "bear", "cat", "bed", "couch"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 560018, "caption": " The image captures a group of people enjoying a day out in the water on surfboards. The large crowd, consisting of various individuals, is spread out across the ocean waves, having fun and riding them. Among the surfers, there are a couple of boards visible, including one towards the left of the image and another closer to the center. A few people can be seen sitting on the boards, while others are standing on the waves. Overall, it is a lively scene filled with people enjoying a water-based activity on a nice summer day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353320, "caption": "In the scene, a guitar case is leaning on a bench, with various pieces of luggage surrounding it. There are five suitcases in total: one is situated close to the guitar case and the bench, while the other four are further back and towards the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256628, "caption": " The image features a large dog lying on the wooden floor, with its mouth open and tongue out. The dog is positioned on the right side of the scene, with its head and neck occupying a significant portion of the area. A red collar is lying on the floor in front of the dog, and there is a chair placed on the left side of the image, near the back of the dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 158806, "caption": " The image captures a scene of a person feeding a cute brown and black dog a sandwich on a blue plate. The person has placed the sandwich on the table and the dog seems excited and eager to eat it. The scene is filled with various items such as a handbag on the ground, a backpack near the left side of the table, and other objects around the area. In addition, a bottle is visible near the center of the table, along with another person standing in the background. This image could represent a fun or playful interaction between the dog and the person.", "mscoco_hallucinated_words": [["table", "dining table"], ["handbag", "handbag"], ["backpack", "backpack"], ["table", "dining table"], ["bottle", "bottle"], ["table", "dining table"]], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [30, 52, 58, 65, 78, 86], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 424270, "caption": " There is a small child in a white dress sitting on the bed in the image, occupying the center of the scene. The bed looks comfortable, with sheets and blankets covering it. The bed is positioned in the middle of the room and appears to be the main focal point. The child seems to be enjoying their time in the bed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": " In the image, a brown horse is walking in between two cars, seemingly calmly exploring the area. The horse is placed in the middle of the street, surrounded by cars and various other traffic, creating a unique scene. The cars are different sizes, with one being relatively larger and the other one smaller. Some cars are parked to the right and left of the scene, while others are moving. A few people are also visible in the picture, likely observing the horse or passing by.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 483227, "caption": " The image captures a group of young tennis players wearing purple shirts standing on a tennis court. The players are holding tennis rackets, ready to play or practice. There are several tennis balls visible on the court, with one close to a racket, and another one near the upper part of the court. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 228306, "caption": "This is a photo featuring a red bus driving on a street, likely in a city. The bus is parked near several tall buildings, including a building in the front, another in the middle, and another at the back of the bus. The bus seems to be traveling in a more urban or suburban area, as there are no other buses or public transport vehicles present in the scene. There are also a few people in the area, with one person visible near the front of the bus, another near the back, and a third individual situated close to the side of the building on the right. The presence of these people suggests that the bus is stopping at a stop or station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416384, "caption": " The image features a woman sitting on a white, wooden outdoor chair using her cell phone. There are other people in the background, including some near the sidewalk, and some standing at the side of the scene. The chair and bench are placed outside, likely in a public area or a park, where people can gather and enjoy their time together.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 364783, "caption": "The image features a red and white bathroom with various bathroom accessories. There is a sink located next to the toilet, and the toilet itself is surrounded by red walls. Additionally, a mirror is prominently displayed above the sink. The bathroom has a white countertop and a white and red combination in the floor tiles, creating an attractive color scheme. Overall, the bathroom is nicely designed and visually pleasing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "potted plant", "bottle", "toilet", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 335041, "caption": "The image features a person in a red jacket skiing down a snowy street, surrounded by a lot of cars parked along the side. The cars are of different sizes and styles, including some that are at the edge of the street and others that are closer to the middle. The person is holding two skis and seems to be enjoying the outdoor activity. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 531327, "caption": " The image features a large white bowl filled with a variety of mixed vegetables, including broccoli, carrots, and other colorful, small pieces. The bowl is on a table, surrounded by different dining utensils such as forks, a knife, and a spoon, indicating that it is a meal to be served or enjoyed. The table is also accompanied by a couple of cups, which could be used for drinking or as part of the presentation. The food looks fresh and appealing, inviting the viewer to sample the meal.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"], ["cup", "cup"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot"], "hallucination_idxs": [6, 28, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.8571428571428571}}, {"image_id": 451228, "caption": "The image depicts a city street at night with numerous streetlights and multiple red double-decker buses traveling down the street. Some of the lights are hung up overhead in a festive display, creating a celebratory mood. The street is lined with various buildings on both sides, and several people are walking, riding bicycles, and standing around. The atmosphere is lively, with many different people and transportation modes in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 256365, "caption": " The image depicts a plate covered in white frosting, filled with variously decorated cupcakes. Three cupcakes are present on the plate, one of which features blue frosting, creating an interesting contrast to the white frosting on the others. The cupcakes are placed in various positions, adding visual appeal to the overall display. This colorful and creatively frosted cakes appear to be homemade treats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "cup", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": "The scene features a man playing tennis on a court, holding a tennis racket as he tries to hit a tennis ball. The ball is positioned near the center of the court, and the man is standing in the left side of the court, preparing to strike the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, a black and white dog is lying in a dog bed surrounded by numerous books, both small and larger ones. The dog bed is in a room, and the books are stacked on various surfaces throughout the scene. Some of the books can be seen resting on the shelves, others are scattered on the floor and one is on top of a desk in the room. The books cover a variety of topics, and the room seems to be the dog's favorite place for a nap.", "mscoco_hallucinated_words": [["desk", "dining table"]], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 1.0}}, {"image_id": 51054, "caption": "The image features a woman sitting on a bed, holding a cellphone and talking on it. She appears to be an older woman, possibly the subject of a photo blur effect. The bed is located in the center of the image, and a pillow can be seen on the left side of the bed. The woman sits on the right side, close to the edge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bottle", "cell phone", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 1268, "caption": "The image depicts a group of people gathering on a bridge over water. A man is standing in the middle, sitting on a bench to look out over the water, while a woman is next to him taking a picture. A handbag can be seen on the ground near the man, and a couple of other individuals are present, but their focus seems to be on the camera rather than the man. A bird, possibly a stork or a goose, is visible in the water nearby.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["handbag", "cell phone", "backpack", "bird", "boat", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5}}, {"image_id": 62692, "caption": "The image captures a female tennis player swinging her racket during a game, focused on her task. She is positioned at the center of the court and has her back to the camera. There are several people in the background of the scene, and one person seems to be standing near the left edge of the image. The surroundings appear to be outdoors with a few trees visible on the left side of the image and some other buildings in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 542771, "caption": "The image features an old black and white photograph of a room with several chairs and a single table. There is a vase of flowers on the table, adding a decorative touch to the setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 575406, "caption": "The image depicts a blue and red bus with a large white wing on the side parked on a city street. The bus seems to be a tourist attraction and features a mix of passenger seating and standing areas. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bus", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 244834, "caption": "In the scene, two people, a man and a woman, are walking along a rainy sidewalk. They both hold colorful umbrellas, with the man carrying one in red, white, and blue and the woman carrying a yellow one. Additionally, they are holding a bag between them, possibly for both to use during their walk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "clock", "handbag", "umbrella", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 151159, "caption": " In the image, two men can be seen riding skateboards on a tennis court. One man is wearing a tie, while the other person is dressed in a white shirt and brown pants. There are several benches dispersed around the court, indicating that this is likely a public space for outdoor activities. ", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 277073, "caption": "In the scene, two adults are sitting on the back of a moped as they travel down a busy street. One person is a man, the other a woman. They appear to be riding in a unique and distinctive style, sitting back-to-back while a third person, a woman, is seated on the front of the moped. There are a few vehicles present in the scene, including a car in the distance and another one closer to the moped. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 377879, "caption": " The image shows two military tanks on a train. They appear to be large and in good condition. One tank is situated closer to the left side of the train, while the other is located near the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": "The image depicts a small room with various items placed inside it. In the room, there is a laptop sitting on a desk, with a mouse positioned near it. A few papers can be found on the desk as well, including a paper placed on the laptop, and a couple of books, one with the words \"Ryan and Katie\" on its cover, can be found throughout the room.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["laptop", "mouse", "chair", "dining table"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 63934, "caption": "A person dressed as a nun is skating down a city street on a skateboard, likely in a parking lot or other public area. The skateboard is located on the sidewalk, with some cars parked alongside the road. One car is positioned further away from the nun, while another car is closer, almost in front of her. Other cars are visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 275611, "caption": " The image features a whole pizza with cheese and spinach sitting on a plate. The pizza is placed on top of a wood table, which takes up most of the image. The pizza is positioned towards the left side of the table, allowing for a closer view of the cheese and spinach toppings. The table is a dining or wooden table, as there are two people seated nearby, indicating that this pizza could be enjoyed as part of a meal.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["pizza", "knife", "dining table"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": " Sitting in front of a computer at night, someone is holding a keyboard and mouse. The keyboard and mouse are plugged into a computer, indicating that the person is using it to communicate or navigate through the internet. The keyboard can be found in the lower-left corner, while the mouse can be found closer to the center, right under the keyboard. The person's hand is holding the mouse, and a computer monitor is located to the left of the mouse.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": " The image shows a man driving a truck down a road, surrounded by a large herd of sheep. There are multiple large flocks of sheep scattered throughout the scene, with some in the front of the truck, while others are further back.", "mscoco_hallucinated_words": [["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["person", "sheep", "car"], "hallucination_idxs": [7, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 422326, "caption": " The image features a parking lot filled with various vehicles, including motorcycles, a sidecar, and a pickup truck. Among the motorcycles, there is a sidecar, which stands out in the scene. Several motorcycles are parked side by side, while the pickup truck can be found in the middle of the row.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 110449, "caption": "The image shows two men wearing headphones, each standing near the edge of a table. One man is in the foreground, and the other man is in the background. There is a cell phone on the table as well. The scene is likely set in a house or apartment.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["remote", "tv", "dining table", "laptop", "person"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 45550, "caption": " The image features a man standing in a restaurant or cafe, smiling as he poses for a photo. He holds a plate with a sandwich, likely French fries and a cup of soda. There are other people in the background, likely also enjoying their time at the restaurant.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["sandwich", "clock", "person", "bowl"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 444913, "caption": "According to the picture, the scene is of a man standing in a grassy area. He appears to be holding a red Frisbee and throwing it towards a nearby building. The location might be a park or a backyard, surrounded by more trees on the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 82709, "caption": " The image depicts a black and white dog catching a frisbee in the air, mid-jump, as it leaps over a large group of people. The frisbee is suspended in the air, with the dog in pursuit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle", "frisbee", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 132796, "caption": " In the image, three people are riding on the backs of two elephants. The first elephant has a man sitting on its back, with another person seated to the right, and a third person seated to the left. The second elephant has a man sitting on the left side, with another person seated behind him. The people seem to be enjoying their time on the elephants, likely during a tour or excursion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "elephant", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "In the image, two cats are sitting on a toilet seat. One cat is black and white, while the other is mostly black. They are in the bathroom, with a bathroom tile behind them. Additionally, there are several bottles placed in different locations around the scene. Some are placed to the left of the cats, others are located above the cats and behind them, and still others can be seen in the right corner of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 271565, "caption": "According to the image, there is a group of motorcycles driving down a street. There are a total of nine motorcycles lined up on the road, some in the front row and others in the middle row. All the motorcycles appear to be in motion, traveling down the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 205193, "caption": "In the picture, there is an assortment of beverages, with two cups and one bottle positioned on a marble counter. The cups are placed close to each other, while the bottle is located further away to the right.", "mscoco_hallucinated_words": [["cup", "cup"], ["bottle", "bottle"], ["cup", "cup"], ["bottle", "bottle"]], "mscoco_gt_words": ["toaster", "microwave", "oven", "person"], "hallucination_idxs": [13, 16, 24, 34], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 272566, "caption": "The image shows a man sitting on the ground near a sidewalk outside a store, surrounded by various types of fruits. In the front of the display, there are numerous bananas, both ripened and green, spread out across multiple bowls. There are also apples and pears displayed, adding to the overall fruit selection. ", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["cup", "apple", "person", "banana"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 170099, "caption": " In the image, a man dressed in a suit is standing in front of a couch. He appears to be in the middle of speaking, as he looks towards the camera and gestures with his hands. The couch is positioned behind him, and there is another chair visible in the background to the right of the man. A couple of bow ties can be spotted in the image - one is worn around the man's neck, and the other is placed on top of the couch.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tie", "orange", "person", "couch"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 14940, "caption": " The image shows two animals, a cat and a stuffed dog, both laying on a bed or couch, with the cat closer to the front of the scene. The cat is laying on top of the stuffed dog, which has a long snout. The cat appears to be comfortable and at ease as they relax with the toy.", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["dog", "cat", "orange", "chair", "couch"], "hallucination_idxs": [17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 408449, "caption": "According to the picture, a man is shirtless and riding a skateboard down the street, likely enjoying his activity. There are multiple people in the background, some closer to the skateboarder and others further away. The scene appears to be set in a city environment with a mix of buildings and vehicles, such as a car in the left foreground and a truck towards the center. There are also two traffic lights in the image, one closer to the center and the other further to the right side.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["truck", "skateboard", "bus", "handbag", "car", "backpack", "person"], "hallucination_idxs": [77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5714285714285714}}, {"image_id": 140702, "caption": " The image features two elephants, one large and one small, walking in the grass. The larger elephant is significantly taller than the smaller one, and they appear to be standing close to each other, possibly even in a playful manner.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 354540, "caption": " The image depicts a kitchen with a person sitting at a tabletop, working on a laptop. There is a microwave oven and a toaster oven in the kitchen, with the laptop placed on top of one of them. The person is holding a cell phone in one hand, and there are several other appliances in the kitchen, including a refrigerator, an oven, and a dishwasher.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["toaster", "oven", "laptop", "cell phone", "microwave", "chair", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.8571428571428571}}, {"image_id": 19763, "caption": "According to the picture, a slice of deep-fried pizza is sitting on a white plate, which is placed on a dining table in a restaurant. There is another plate visible near the pizza, likely containing another piece of food. The table appears to be crowded as there are several chairs around it, some of them occupied by people or items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 191301, "caption": " The image showcases a dinner table with a tray displaying two desserts \u2013 a cake and a piece of ice cream. The cake is a chocolate one, and the ice cream is a combination of two scoops. The tray also includes a spoon, placed near the desserts.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124236, "caption": " The image displays a kitchen area with a dark, narrow opening from a window, revealing sunlight and a few items placed on the countertop. A table sits in the middle of the kitchen space, surrounded by a few pots and pans. The table has a few bottles and a box on it, likely used for various purposes. ", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["toaster", "oven", "dining table", "bowl", "spoon"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 344930, "caption": "A red fire hydrant sits on the side of the road in the image. The fire hydrant is placed close to a car parked nearby, with a blue top on it. There is also another car visible, this one driving on the road close to the fire hydrant. The scene appears to be a typical city street, with a residential area in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "orange", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " There is a large, multi-colored cat sitting on a bed in the image. The bedroom setting has a TV in one corner, a couch nearby, and a desk with a computer on it. The cat appears to be in the middle of the bed, resting and enjoying its surroundings.", "mscoco_hallucinated_words": [["couch", "couch"], ["desk", "dining table"], ["computer", "laptop"]], "mscoco_gt_words": ["remote", "tv", "cat", "chair", "book", "bed"], "hallucination_idxs": [26, 31, 34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.5}}, {"image_id": 82488, "caption": " The image features two beds in a hotel room setting, each with a yellow pillow on top. One bed is positioned to the left, and the other bed is located to the right. Both beds have sheets, blankets, and a comforter on top. There is also a nightstand located between the two beds, featuring a vase, two lamps, and a book.", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["remote", "clock", "dining table", "orange", "book", "bed"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.3333333333333333}}, {"image_id": 348251, "caption": " The image shows a person skiing down a snow-covered slope. They are wearing skis and holding ski poles, as seen in the surrounding area. There are also several other people around, likely enjoying skiing activities. The scene seems to be set in a ski resort with multiple buildings, including a house in the center of the image and a few other structures along the sides. The sky is covered with snow, giving the entire scene a wintery atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 150703, "caption": " The image displays a table with a hot dog served on a tray. The hot dog has tomatoes and pickles, making it an interesting culinary choice. On the side of the table, there's a cup of beer, likely paired with the hot dog for a tasty treat. The combination looks appetizing and appealing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "cup", "hot dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 381615, "caption": "The image features a window filled with colorful plants and a potted plant, creating a lively and cozy atmosphere. The plants are arranged both around the edges of the window and near the middle, filling the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "potted plant", "vase", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 352703, "caption": " The image shows a woman standing in a kitchen, surrounded by various objects. She is wearing an apron and appears to be in the middle of cooking. The kitchen features an array of food-related items, including several bowls, cups, and spoons, all placed on a countertop or other surfaces.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "handbag", "oven", "dining table", "bowl", "spoon", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 111109, "caption": "The image features a young boy wearing a sweater and shorts, sitting on a chair in the middle of the room. The boy seems to be enjoying his time while posing for the camera.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "tie", "motorcycle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 226802, "caption": " The image captures a busy street in an Asian city where people are gathered, walking, or sitting, such as on the sidewalks. There are multiple vehicles present, including a large truck and several buses. The people appear to be walking, with some closer to the truck and others dispersed throughout the scene. ", "mscoco_hallucinated_words": [["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["motorcycle", "bus", "person", "car"], "hallucination_idxs": [35, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 49005, "caption": "There is a baseball game happening on a field, with several players visible around the grassy area. A total of 15 players are spread across the field, participating in the game. Some players are standing near the baseball diamond, while others are in various positions across the field. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": " In this image, a herd of cattle is walking down a dirt road, with a large group of cows and bulls following the lead of a person riding a horse nearby. There are also several other horses along the road, riding along or trailing behind the lead horse and other animals. The cows and bulls are spread out along the road, covering a large portion of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 100274, "caption": " The image features a train track where multiple train cars are parked, including a flat car. The train cars are spaced out along the track, with the flat car at the center of the scene. The scene also includes a car parked alongside the train track, in the middle between the first and last train cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 317254, "caption": "The image features a person wearing a mask while riding a motorcycle down the street. The person appears to be in motion, possibly traveling on the motorcycle. There is a car nearby, also moving in the same direction.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": " The image captures an empty, large, restaurant-style kitchen with various appliances and cooking utensils. There are multiple refrigerators and ovens in the space. Some of the ovens are located on the top row of appliances, while others are towards the bottom. A variety of silver-colored tools, including knives and scoops, can be seen placed on the countertops throughout the kitchen. The focus appears to be on the vast array of appliances and equipment in the industrial-sized kitchen setting.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["knife", "knife"]], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [20, 54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 28842, "caption": "There is a small park near a grassy area where a few people are hanging out. One person appears to be riding a skateboard, and another person appears to be wearing a blue backpack. The park features a long cement ramp, and some graffiti can be seen on the surface of the ramp. A few other people can be seen, mostly standing and not engaged in any particular activity.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["skateboard", "person", "dog"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 319607, "caption": "The image depicts a street scene with a combination of traffic lights and a bike on the side of the road. On the right side, there are multiple traffic lights, with one red light to the left, followed by a yellow light, another red light to the right, and another yellow light. A bike is visible, positioned close to the left side of the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 210372, "caption": "The image portrays a small kitchen with red cabinets and a sink. The sink is located towards the middle of the kitchen, positioned underneath a window. There is a single door present on the right side of the kitchen, possibly leading to the rest of the house or to a storage room. The kitchen has a couple of wine bottles placed along the wall, likely serving as decoration.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 156915, "caption": "The image showcases a table with a beautiful arrangement of flowers, featuring a red rose in the center and white flowers around it. There are also some greens and filler flowers accompanying the roses, adding a bit of greenery to the overall bouquet. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 349642, "caption": "According to the image, there are two people sitting on a bench while a dog is sitting on the ground in front of them. The dog is positioned in the foreground, while the people are seated further back. There is another person standing nearby, but they are not interacting with the other people or the dog. The scene takes place on a brick-laid walkway, possibly a sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": " The image depicts a group of people gathered around a body of water where several boats are docked. The people are standing close to the water's edge, with some of them occupying spots under umbrellas or tents. The umbrellas and tents create a sheltered area, making it a comfortable spot for people to stand and gather.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "umbrella", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45435, "caption": "According to the picture, a person is holding a cell phone in their hand. The phone is occupying a significant portion of the image, making it the focal point. The person's hand appears to be positioned near the center of the frame, with the cell phone extended to show the back side of it. The background features a dining table with some books and papers placed on it, suggesting that the setting might be an office or study area.", "mscoco_hallucinated_words": [["table", "dining table"], ["book", "book"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [66, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": " The image depicts a man sitting at a desk in front of two computers, using one of them. He is sitting in front of a computer, which has a laptop on the desk. The laptop is situated on the right side, with a keyboard below it and a mouse nearby. The second computer is situated on the left side, with a chair close to it. ", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["mouse", "mouse"]], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [48, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5714285714285714}}, {"image_id": 72656, "caption": " In the image, three young boys are sitting at a small table eating. There are two chairs at the table and each child has a plate on the table. The boys appear to be enjoying their meal, possibly having pizza. There is no dining table in the scene. The table is positioned near the right side of the image, which appears to be against the side of a building.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "suitcase", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372087, "caption": "The scene depicts a green and yellow train traveling along train tracks as it crosses a bridge. The train occupies a substantial portion of the image, with different areas of the train visible. The front of the train can be seen towards the left side of the image, while its back end is located towards the right side. There is also a sense of movement, as the train appears to be travelling to the right, passing through the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "suitcase", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 336209, "caption": "According to the picture, a man is standing on a skateboard, likely practicing a trick. He appears to be riding on a wooden ledge or a ramp. The scene takes place in front of a building with graffiti on it, likely adding visual interest and contributing to the edgy atmosphere of the location.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 46031, "caption": "The image features a table with a laptop sitting on it. The laptop has a keyboard, mouse, and a monitor, creating a typical setup for a desktop computer. A small screen sits on top of the laptop, displaying an image of green leaves.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 283118, "caption": "The image captures a family at a dining table with a variety of foods, including a few sandwiches and chicken wings. The father, mother, and their two children appear in the picture, sharing a moment at the dinner table. The dining table is located in the middle of the scene, surrounded by chairs. The food is spread out across the table, with various plates containing the sandwiches and chicken wings.", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["sandwich", "sandwich"]], "mscoco_gt_words": ["pizza", "handbag", "dining table", "bottle", "cell phone", "car", "backpack", "chair", "person"], "hallucination_idxs": [18, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 429598, "caption": "The image shows a kitchen with a brick wall and a small refrigerator located near the sink. A stove top oven can be seen in the foreground. The refrigerator appears to be made out of wood. Additionally, a microwave is situated towards the center of the room. A chair is placed in the corner of the kitchen, providing seating options.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["chair", "chair"]], "mscoco_gt_words": ["sink", "oven", "refrigerator", "bottle"], "hallucination_idxs": [42, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.75}}, {"image_id": 333365, "caption": "Scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 424174, "caption": "The image features a large bowl filled with a variety of fruit, including numerous bananas and apples, placed on a table. The bananas can be found dispersed throughout the bowl, while the apples are situated around the edges. The fruits are arranged in different sizes, offering an appealing display of fresh, juicy fruit. This colorful display is sure to attract attention and make a great addition to any home.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "apple", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 3035, "caption": " The image captures an airport scene where two airplanes are parked on a runway. One airplane is positioned at the top right, while the other airplane is situated closer to the bottom center of the scene. A truck is visible at the bottom right of the image, likely associated with transportation services.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 177213, "caption": "The image features a person sitting at a table, eating a pizza with cheese and spinach on it. The pizza is the main focus of the scene, occupying the majority of the table's space. The table appears to be a dining table, as there is a fork placed near the pizza and a knife on the table. A glass can be seen in the background, possibly containing a drink or sauce. The individual appears to be enjoying their meal, making the scene a pleasant one.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 438535, "caption": "The image captures a city street in black and white, featuring a large clock with a white face and Roman numerals in the center of the photo. The clock, mounted on a large pole, has a dark background, making the numbers more visible. The city landscape is dotted with various cars parked at different locations, some closer to the main road, and others scattered nearby. The parking lots are full, with one near the left side of the street and another at the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 490711, "caption": "According to the picture, the image features a white refrigerator standing next to a white cabinet in a kitchen. There is a large sink located on the left side of the refrigerator and a microwave placed on the right side, which could be used for cooking or warming food. There are several cups, a bottle, and a wine glass scattered throughout the scene, suggesting the kitchen is also used for drinking or dining.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["cup", "cup"], ["bottle", "bottle"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["sink", "refrigerator", "banana"], "hallucination_idxs": [36, 56, 59, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": " The image features a small black and white kitten sleeping on a table, specifically on a mouse pad, seemingly relaxing after its playtime or nap.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "laptop", "mouse", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 456756, "caption": " The scene features a tall giraffe standing next to a zebra, both looking towards the right side of the image. There are three other giraffes, one to the left of the tall giraffe and two more located near the center of the image, further back from the camera.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": "The image features a train, likely a subway car, with a set of chairs placed in the center of the vehicle. There are two side chairs on the left side of the image, and two more chairs are situated on the right side. The train appears to be quite spacious, with plenty of seating for passengers. Two people are standing near the doors, possibly waiting to board the train or observing the interior.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "chair", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 343937, "caption": " In the image, a woman in black snow pants and a white jacket is snowboarding down a snowy hill. She is wearing a helmet for added safety and protection. She appears to be a beginner snowboarder, enjoying the sport as she practices her skills. The snowboard itself is visible on the left side of the frame.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": "Two birds are sitting on top of a table covered with pizza slices and other items. There are two bottles visible in the background, one closer to the left side of the image and the other one on the right side. The pizza appears to be a popular dining option, as there are several pieces placed on the table. In addition to the pizza, there are several cups and dishes located around the table, indicating a varied and well-stocked dining experience.", "mscoco_hallucinated_words": [["pizza", "pizza"], ["pizza", "pizza"], ["pizza", "pizza"], ["cup", "cup"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "bird", "chair"], "hallucination_idxs": [11, 45, 68, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.6}}, {"image_id": 145666, "caption": " The image portrays a large pot filled with a variety of vegetables, primarily broccoli and carrots. The broccoli is situated in different parts of the pot, while the carrots are positioned in the middle of the pot. Some green beans can also be observed, adding to the overall mix.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 304984, "caption": " The image displays a plate of food, including a sandwich and two small bowls. The sandwich appears to be a French or grilled cheese sandwich, and it's placed on a white plate. The two small bowls are placed next to the sandwich, one on the left and one on the right side. There is another bowl visible on the table, and a fork is positioned near the plate, suggesting that it's part of the meal. The scene seems to be on a table or countertop, and a bottle is placed close to the plate, possibly containing a beverage.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "sandwich", "cake", "fork", "orange", "wine glass"], "hallucination_idxs": [66, 93, 99], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.5}}, {"image_id": 122467, "caption": "According to the picture, a person is holding a white Wii remote, likely playing a video game. In the background, there are two chairs positioned in the left corner, one couch and the other more upright.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["remote", "dining table", "person", "couch"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 518287, "caption": "According to the image, a toothbrush is sticking out of an orange container. This unique object, likely a toothbrush holder or a cup, features a smile on one side and a toothbrush sticking out of the other side. It is a creative way to showcase a toothbrush while using the container for its intended purpose.", "mscoco_hallucinated_words": [["container", "bowl"], ["container", "bowl"]], "mscoco_gt_words": ["toothbrush", "cup", "orange", "dining table"], "hallucination_idxs": [13, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 145448, "caption": "The image features a black cloth covering an area on a table, likely on a patio or a similar outdoor setting, where a small cat is sitting and resting. The cat is curiously watching a man standing closer to the camera than the cat. Another person can be seen in the background, sitting closer to the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "motorcycle", "cat", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 352900, "caption": "The image displays a dish of food on a table, likely containing a combination of pasta and broccoli. It is served on a white plate and appears to be a substantial portion. The dish appears to be finished, with the broccoli fully cooked and integrated within the pasta.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "fork", "knife", "broccoli"], "hallucination_idxs": [9], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 153607, "caption": "According to the picture, a blue vase with a flower can be seen in different parts of the image. There is a close-up shot of the vase with a single flower placed in it, and a few other parts of the image reveal the vase with different angles and perspectives. There is another vase on the left side of the picture, but it is not filled with any water or flowers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 157581, "caption": "The image shows a table lined with a variety of bottled wine, giving it a festive appearance. There are 10 different wine bottles of various sizes, colors, and designs arranged across the table. The wine bottles are situated close together, creating a compact display of different types of wine. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 227438, "caption": " The image features a clean, well-made bed in a hotel room. The bed takes up most of the space in the room, and there are a few decorative elements, including a vase situated on a table near the bed. The bed has a blanket and a pillow, and there is a pair of shoes underneath it, indicating the possibility of a person lying on it. The entire scene is lit by a lamp placed above the headboard, giving the room a pleasant atmosphere.", "mscoco_hallucinated_words": [["vase", "vase"], ["table", "dining table"], ["person", "person"]], "mscoco_gt_words": ["clock", "couch", "bed", "chair"], "hallucination_idxs": [35, 39, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.25}}, {"image_id": 116037, "caption": "The image captures a baseball game scene with multiple people involved. There are three main people in the scene: a batter swinging a baseball bat at a ball during a game, a catcher, and an umpire standing close to the players.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 202658, "caption": " A small toilet, adorned with a Hello Kitty seat cover, is sitting in a room, likely in a basement or storage area. The toilet has a pink seat, which gives it a cute appearance. In the background, there is a black car door, possibly indicating a place where the toilet is located. The bathroom or storage room has a few other objects, such as a sink and a backpack, which can be seen on the left side of the scene. The scene captures the unusual presence of a toilet and its decorated seat in an unconventional setting.", "mscoco_hallucinated_words": [["car", "car"], ["sink", "sink"], ["backpack", "backpack"]], "mscoco_gt_words": ["bowl", "chair", "toilet"], "hallucination_idxs": [46, 73, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.3333333333333333}}, {"image_id": 18150, "caption": "A man and a young child are sitting on a floor together, surrounded by some chairs, likely in a living room setting. The child is eating a slice of pizza as the man watches, seeming to share the pizza with him. There are additional pizzas visible in the image, both placed on a tabletop nearby.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["pizza", "bottle", "backpack", "person", "couch"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4}}, {"image_id": 418711, "caption": " The image features a man riding a bicycle down the street, with a large load of bananas strapped to the bike's back. The man appears to be moving from the right to left direction of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "car", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": " The image depicts a green bowl filled with soup, likely containing corn, broccoli, and carrots. The bowl is sitting on top of a placemat or place mat in a dining room setting. The soup appears to be a white and yellow color.", "mscoco_hallucinated_words": [["carrot", "carrot"]], "mscoco_gt_words": ["bowl", "spoon", "broccoli", "dining table"], "hallucination_idxs": [17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 491000, "caption": "In a narrow city street, a group of people is walking and enjoying their surroundings. They are spread out along the sidewalk and walking in different directions, some near a building with a sign on it. Several individuals are wearing backpacks as they move around, giving the scene a sense of activity and life. ", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "car", "person"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 401163, "caption": "In the image, a man is sitting on a park bench while a dog is laying near him. They appear to be enjoying the park area, with a green park bench serving as the setting. Another person is also present in the scene, standing a little further away from the bench. The park is full of greenery and trees, making it an ideal spot for leisure activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": " The image shows a collection of items, including a backpack, a laptop, and a few books spread out across the floor. The backpack is placed in the center of the scene, while the laptop is located slightly towards the left side. The books are positioned at the right and top areas of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["laptop", "book", "cell phone", "backpack", "scissors"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 268293, "caption": " The image captures a tennis player in motion, swinging a racket to hit a tennis ball during a game. The main subject, wearing a blue and white shirt, is holding a tennis racket and is focused on the approaching ball. The tennis ball is seen flying mid-air, indicating that it has been hit by the player. There are other people in the background, including a person on the left side and a person towards the top center of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": "The image depicts a small kitchen with various appliances. In the room, there is an oven, a washing machine, and a refrigerator. Additionally, a sink is located towards the middle of the room. The kitchen also has a dishwasher placed in the middle, near the washing machine. Several bottles and a vacuum cleaner can be seen throughout the scene, likely storing cleaning supplies. Overall, the kitchen appears well-equipped with the necessary appliances for a typical household.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["clock", "oven", "spoon", "bottle", "vase", "apple", "sink"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.42857142857142855}}, {"image_id": 358086, "caption": " The image depicts a group of people playing a game of frisbee. There are several people standing or jumping around the field, actively participating in the game. In the background, there is a bench, and some chairs are also placed in the scene, likely for spectators or people to rest. ", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["bottle", "frisbee", "backpack", "chair", "person"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 89541, "caption": " The image features a variety of food in a yellow container, with several pieces of cakes and muffins. Among these items, there are three different kinds of cakes, with one being larger and the other two being smaller. The cakes appear to be different flavors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "cake", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 239445, "caption": "The image depicts a man with a white shirt standing in front of a fenced-in area with several people scattered around. He is holding a white frisbee, seemingly contemplating what to do with it. A few benches are present, one towards the left and two others at the center right area. ", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["frisbee", "sports ball", "person"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": "A giraffe is standing in a large grassy field, appearing to be chasing a kite. The kite flies in the background, adding interest to the scene. The giraffe is a unique sight, standing tall and walking around the field. The kite seems to be a bit more subdued in terms of color than the giraffe, providing an interesting contrast.", "mscoco_hallucinated_words": [["giraffe", "giraffe"], ["giraffe", "giraffe"], ["giraffe", "giraffe"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [1, 31, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 487990, "caption": "A person is holding a guitar and placing it in front of a table filled with various food items. The table is surrounded by chairs and other people, creating a fun and festive atmosphere. The scene is set up with various bowls, plates, cups, and a bottle placed around the table. A bottle can be seen in the background of the image. The table covers a significant portion of the image, and the food items are spread out across it, making it a lively and diverse setting.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cake", "fork", "donut", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.5}}, {"image_id": 138834, "caption": "In the scene, there is a man walking next to a vehicle, possibly a truck or a SUV, wearing snow equipment. He is carrying ski poles in his right hand and skis on his back. Behind him, there is another vehicle, possibly another truck, positioned close to the scene. The man appears to be headed toward a parking spot or getting ready to leave the area.", "mscoco_hallucinated_words": [["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [16, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 492057, "caption": "In the image, a man wearing a black jacket is taking a selfie. He appears to be in a public bathroom, possibly recording a message or a sign on the back wall. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 388215, "caption": "The image shows a man in a blue and white shirt playing tennis, standing on a court and preparing to serve the tennis ball. The man is holding a tennis racket and is getting ready to swing it to hit the ball, which is positioned at a location on the court close to his left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 111409, "caption": "The image features a man riding a motorcycle on a sidewalk. He is wearing a helmet and sitting on the bike while it's stationary. There are a few other people visible in the scene, but they are not riding any vehicles. The background consists of pavement, and a few benches can be seen scattered around the area. Overall, the image captures a scene of a person enjoying their ride on a motorcycle.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 451391, "caption": "The image captures a person holding a large bouquet of green broccoli flowers, which are arranged in an artistic manner. This arrangement showcases the beauty of broccoli flowers, which are typically used as decorative items or presented as a gift. The bouquet appears to be prominently displayed in the background of the picture, likely in a store or market.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92462, "caption": "The image features a group of cows walking down a dirt road in front of a car. There are six cows visible in the scene, with two of them in the middle and the others trailing behind, creating a long line of cows. The car appears to be moving along the road, likely following the cows or perhaps traveling down the road. There are also three other vehicles visible in the background, with one near the car and the other two positioned further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cow", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 397268, "caption": "According to the picture, a store window displays two bicycles side by side, one leaning against the other. The bicycles are placed in front of the window, outside the shop. The scene features several chairs, with four chairs positioned close to the two bicycles, and a couple more chairs placed further away. Additionally, there are a couple handbags scattered around the area, possibly associated with the store.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["bicycle", "chair", "potted plant", "bench"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 127659, "caption": " The image features a large passenger train pulling along the railroad tracks, which is the main focus of the scene. There are a few people waiting at a train station, some sitting on the benches that are placed throughout the platform, and others walking around the area. The train itself covers a significant portion of the image, stretching from the right edge all the way towards the left edge. There are also a few buses parked near the station, adding to the overall scene.", "mscoco_hallucinated_words": [["bus", "bus"]], "mscoco_gt_words": ["train", "person", "bench", "dog"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 251857, "caption": "The image captures a group of skateboarders in a city plaza. Two men are skateboarding, with one skateboarding right in front of the camera, and the other one further back. A few other people can be seen in the plaza, some standing still, and others walking around. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 243212, "caption": "According to the picture, a young boy in a green shirt is standing in a field while holding a blue frisbee. The boy is the main focus of the image as he is the only person present in the scene. The frisbee appears to be in his hand, and he seems to be looking intently at the toy. In the background, there are multiple cars parked in different areas around the scene, adding to the context of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "frisbee", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 444304, "caption": "The scene shows a person sitting on a bed, crouching down with their back to the wall. They appear to be playing a video game on a TV, possibly on a stand next to them. A remote is visible on the left side, being controlled by the person. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 305000, "caption": " In the image, there is a market stand in the background with two women shopping for produce. An older woman is standing near a table full of vegetables, with several baskets and an empty cup nearby. A younger woman can be seen at the side of the table, browsing the various fruits that are displayed. Some apples and bananas are visible, likely being offered for sale.", "mscoco_hallucinated_words": [["table", "dining table"], ["cup", "cup"], ["table", "dining table"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [26, 37, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 1.0}}, {"image_id": 44136, "caption": "Two men dressed in tuxedos are seated at a table in a social setting. One of the men is holding a glass bottle, possibly a wine bottle, while the other person has a bottle in their hand. The scene features several chairs arranged around the table, with a knife situated closer to the center of the table and a couple of forks located near the edges of the table. There is a second bottle in the scene, placed further away from the camera. Another bottle can be seen in the background. Various food items, such as cups and plates, are also present, but it is not clear what they contain. A vase is positioned near the center of the table.", "mscoco_hallucinated_words": [["chair", "chair"], ["knife", "knife"], ["fork", "fork"], ["cup", "cup"], ["vase", "vase"]], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [45, 53, 66, 106, 124], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2777777777777778, "Recall": 0.6}}, {"image_id": 154846, "caption": "The image features a large pan of pizza covered in pepperoni and mushrooms. Several slices of the pizza can be seen throughout the dining table. A few knives are spread across the table as well, indicating that the pizza is being prepared for consumption.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["pizza", "person", "dining table"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 567254, "caption": " The image features a small child with blonde hair in the center of the scene, sitting in front of a television. The child appears to be playing with a pink toothbrush held in her right hand, possibly brushing her teeth. ", "mscoco_hallucinated_words": [["toothbrush", "toothbrush"]], "mscoco_gt_words": ["person", "vase", "tv", "bottle"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 79261, "caption": " According to the picture, a man is standing in a body of water, mopping the area with a broom. The water appears to be located in front of a building and is surrounded by various people in different areas of the scene. There are a few individuals scattered throughout the water's edge, possibly engaged in different activities or enjoying the setting. A clock can be seen in the image, suggesting a sense of time and organization within the area. The mopping action being performed by the man can be seen to be thorough and intentional.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301402, "caption": "The image depicts a man wearing a hat, sitting on a chair and looking at his laptop. He is positioned at the center of the scene, focusing on his laptop, which is placed on a table nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "potted plant", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 415828, "caption": "The scene takes place on a street where a man and a woman are standing outside of a building. They appear to be waiting for a bus or a ride, as they are accompanied by a few other people nearby. The road itself is lined with a variety of trees on either side, providing some shade and greenery. There are several cars parked near the street, as well as a few people walking on the sidewalk. The street is relatively narrow and has houses lined up along it. The atmosphere is that of a city street, with people waiting for transportation or walking around town.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "traffic light", "stop sign", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 543231, "caption": " In the image, a woman is taking a picture with a camera. She is holding a remote control in one hand while standing next to a couch, and there is another person in the background. The room seems to have a living area with a couch and a dining table in the background, along with another person, possibly sitting on the couch. The overall atmosphere appears relaxed as the woman is taking a photo.", "mscoco_hallucinated_words": [["couch", "couch"], ["couch", "couch"], ["table", "dining table"], ["couch", "couch"]], "mscoco_gt_words": ["remote", "bowl", "book", "bed", "person"], "hallucination_idxs": [28, 49, 53, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.4}}, {"image_id": 495357, "caption": " In the scene, a small motorcycle is traveling down the street with a person riding it. There are several people on the street, including a few men and a woman, who are interacting with each other or focusing on other aspects of the street scene. Some of the people are wearing backpacks, possibly with their personal belongings.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["motorcycle", "person", "handbag", "dog"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 523175, "caption": " The image displays a bowl filled with soup on a countertop or table. The soup has many vegetables, including oranges, carrots, and onions. There is also a spoon in the bowl that is holding a vegetable, possibly a tomato. It's quite a vibrant meal, with a variety of colors and textures.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["bowl", "carrot", "spoon", "dining table"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 184945, "caption": "The image depicts a row of motorcycles parked on the side of a road. The motorcycles are positioned close to each other, creating a line. There are four motorcycles in total, and they are spread out along the road. Some motorcycles appear to be newer models, while others are smaller in size. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 49942, "caption": " The image features a man holding an umbrella with a pink and white striped shirt. The man is facing forward, and the umbrella covers much of his face. He is wearing a tie, which is visible on the front of his shirt. The other parts of his clothing are not clearly visible.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "The image features a blue train, likely a commuter train, sitting on the train tracks, with a person standing on the right side of the train and several people on the left side of the train. There are also a couple of handbags on the platform, one closer to the left side of the train and the other closer to the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 42889, "caption": "In the image, there is a teddy bear sitting on a table in front of a computer, with a keyboard and a mouse nearby. The keyboard is placed slightly behind the teddy bear, while the mouse is positioned to the right of it. In addition to the keyboard and mouse, a television remote control can be seen on the right side of the table. The teddy bear appears to be the focus of the scene.", "mscoco_hallucinated_words": [["table", "dining table"], ["computer", "laptop"], ["mouse", "mouse"], ["mouse", "mouse"], ["mouse", "mouse"], ["television", "tv"], ["remote", "remote"], ["table", "dining table"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear"], "hallucination_idxs": [12, 17, 24, 39, 54, 57, 58, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": "In the image, there are a few interesting objects on display. Among the items, a large, blue vase with greenery inside is prominently positioned in the center of the image. Near the vase, there are three smaller vases, one placed to the left, and the others positioned at the left and right edges of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 247587, "caption": "In the image, there is a beach scene with a man riding a surfboard on the sand. The man appears to be standing on the surfboard as he walks along the beach. There are several people scattered in the background, some of them closer to the surfboard and others a bit further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": " There is a large city bus with bicycles attached to the front of it. The bicycles can be seen on the right side of the bus. The bus has the number 1741 prominently displayed on the front, indicating the bus's route or destination.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 545056, "caption": " The scene takes place in a large, grassy field where a herd of cows is grazing. There are many cows scattered throughout the field, with some closer to the front of the image and others further away. Some cows are sitting down while others are standing, creating a diverse view of the herd.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "sheep", "cow", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": " In the image, there is a man standing in a kitchen, surrounded by other people. There are three people sitting across from the man, with one person slightly to his left, another slightly to his right, and the third one right behind him. The rest of the room appears to be filled with various food items and other kitchen appliances.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "oven", "dining table", "bowl", "book", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": " The image portrays a group of people standing outside in the snow, ready to go skiing. All the individuals have skis on their backs and some are holding backpacks. There are seven people in the scene, spread throughout the area. The skis are visible on their backs or being held, indicating the group is getting ready for the day's skiing adventure.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 246963, "caption": "According to the picture, it depicts a group of motorcyclists riding down the road. They appear to be traveling in a single lane on the right side of the road, with a stop sign and traffic sign nearby. The riders are spread out throughout the scene, with some riding closer to the stop sign and others in the middle of the group. The motorcycles are well-maintained, with a number of them visible along the roadway, including some in the front and some in the back. The scene has a relaxed atmosphere, as the riders seem to be enjoying their journey together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "stop sign", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 498225, "caption": "In the picture, there are three people standing in front of a glass door, likely to be a display window of a bakery. They appear to be admiring the various cakes and pastries visible in the shop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cake", "cell phone", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 124601, "caption": " In the image, there is a kitchen with various appliances and a laptop on a counter. The kitchen has a wooden surface with cabinets against the wall, along with two cupboards with bottles of drinks placed inside. There is a microwave towards the left side of the counter and a refrigerator towards the right side. A laptop sits on the countertop with a computer mouse beside it. Additionally, there are several items on the counter, including a vase, a few cups, and a cell phone.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["mouse", "mouse"], ["vase", "vase"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [54, 69, 85, 93], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 581749, "caption": " The image displays a variety of different types of vegetables and fruits. Among the produce, there are several bunches of broccoli, which are placed together in two separate containers, with some broccoli scattered on the table as well.", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["broccoli", "orange", "dining table"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 227960, "caption": " The image displays a dining table with a white dish containing various food items, including carrots and at least one bowl of broccoli with sauce. A spoon can be seen next to the white dish, indicating the presence of sauce within the food itself. A bottle of red wine can be seen in the background, further adding to the sense of dining.", "mscoco_hallucinated_words": [["table", "dining table"], ["broccoli", "broccoli"], ["bottle", "bottle"]], "mscoco_gt_words": ["cup", "bowl", "spoon", "wine glass", "carrot"], "hallucination_idxs": [5, 23, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6}}, {"image_id": 286503, "caption": "The image depicts an elephant standing near an artificial structure, likely a tall support or a bench, in a fenced enclosure. The elephant appears to be engaging in its feeding activity, as it is surrounded by a group of people, likely visitors or staff members at a zoo.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["person", "sports ball", "elephant"], "hallucination_idxs": [17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": "The image depicts a clean and nicely organized kitchen with a sink and a countertop. There is a living room in the same space or nearby. The sink is placed at the back of the kitchen, with a knife nearby. The kitchen also features refrigerators and an oven, both located towards the center of the room. In the living room, there are two chairs positioned further away from the sink. The kitchen is equipped with stainless steel appliances, making it appear modern and sleek.", "mscoco_hallucinated_words": [["knife", "knife"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["oven", "tv", "bowl", "potted plant", "sink", "microwave", "chair", "couch"], "hallucination_idxs": [41, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.375}}, {"image_id": 142108, "caption": "In the image, a group of people with bandannas covering their eyes are gathered around a dining table with multiple bananas placed on it. The bananas are scattered across the table and several hands are visible, likely belonging to the blindfolded individuals attempting to peel the bananas.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": " This image captures a baseball game in action. There are several people standing around the field, including the baseball player who has just hit an inside ball during a game. The player is holding a baseball bat, standing up, and preparing to swing at the ball. There is also a catcher in the vicinity, ready to catch the ball after the batter's swing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 475043, "caption": "The scene is set in a living room, with a woman sitting on a couch in the center and a man sitting on another couch nearby. The woman has a laptop and a cup on a table, while the man has two beers and a cup. Another man is positioned towards the left side of the image. A chair is placed off to the right side of the scene, where a person can be seen sitting. In addition, there is a dining table placed on the right side of the image, which appears to be set for a meal.", "mscoco_hallucinated_words": [["cup", "cup"], ["table", "dining table"], ["cup", "cup"], ["chair", "chair"], ["table", "dining table"]], "mscoco_gt_words": ["bowl", "spoon", "laptop", "bottle", "person", "couch"], "hallucination_idxs": [35, 38, 48, 63, 90], "metrics": {"CHAIRs": 1, "CHAIRi": 0.35714285714285715, "Recall": 0.5}}, {"image_id": 456784, "caption": " The image captures a person cutting a cake with a large knife. The cake, which has a white, round shape, is positioned on a table. The person is in the center, focused on the task of slicing the cake with the knife. In addition to the cake and the knife, there are several other objects in the image, including a bottle on the left side and a cup on the right side. There are also a few hand-shaped items scattered throughout the scene.", "mscoco_hallucinated_words": [["table", "dining table"], ["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [28, 69, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 33753, "caption": "According to the picture, there is a wooden park bench in the scene, which has a remote control sitting on top of it. The remote control is positioned in the center of the bench.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "bench", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 448694, "caption": " A group of young men, most likely friends, are playing a game of frisbee outside in a field. They are engaged in the activity, with some of them trying to catch the frisbee, while others focus on the game. One man can be seen in the foreground, running towards the frisbee, while another man is farther away to the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": "The image features a bowl of food on a table, likely dining or dining table. The bowl contains a mixture of noodles and vegetables such as broccoli. There is a fork in the bowl with a spoon-like utensil also present, likely used to mix the food together. This combination of utensils seems to indicate the dish is ready to be eaten. The spoon and fork are placed within the bowl, suggesting the food is being prepared for consumption.", "mscoco_hallucinated_words": [["spoon", "spoon"]], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.8}}, {"image_id": 503210, "caption": "According to the picture, a spacious room with a piano and two couches has been converted into a living area by removing the piano and adding chairs. A large couch is situated on the left side of the room, and two smaller couches are placed on the right side, with one couch closer to the middle. The walls are yellow, and there are a number of potted plants in various spots throughout the room. A large clock is located prominently on the wall near the top, with a second clock placed near the ceiling above a door. A handbag can be found on the couch, and there is a book visible in the room as well.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["clock", "potted plant", "vase", "orange", "book", "chair", "couch"], "hallucination_idxs": [107], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.7142857142857143}}, {"image_id": 142257, "caption": "A busy street with a traffic light is the setting of this scene. There are several cars, some more prominent than others, going about their business. A red traffic light is visible near the center of the image, indicating a stop for the vehicles. There are also several pedestrians on the street, walking by and going about their day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}], "overall_metrics": {"CHAIRs": 0.38877755511022044, "CHAIRi": 0.11204293861120429, "Recall": 0.5629562043795621, "num_hallucinated_caps": 194.0, "num_caps": 499.0, "hallucinated_word_count": 334.0, "coco_word_count": 2981.0, "length_response": 69.76753507014028, "hallucinated_caps_ls": [144305, 54870, 95908, 324008, 234433, 39053, 128119, 401027, 253630, 317320, 161227, 25812, 195645, 32625, 18688, 481596, 67833, 272880, 254368, 231534, 209347, 13550, 2154, 274931, 104326, 300784, 409346, 522262, 344942, 300124, 490491, 12669, 133146, 261097, 185327, 348654, 556478, 451943, 268854, 495513, 529256, 37907, 506401, 278365, 432213, 201555, 148977, 399058, 183571, 325836, 437910, 467887, 174402, 352734, 295656, 557998, 401010, 313928, 490284, 346940, 472610, 388237, 289640, 59710, 527497, 61959, 229659, 473118, 416104, 5934, 131138, 258209, 285820, 453731, 418692, 120776, 276719, 354559, 468005, 378831, 375769, 20650, 152499, 147762, 425447, 145020, 301837, 240681, 178753, 47781, 211604, 298067, 563938, 47940, 526414, 273132, 272728, 282553, 188136, 191691, 99874, 557447, 221350, 280930, 134223, 217593, 477016, 301266, 558784, 539951, 143479, 230983, 564629, 53431, 53465, 381031, 9628, 254161, 163518, 423984, 118124, 288729, 30667, 114108, 366111, 70815, 88269, 536154, 374061, 158806, 368335, 416384, 531327, 451449, 1268, 151159, 436814, 275611, 478380, 110449, 45550, 205193, 272566, 170099, 14940, 408449, 354540, 124236, 14285, 82488, 226802, 425569, 28842, 45435, 35827, 283118, 429598, 490711, 277051, 304984, 122467, 518287, 352900, 227438, 202658, 18150, 92107, 491000, 474499, 358086, 239445, 105782, 487990, 138834, 111409, 397268, 127659, 305000, 44136, 154846, 567254, 543231, 495357, 523175, 42889, 124601, 581749, 227960, 286503, 315740, 475043, 456784, 99053, 503210]}}}