{"overall_metrics": {"CHAIRs": 0.352, "CHAIRi": 0.10688920454545454, "Recall": 0.5517084282460136, "num_hallucinated_caps": 176.0, "num_caps": 500.0, "hallucinated_word_count": 301.0, "coco_word_count": 2816.0, "length_response": 71.182, "hallucinated_caps_ls": [144305, 324008, 455610, 401027, 253630, 326021, 317320, 267537, 195645, 32625, 343534, 254368, 231534, 227031, 523292, 13550, 146837, 209925, 2154, 274931, 300784, 409346, 275929, 490491, 133146, 498747, 185327, 348654, 482432, 144723, 379842, 556478, 451943, 495513, 529256, 37907, 278365, 69757, 432213, 368367, 201555, 92131, 333712, 148977, 399058, 437910, 467887, 295656, 557998, 401010, 313928, 346940, 388237, 289640, 122602, 24104, 285302, 61959, 229659, 391222, 416104, 5934, 258209, 347453, 285820, 524601, 253262, 87052, 418692, 259345, 120776, 276719, 219546, 228350, 32190, 449798, 375769, 180764, 177015, 333998, 343218, 301837, 240681, 178753, 47781, 293703, 211604, 563938, 47940, 526414, 505213, 441363, 282553, 188136, 449675, 191691, 557447, 159979, 561967, 134223, 3742, 217593, 475813, 558784, 308543, 359164, 539951, 564629, 53431, 441840, 381031, 539555, 423984, 499486, 118124, 288729, 252625, 493610, 302789, 30667, 353898, 88269, 98979, 353320, 256628, 416384, 531327, 542771, 277073, 436814, 275611, 110449, 45550, 205193, 272566, 19763, 124236, 344930, 82488, 381615, 111109, 425569, 319607, 72656, 372087, 438535, 490711, 277051, 145666, 304984, 122467, 352900, 18150, 491000, 401163, 39663, 268293, 474499, 105782, 138834, 127659, 305000, 44136, 154846, 567254, 301402, 543231, 495357, 42889, 401411, 124601, 581749, 227960, 286503, 315740, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "The image features a computer with a laptop open, sitting on a desk. The laptop is displaying the picture of a woman. The computer also has a microphone attached to it, suggesting that it is being used for communication purposes. A cell phone is also visible in the image, placed in a similar position to the computer.", "mscoco_hallucinated_words": [["woman", "person"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "laptop", "dining table"], "hallucination_idxs": [23, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 164091, "caption": " In the image, there are two dolls positioned side by side. One is a doll dressed in a white dress, and it appears to be a large baby doll. The other doll is placed in a bassinet, and it is a bit smaller than the white doll. There is a single bed in the background, likely a crib for the smaller doll.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 308003, "caption": "According to the picture, there is a wooden bench sitting in a grassy area, surrounded by trees. The bench is made of green and white wood. The scene captures a park setting with benches and outdoor elements.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": "The image features a plate of food, consisting of a sandwich and a salad, and another plate with vegetables. The sandwich is topped with pickles, and the salad is accompanied by a knife. The knife is not being used in the scene, as it appears to be positioned at the side of the plate. The plate is placed on a countertop in a restaurant or possibly a kitchen setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "fork", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 95908, "caption": "The image features a mouse pad placed on a desk, which has a pair of keyboards, a computer mouse, and a silver coffee cup with a mug inside it, sitting on top of the mouse pad. The mug appears to have a mouse as its design, making it a unique item for this setup. A keyboard is positioned near the mouse pad, and another one is found further down towards the right side. The scene conveys a cozy atmosphere with a combination of computer devices and a mug, potentially used for heating or serving hot beverages while using a computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 329614, "caption": "The image depicts a road in a city, with a double-decker bus down the road surrounded by cobblestone streets. There are three people visible in the scene, standing at different locations along the road. Two cars are also present in the image, one near the bus and the other farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 324008, "caption": "The image features a tray filled with a variety of food items, including bread, bananas, and yogurt. The food items are placed on the tray, which is situated on a table. Bananas are located on the right side of the tray, while bread and yogurt are located in the middle and towards the left side, respectively. The scene captures a portion of an airplane, with a chair and a table occupying a significant part of the image.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "train", "orange", "airplane", "person", "banana"], "hallucination_idxs": [76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.42857142857142855}}, {"image_id": 52700, "caption": "According to the picture, a large group of people is gathered around a dining table, which is covered in an assortment of cakes and desserts. The cakes appear to be served on small plates, placed along the length of the table. There are an assortment of cakes, ranging from white to yellow, indicating a diverse selection of treats. The people are standing close together, enjoying the spread and socializing with each other. Some individuals are holding dishes with their hands, while others are simply standing and observing the assorted treats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 501311, "caption": "The image features a baseball game happening on the field. There are several players scattered around the scene, some closer to the pitcher\u2019s mound and others further away. One of the players, wearing a baseball glove, is on the mound, and he appears to be in the middle of pitching the baseball. The other players in the scene are likely positioned to catch the ball or to play various positions on the field. The field seems to be well-maintained and prepared for the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 247179, "caption": "According to the image, there are several elephants parading down a street, with people in various positions accompanying the animals. Some people are standing near the elephants, while others are riding on their backs. The elephants are decorated with lights, making them more spectacular. In total, there are at least six people visible in the scene, and at least four elephants. The people are spread out along the parade, some walking alongside the elephants and others riding them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "clock", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": "According to the picture, there is a clean and well-maintained bathroom with a blue bathtub in the center. The bathroom has a toilet on the left side of the room. The sink and the bathtub are located near each other and are the main focal points of the bathroom. There are no other objects or decor in the bathroom, making it a minimalistic space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toothbrush", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 196341, "caption": "The image features a group of people gathered in front of a large round clock, which can be seen through the crowd. There are 20 people visible, some in the foreground and others in the background, with a mix of silhouetted and more visible figures. They are standing underneath the clock, possibly waiting for it to change or admiring it while enjoying their time together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "The image features a parking sign with text, indicating left and right only signs. The sign is placed on a pole, near some cars and a street with a sidewalk. The cars are parked close to the street, with some further down and a few closer to the parking sign.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 455610, "caption": "According to the picture, there is a red, green, and purple motorcycle parked on the side of a street. The motorcycle is surrounded by a group of people, with some standing near it and others in the background. The motorcycle appears to be a large, unique design, attracting attention from those passing by. There is also a truck present in the scene, positioned towards the right side of the image.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 39053, "caption": "The image features a table topped with a variety of food items, including pita bread, cheese, and bowls of food. The table is placed on a red and white dotted surface, creating an inviting dining atmosphere. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "cell phone", "apple"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 493862, "caption": "According to the picture, there is a young man standing on a hill holding a tennis racket. He appears to be playing a game of tennis, enjoying the outdoor atmosphere while taking in the scenery.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tennis racket", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 269873, "caption": "According to the image, the scene is a black and white photo of a large group of people walking on a street, likely in a city. The people are in various positions, some moving to the left or right, while others are standing around.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 524656, "caption": "The image features a small kitchen with a stove. The kitchen is well-equipped with appliances, including a microwave, oven, and a refrigerator. There are also multiple cups placed on the countertops, indicating that the kitchen is used for preparing and consuming meals. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toaster", "cup", "knife", "oven", "refrigerator", "bowl", "spoon", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.36363636363636365}}, {"image_id": 369547, "caption": "According to the picture, there is a small black and white dog standing on the beach, watching a bird flying nearby. The bird appears to be flying in a sky, with a different bird farther away. The dog and the birds can be seen at different positions on the beach. The area is covered with pebbles, creating a unique environment for the birds and the dog to interact.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "dog", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": "The image shows a man standing in front of an open refrigerator, with a confused expression on his face. He is leaning towards the open refrigerator, possibly looking for something. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bowl", "bottle", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 128119, "caption": "The image features a large, white city bus driving down a street, with people walking near the bus. There is another, smaller bus further in the scene. Two bicycles are visible, one on the sidewalk and one hanging from the bigger bus, which appears to have been ridden alongside it and left attached as a mode of transport. The other people in the image are going about their day-to-day activities, some walking on the sidewalk and others near the bus.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "handbag", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 401027, "caption": "The image features a plate of food, accompanied by a variety of items. The meal includes bread, cheese, and cucumbers. The plate is placed on the table, and there are multiple cups and bowls on the table as well. ", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["cup", "knife", "dining table", "sandwich", "fork"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 253630, "caption": " According to the picture, a living room has a black coffee table sitting on the carpeted floor in front of a TV. There is a sofa placed against a wall on the left side of the room and a potted plant can be seen on the floor close to the sofa.", "mscoco_hallucinated_words": [["sofa", "couch"], ["sofa", "couch"]], "mscoco_gt_words": ["potted plant", "chair", "tv", "dining table"], "hallucination_idxs": [27, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.75}}, {"image_id": 326021, "caption": " The scene features a woman sitting at a table with a teddy bear and holding a sign that reads, \"IDK.\" She is sitting in the middle of the table, which is draped with a red tablecloth. The table is surrounded by chairs, with a couple of them placed close to the woman, and a few others scattered around the room. The presence of a few pedestals at different locations in the scene suggests that they might be used to place the teddy bears or other items on display. Overall, it appears to be an interesting scene featuring the woman with her teddy bear, with her sign indicating her stance on the issue at hand.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "person", "dining table"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 317320, "caption": "The image features a person riding a bicycle down a street with yellow lines. The bicycle is parked on the side of the road, and there are multiple people in the scene walking and standing nearby. There are a few handbags scattered around the area, suggesting that the people may have various possessions with them.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 507171, "caption": "The image depicts a man surfing on a wave in the ocean. He is riding the wave and appears to be enjoying the experience. Several other surfers can be seen in the background, perhaps enjoying the same wave or waiting for their turn to ride it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 161227, "caption": "In the image, there is a chair with a brown leather cushion in the foreground. A cell phone is placed on the cushion, as well as a remote control. There is another remote control positioned closer to the left side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "keyboard", "laptop", "cell phone", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 25812, "caption": " According to the picture, a person is skiing down a snowy slope in the snowboarding area. They are wearing a black jacket and a black hat, and they are surrounded by other skiers and snowboarders. The snowboarders appear to be enjoying their time on the hill, with some skiing off to the side and others further ahead. The scene captures a casual and fun atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 267537, "caption": " In the image, a group of people is sitting on a set of steps, enjoying a rainy day while watching some sports. They are all wearing raincoats holding umbrellas, which are open to protect themselves from the rain. There are six people in total, with four of them sitting next to each other and the remaining two sitting slightly apart. The scene includes a couple of chairs and a couple of umbrellas.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tie", "umbrella", "bench", "person"], "hallucination_idxs": [73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 195645, "caption": "According to the picture, there is a group of people gathered around a dining table. There are several individuals, including one woman, engaging in conversation or sharing a meal. The table appears to have several cups, likely for drinks, placed on it.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["handbag", "bench", "dining table", "bottle", "wine glass", "person"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 155671, "caption": "According to the picture, there is a street with traffic driving down it, including cars, trucks, and a bus. The street is lined with tall trees, some of which are visible along the edges of the road and in the middle. There are also traffic lights, which help regulate the flow of traffic, and pedestrians walking along the road. In the sky above, clouds are present, creating a cloudy atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412894, "caption": "According to the picture, a busy city street is depicted with people walking, crossing the street, and waiting in line. A sign is hanging above the street, possibly advertising information for the city. There are multiple traffic lights and pedestrian lights in the scene, guiding the foot traffic and managing the flow of the crowds. The street is lively and bustling, reflecting the city vibe.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "handbag", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 32625, "caption": "The image features a dining table with various food items. There is a plate of food on the table, which includes food items like soup and salad, giving it a variety of dishes. The table is surrounded by chairs for dining and some other kitchen utensils.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "fork", "book"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2}}, {"image_id": 18688, "caption": "According to the image, a baseball game is happening in a public place. There are multiple people on the field, actively participating in the game. Two people are wearing baseball jerseys \u2013 one at home plate and one at the catcher position. Another person is wearing a shirt, likely a uniform for the game. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 343534, "caption": "According to the image, there is a person in the form of a doll standing next to two stuffed animals. The person is wearing a sweater, and the doll is wearing a pink dress and a beret. One of the stuffed animals is a teddy bear, while the other is a bear of a different type. The teddy bear is sitting on a table, while the other stuffed animal is positioned in front of the person.", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"], ["bear", "bear"], ["table", "dining table"], ["person", "person"]], "mscoco_gt_words": ["teddy bear", "potted plant", "dog", "book", "chair"], "hallucination_idxs": [8, 23, 56, 69, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.7142857142857143, "Recall": 0.2}}, {"image_id": 370657, "caption": "In the image, a group of people is riding a motorcycle together, with a mother and her two children in the back. There is a man wearing a face mask sitting in front of the children, possibly to protect them from airborne particles. There are two additional adults in the group, one riding a motorcycle beside the mother and child, and another standing next to them. They are traveling down the road, possibly on an excursion or enjoying a day out.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": "According to the picture, a man is sitting in the middle of a sports field surrounded by many chairs and a crowd of people. He is holding a hot dog in his hand and smiling, indicating that he is enjoying the event or game happening around him. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 460347, "caption": "According to the picture, there is a large bus driving down the street. It's parked at an intersection, with cars scattered on the road both ahead of it and behind. There are several cars visible on the street, with some parked closer to the bus and others further away. The scene takes place on an empty road with a median down the middle, indicating that it is likely a high traffic area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 398753, "caption": "According to the picture, it is a sunny day with a cloudy sky, as a baseball game is in progress on an outdoor field. The field is surrounded by a crowd of people, with many individuals spread across the field, watching the game. There are several people scattered around the field, some of them holding baseball bats. It appears to be an exciting and lively scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 67833, "caption": "The image features a dining table set for a meal. It's adorned with two different kinds of pizza, placed on a tray and a plate, respectively. One of the pizzas is topped with vegetables, and the other one is a seafood pizza. There are also several bottles on the table, including a wine bottle, a bottle of water, and a bottle of sauce. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 272880, "caption": " The image displays a plate with a piece of pizza on it, which appears to be a toasted pizza slice. The pizza slice is placed at the center of the plate, highlighted by the white paper holding it, possibly referring to a menu or a flyer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 254368, "caption": "According to the picture, there is a man in a bed eating food. The man is wearing a white robe and sitting on the edge of the bed. He is holding a donut in his hand as he eats it. A plate is also present near the man, possibly with the food he is eating.", "mscoco_hallucinated_words": [["donut", "donut"]], "mscoco_gt_words": ["pizza", "bed", "person"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": "According to the picture, there are multiple people in the scene, including a baseball player standing in the outfield, holding a baseball bat and getting ready to swing. The player is wearing a black uniform, and there are also other people present in the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 459141, "caption": "According to the picture, a woman is riding a bicycle down a street. She is wearing a hat, carrying a bag, and has a baby with her on the bike. She is dressed in a light jacket, and the baby might be a child of hers. The bicycle is the main focus of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image features an airport scene with various vehicles and an airplane. The airplane is parked at an airport tarmac, and it appears to be large, with multiple propellers on its engines. There are several vehicles in the scene, including three trucks and two buses.", "mscoco_hallucinated_words": [["bus", "bus"]], "mscoco_gt_words": ["truck", "suitcase", "handbag", "airplane", "person"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 227031, "caption": "In the image, a man and a woman are cutting a cake together, surrounded by a group of people. The man and woman are standing next to each other, with the groom holding a knife, and the bride holding a piece of the cake. The cake is placed on a table, and there are several chairs visible in the scene, possibly indicating that the people in attendance are sitting down.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["knife", "dining table", "tie", "potted plant", "cake", "person"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07142857142857142, "Recall": 0.6666666666666666}}, {"image_id": 209347, "caption": "In the image, a woman is sitting on a train and looking at her cell phone, possibly using it for navigation or communication. She is wearing a jacket, and there are two other people in the scene. Another person is also sitting further back in the train. All three individuals are sitting and appear to be relaxing while traveling. The woman is the main focus, while the other people seem to be passengers or accompanying her on her journey.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "tie", "train", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 523292, "caption": " The image features a cat sitting on top of a white and grey laptop, which is positioned on a table or a chair in a room. The cat appears to be examining the laptop screen, possibly trying to understand what it shows. The image captures the cat's attention as it reaches out its paw and touches the computer screen. The scene suggests a domestic setting, likely a living room, with a couch in the background.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["tv", "dining table", "laptop", "cat", "chair"], "hallucination_idxs": [79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.8}}, {"image_id": 12375, "caption": " The image features a plate filled with a variety of food, including meat, vegetables, and carrots. The meat appears to be roasted chicken, while the vegetables include the carrots and tucked carrots. There is also potatoes present on the plate. The food is arranged neatly and neatly presented on the dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 13550, "caption": " According to the picture, a woman is squatting down next to a baby elephant while feeding it, likely with a bottle or a handful of bananas. The woman is wearing sunglasses, possibly enjoying the moment with the elephant. There are three other people in the scene, all located at different parts of the area. The elephant appears to be appreciative of the woman's efforts to feed it.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["banana", "banana"]], "mscoco_gt_words": ["person", "handbag", "elephant"], "hallucination_idxs": [22, 27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": " According to the picture, there is a young man riding a skateboard down a street, performing a trick in the street. He appears to be in mid-air while navigating the city streets. The scene includes various people passing by the street, with a group of them on the sidewalk and others scattered in different areas of the street. Additionally, there are several cars parked nearby, further emphasizing the urban environment of the scene.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["motorcycle", "skateboard", "person"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": "According to the picture, a man is standing in a large, arched building, talking on his cell phone. He is surrounded by a group of people, some of whom are walking around the building. The building is filled with various architectural elements, such as arches and dome-like structures, creating a visually appealing and unique atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 209925, "caption": " The image features a baseball player on the mound, preparing to throw a pitch during a game. He is wearing an orange jersey and a baseball glove on his left hand, and he appears to be focusing on his pitching motion. The player can be seen at the center of the image, highlighting his actions on the field. The game is being played on a field surrounded by grass, with players spread out across various parts of the field.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 2154, "caption": " The image shows a shopping cart parked on the sidewalk next to a bench, both situated near a street corner. There are a few people standing near the shopping cart, and one person is further away. Several umbrellas are visible, with one umbrella near the center of the image and another closer to the right edge. There is a bench situated in front of the storefront, and a traffic light is located on the opposite side of the street. The scene appears to be a quiet street corner with a person waiting and a shopping cart left behind after someone has finished shopping.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["car", "umbrella", "bench", "person"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 274931, "caption": "The image depicts a sidewalk with several old cars parked along it, creating an interesting scene. The cars are lined up in a row, with some parked closer to the right side of the image and others further back. There are two parked cars towards the right side of the image, followed by three more cars further down the sidewalk. A few people are walking along the sidewalk, adding to the scene.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "The image features a small, green-colored kitchen with tables and chairs. The kitchen is equipped with appliances such as two ovens and two dishwashers. The dining area is surrounded by chairs, and there are also a few potted plants throughout the room. The table in the dining area is surrounded by chairs, giving it a cozy feel. Overall, the kitchen appears to be well-organized and nicely decorated.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "knife", "oven", "dining table", "potted plant", "sink", "microwave", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 233888, "caption": "According to the picture, a young boy is watching a baseball game on a television in a living room. He is sitting on the floor, engrossed in the game, and appears to be enjoying himself. The television is located in the center of the image and displays the baseball game. In the room, there are three individuals, including the boy, who are either watching the game or enjoying the room's decor.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "potted plant", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 213905, "caption": "According to the picture, a female tennis player is congratulating a fellow female tennis player, who is holding a tennis racket and smiling. The two tennis players are standing on the tennis court, likely celebrating a match that they just finished.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "tennis racket", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 277479, "caption": " The image features a young child, possibly a toddler, standing in front of an open refrigerator. The child is positioned right by the refrigerator door, looking inside. There is a bottle visible near the bottom of the fridge, which appears to have several pieces of food or beverages. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 300784, "caption": "The image shows a group of children, including a little girl, sitting on a bench together. There are six children in total, and they appear to be enjoying a social gathering. They are holding various objects, including a cane and a book. A couch is also visible in the scene.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["person", "bench", "dog", "couch"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image features a beautifully decorated cake, sitting on a white plate. The cake is surrounded by cherries, which add to its delicious appearance. Several cherries are placed on the table, creating a nice display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 169169, "caption": "In the image, there is a large banner promoting the event \"Spring in the City\". The banner has the words \"Spring in the City\" written on it, covering a significant portion of the banner. The words are displayed across the banner, promoting the event around various city locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "potted plant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 409346, "caption": " In the picture, a woman and a man are standing next to a table filled with various foods, including sandwiches, cakes, and appetizers. The woman appears to be enjoying the assorted dishes, possibly at a social gathering. The table also has a bowl, a plate, and a cup. ", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "spoon", "cake", "person"], "hallucination_idxs": [21, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.8}}, {"image_id": 522262, "caption": " In the image, there is a row of parked bicycles along a sidewalk, likely outside of a park or a public area. The bikes are of different colors and are lined up next to each other, creating a line of bicycles on display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "The image captures a man holding an orange under his face, creating an illusion of a fruit on his nose. The orange is being held against a glass window, likely at a store or a public place, making the visual effect more intriguing. A mirror can also be seen behind the man, reflecting the orange and creating a double-illusion. The man appears to be looking at himself in the mirror through the window, further enhancing the illusion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "The image shows a laptop computer with a small cardboard robot figurine sitting on top of it. The robot is positioned in the center of the screen and appears to be staring out towards the viewer. The robot's presence on the laptop screen may represent a new friend, a funny meme, or simply an expression of joy and excitement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": " The image features several white porcelain toilets sitting on steps, with their bowls removed, creating a collection of white chairs on the stairs. The toilets appear to be leaning against a wall or staircase, possibly waiting to be repaired or disposed of. There is a bowl on the ground nearby, possibly from one of the removed toilet bowls. There are several steps in the scene, including a set of stairs going up and down, with the toilets positioned near the bottom. The scene captures the unsightly nature of the discarded toilets, presenting an untidy and unappealing display.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["sink", "bowl", "toilet"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 465601, "caption": "The image features a man standing in the rain holding an umbrella. He is standing in the center of the scene and there are several other people around him. There are two umbrellas visible in the scene, one being held by the man in the rain and the other located at the back of the picture. The people around the man are walking, and some of them have umbrellas as well. The rain is light and not causing too much inconvenience for those standing outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 300124, "caption": " The image features a couch on the sidewalk, with a microwave placed on top of it. The microwave is positioned on the side of the couch, partially covering it. The couch is situated in front of a brick wall and appears to have been placed on the sidewalk, possibly for disposal or to create a unique street-side seating arrangement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "couch", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 46919, "caption": "According to the picture, there is a person wearing snowshoes and carrying a backpack walking down a snowy slope. Another person, also wearing snowshoes, is accompanying the first person. The two individuals are enjoying the hike on the snowy slope.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 434022, "caption": " The image showcases a train traveling down the train tracks. The majority of the train can be seen along the center of the image, with the engine and many car wheels visible. Additionally, there are a few other cars near the front of the train. The train seems to be moving at a moderate speed, with various train cars spread across the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 490491, "caption": "According to the picture, a street sign and traffic sign are located in a gravel-like area under the sky. The signs are situated near the street in a town. There are multiple traffic lights in the scene, which might be controlling the traffic flow in the area.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 175878, "caption": " A group of three people is standing outside, talking and socializing. Among them, there is a man and a woman, both standing on the right side, and a third person standing on the left. They appear to be having a good time and enjoy each other's company.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 12669, "caption": "According to the picture, there is a group of people in a room, with some of them wearing white shirts and others wearing pants. A woman is standing and holding a glass of wine, and there are several other people in the room, seemingly engaged in conversation or looking around. The room appears to be a gallery or exhibition space, possibly for a wine tasting event or similar activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "handbag", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 192496, "caption": "The image features a brown and white train parked on a train track, with a platform nearby. The train appears to be on the move, as it is equipped with a set of wheels. Another train is also visible on the left side of the scene, adding an element of train travel to the picture. The platform appears to be the designated spot where the train has stopped, making it easy for passengers to board and disembark.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 133146, "caption": "According to the picture, the image features a large open room filled with various furnishings and decor. The room appears to be a living space with three chairs and a sofa. The furniture includes two brown chairs positioned near the back of the room, one white chair towards the front, and a brown sofa that covers the majority of the room's space. There are several bottles, books, and vases spread around the living area, including a potted plant near the front of the room.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["tv", "vase", "book", "chair", "couch"], "hallucination_idxs": [72, 86], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.8}}, {"image_id": 53744, "caption": " In the image, there is a cutting board with several pieces of food placed on top of it. There are two sliced tomatoes, with one situated towards the left side and the other towards the center. Among the tomatoes, there is also a piece of cucumber. Near the tomatoes, there is a knife, possibly used to cut the vegetables.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 375902, "caption": "The image features a woman skiing down a snowy slope, holding a pair of crossed skis across her chest, representing her victory in a skiing competition. She is smiling and appears to be enjoying her skiing experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": "According to the picture, the scene features a man in the foreground talking on his cell phone. Another man is in the background, sitting at a table with a group of people. The man in the background appears to be taking a photo, while the woman in the middle of the scene might be the focus of the picture.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cell phone", "bird", "person", "bottle"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 52435, "caption": "The image features a lively beach scene with people enjoying the waves and spending time in the water. There are many people spread out across the beach, some closer to the shore and others further out. Some people are standing, while others are in the water, likely surfing, swimming, or just enjoying the beach environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "umbrella", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": " A cow is standing in front of a store selling food and beverages. The cow appears to be curious about the different bottles on display, ranging from sodas to juices. The store has several bottles positioned on the shelves, creating an assortment of choices for the cow. A bottle can be seen in the left side of the picture, while another bottle is located near the center. There are also other bottles positioned throughout the store.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 261097, "caption": "The image features a young man wearing white shorts walking off the court after a match, holding a tennis racket in his hand. He is leaving the court at the end of the match and appears to be walking towards the exit or the stands. Several people can be seen in the image, including spectators or fans gathered around the area watching the match. There is a chair in the scene, placed near the edge of the court, likely for a person to sit on while watching the match or waiting for the player to exit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 227204, "caption": "According to the picture, three women are sitting at a dining table, enjoying a meal together. They are smiling and engaged in conversation. On the table, there are a variety of food items such as bananas, bread, and eggs. There is also a cup and a bottle, which might be drinks or dessert. They are all relaxed, enjoying their time at the dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cell phone", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 353981, "caption": "The image is of an airplane on a runway, likely a plane from the past. There is another plane in the background, and it appears to be closer to the left side of the scene. The image is in black and white, likely taken during the early years of the airplane's life. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 185327, "caption": "In the image, a group of elephants is walking down the street in front of a car, accompanied by a person who is likely guiding them or controlling their movement. There are a total of three elephants visible, with two of them in the foreground and one slightly behind them.", "mscoco_hallucinated_words": [["elephant", "elephant"], ["elephant", "elephant"]], "mscoco_gt_words": ["horse", "bus", "cow", "car", "person"], "hallucination_idxs": [7, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.4}}, {"image_id": 348654, "caption": "According to the picture, there is a cell phone sitting on a docking station in front of a TV screen. The TV is positioned on the left side of the scene, while the cell phone is placed on the right side. A computer or keyboard is also present in the scene, positioned above the cell phone and TV.", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 482432, "caption": "According to the picture, the image features a bottle of toothbrush and toothpaste on it. The bottle is placed towards the left side of the image, and the toothbrush and toothpaste are positioned just above it. There is a toothbrush sitting on top of the bottle and another one located to the right, near the bottom of the image.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["toothbrush", "sink", "bowl"], "hallucination_idxs": [9, 18, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 454161, "caption": "The image features a large, personal-sized pizza with many toppings, such as mushrooms and vegetables, sitting on a plate. A wine glass is placed next to the pizza, ready to be enjoyed with the meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "pizza", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 144723, "caption": "The image captures a dog in action as it jumps and leaps into the air to catch a red frisbee that the child is throwing. The dog, which is wearing a backpack, is running with full speed and excitement to catch the frisbee, displaying its agility and playful behavior. The scene showcases the dog's determination and willingness to have fun.", "mscoco_hallucinated_words": [["child", "person"], ["backpack", "backpack"]], "mscoco_gt_words": ["truck", "frisbee", "car", "dog"], "hallucination_idxs": [22, 33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 129707, "caption": "The image captures a person holding a cell phone in their hand, displaying a large black telephone. The person is likely using the phone to make a call or access a feature on the device.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "The image features a white Nintendo Wii remote and a white Nintendo Wii game, both sitting on a table. There are two people in the picture, one closer to the camera and another one on the left side. The remote and game are placed next to each other, indicating they are part of the same package or collection.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["book", "remote", "dining table"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 203110, "caption": "The scene is of a blue and white trolley traveling down the street, surrounded by several people. There is a car, situated in the middle of the street near the train, as well as a person standing nearby. Another person is located at the left edge of the image, while two more people are situated on the right side of the street, walking alongside the trolley.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bus", "train", "orange", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 556478, "caption": " The image features two pizzas, each positioned on a pizza tray placed on a stove. One of the pizzas has a base topped with a variety of ingredients. The other pizza is covered in spinach and alfalfa. They are located on a kitchen counter near the oven, occupying the center part of the scene. Various kitchen tools and utensils can be seen, such as a bottle and two knives, spread throughout the image, including one near the left side of the picture and two close to the pizzas.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["pizza", "cup", "oven", "bottle"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 103815, "caption": "According to the picture, two baseball players are on the scene. One of them, the pitcher, is actively throwing a baseball, while the other one, a batter, is holding a baseball bat, ready to hit the ball. The baseball, in the air, is positioned on the left side of the image. Several other baseball-related objects, such as a glove and a baseball glove, are dispersed throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451943, "caption": " The image features a nicely made bed with two white pillows and a nightstand, situated in a room. The bed is positioned near a window and well-lit, with lamps positioned in different areas of the room. The nightstand is placed close to the bed and houses a few items such as a bottle and a vase. The arrangement of the bed and furniture is neat, suggesting that the bedroom is well-maintained.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["vase", "vase"]], "mscoco_gt_words": ["backpack", "suitcase", "bed", "dining table"], "hallucination_idxs": [56, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 268854, "caption": "In the image, a group of people is gathered on a beach, with some of them carrying items. The people are engaged in various activities, such as walking around, standing, and carrying items. The beach is lined with small boats, with some of them closer to the shore and others further out. This creates a lively atmosphere on the sandy shore.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "person", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 34464, "caption": "In the image, a group of people is enjoying a day at the beach. Some of them are surfing on a small red surfboard, which is on the surface of the water. There are at least 7 people standing in the water, with some closer to the shore and others further out. The beach is lively with people, and there are also a couple more people standing up near the shore.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 495513, "caption": "In the image, there are two people standing close to each other and wearing coats, posing in the middle of a group of people. They have two hot dogs placed on sticks and are smiling. There are at least four other people in the background, and some of them are holding bottles. One bottle can be seen towards the left, and the other three are situated at different positions in the scene.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [56, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": "According to the picture, a bed in a room is occupied by two dogs, one on the left side and the other on the right side. They appear to be relaxing and enjoying the comfortable bed. A variety of cats are also present in the room, and they are scattered across the scene. One cat is sitting close to the left dog, while the other cat can be seen near the right side of the bed. There is also another cat located near the top left corner of the room. The cats are not necessarily interacting with the dogs but can be seen in the same space.", "mscoco_hallucinated_words": [["cat", "cat"], ["cat", "cat"], ["cat", "cat"], ["cat", "cat"], ["cat", "cat"]], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [43, 60, 72, 88, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.45454545454545453, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": "According to the picture, there is a black and white cat sitting on top of a television, looking at it. The cat is on the left side of the TV and appears to be focused on the screen. There are a few people in the scene, with one person standing at the front left side of the image and another one at the back right side.", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [46, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The scene takes place in a city, with traffic on the street in front of a large building. A single traffic light can be found near the center of the intersection, directing traffic towards multiple directions. There are also several cars on the street, including a couple of vehicles close to the traffic light, and two more further away, as well as another car to the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "traffic light", "cat", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 406611, "caption": "The image depicts a crowded ski resort with a large crowd of people gathered on the slopes, enjoying their time on the snow. The people are spread out across a ski run, some skiing, others walking around or standing. There are several skiers visible in the image, with some carrying skis. The crowd is diverse, with people of various heights, all sharing a fun day at the resort.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": "According to the picture, the scene features a white truck with satellite equipment parked in front of a building on the right side. There are a few other trucks in the area, including two parked next to each other and another truck with a satellite dish mounted on top. There is a person walking on the left side of the image and several other individuals scattered around, perhaps observing or interacting with the trucks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "remote", "tv", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "According to the picture, two people, a woman and a man, are sitting at a table and enjoying a meal together. They are eating pizza and drinking beer, as pizza slices and cups can be seen on the table in front of them. The man is holding a camera, possibly capturing the two communicating over dinner. The scene takes place outdoors on a deck, with a bottle on a nearby counter, indicating that they are enjoying an evening meal.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["pizza", "cup", "dining table", "cell phone", "chair", "person"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.6666666666666666}}, {"image_id": 69757, "caption": "According to the picture, there is a woman sitting at a dining table with a man, all of them enjoying a meal together. The woman is holding chocolate, and the man holding a knife, suggesting they are enjoying a piece of chocolate cake. The table is set with various items, including cups, knives, and a fork, indicating a casual and intimate dining experience. The dining table is surrounded by chairs on three sides, allowing for comfortable seating and a relaxing atmosphere.", "mscoco_hallucinated_words": [["fork", "fork"]], "mscoco_gt_words": ["cup", "knife", "dining table", "cake", "wine glass", "chair", "person"], "hallucination_idxs": [64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07692307692307693, "Recall": 0.8571428571428571}}, {"image_id": 432213, "caption": "The image features a table with food on it. There are several items, including a sandwich with a meat and vegetable filling, and a plate with french fries. A glass of drink is visible towards the right side of the table, and there are also several spoons scattered throughout the table. The food is likely served in a restaurant, and the scene has a friendly atmosphere with a touch of enjoyment.", "mscoco_hallucinated_words": [["spoon", "spoon"]], "mscoco_gt_words": ["knife", "dining table", "bowl", "sandwich", "wine glass", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 368367, "caption": " According to the picture, a woman is sitting at a table in a restaurant, talking on her cell phone. She is smiling and appears to be enjoying a conversation. There are several chairs in the scene, with two chairs located on the left side, one placed near the center, and another chair on the right side.", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [11, 36, 43, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "The image depicts a street with a significant amount of dirt and sand covering much of it. A sign reads \"Grand Royal Ascent,\" making the scene appear to be related to a historical construction or construction project. The street is lined with lights, which add a sense of structure to the scene. There are several cars parked along the street, and a few others are also visible in the background. Some people are also seen walking on the sidewalk, and a few cars are parked near them.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [80], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 92131, "caption": "The image features a large, green bus with people standing around it. There are multiple people in the scene, with some in closer proximity to the bus and others more spread out. Some of the people are standing and chatting with each other, while others are waiting to board the bus or may already be on board. The bus appears to be on a street or road, with cars parked nearby and additional people standing in the vicinity. The atmosphere is of people waiting to catch the bus and having discussions while they wait.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["bus", "person", "handbag"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": "The scene in the image shows a woman lying down on the floor, smiling and clutching a large suitcase. The suitcase is positioned behind her, with the bottom left corner visible near the left side of the image, and the top part of the bag extending towards the right side. The woman's face is the center of attention, with her legs visible at the bottom of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "According to the picture, there is a baseball game happening on the field. A batter stands in the middle of the field, holding a baseball bat and preparing to swing at a pitch. The player is surrounded by a crowd of people, including an umpire, who is likely watching him during the game. There is another person in the picture, likely a catcher or an umpire as well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 160183, "caption": "A fire truck is parked outside a building. The truck has a fire ladder extending from the top of it, which is likely used to reach high areas during emergencies. The building appears to be a warehouse, possibly a fire station, and there are several firefighters gathered around the fire truck, presumably having a conversation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 555456, "caption": " The image features a snowy ski slope where people are skiing. There are several skiers present, some on the left side of the slope and others on the right. One person appears to be skiing down the slope, which is the main focal point of the scene. Some of the skiers are wearing skis, while others do not have them, but they still seem to be enjoying the activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 235595, "caption": " The image showcases a row of parked motorcycles on the side of the road, positioned in front of a store or shop, possibly a coffee shop. The motorcycles are a mix of red, maroon, and blue shades, with some sitting closer together and some more spaced out. In addition, a car can be seen parked just in front of the motorcycles. The cars are positioned further down the street, and a few more motorcycles are visible, including one at the very far end of the street, beyond the other parked motorcycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "motorcycle", "potted plant", "car", "parking meter", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 489909, "caption": " The image showcases a wide variety of fruits and vegetables, primarily consisting of bananas, oranges, and a few cucumbers. There are three bananas in the image, one placed in the middle and the other two towards the right side. There are multiple oranges, with at least six oranges displayed, ranging from the left side to the right side of the image. The oranges are of different sizes, creating a visually appealing display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": " The image features a vase arrangement filled with brightly colored flowers. The vase itself is in the center of the arrangement, with a group of pink flowers surrounding it. There is another group of pink flowers located on the left side of the vase, creating a visually appealing composition.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": "The image shows a young girl playing a video game, with a Wii controller in her hand. She is surrounded by various people, who are watching her play and supporting her from different parts of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "tv", "potted plant", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 409138, "caption": " The image features a white and green bus parked on the street, near a parking lot and surrounded by trees. The bus is parked in the middle of the street, away from any sidewalks. There are a few cars parked close to the bus, and another one is parked further down the street. The scene appears to be a typical street with a bus stop, surrounded by parking lots and traffic near a shopping center.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 411840, "caption": "According to the picture, there are two female tennis players walking off the court after a match. One of the players is walking towards the right edge of the court, while the other one is walking in the opposite direction, closer to the left edge. They both carry tennis rackets, one of which can be seen close to the right player and the other closer to the left one.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "backpack", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": "The image features a piece of cheese pizza with bacon toppings. The pizza is placed on a cutting board, and there is a fork positioned next to it, ready to be used for enjoying the food.", "mscoco_hallucinated_words": [["fork", "fork"]], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": "According to the picture, there is a blue teddy bear sitting on a desk, wearing headphones and listening to music. In the background, various items can be seen, including several books scattered around the room and a laptop situated closer to the bottom left of the image. There are also two bottles on the desk, with one placed on the left side and the other located at the top right of the image.", "mscoco_hallucinated_words": [["laptop", "laptop"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear", "dining table", "bottle", "book"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 399058, "caption": "A man is riding on an elephant, traveling down a road. The elephant appears to be very large, as it is the main subject in the scene. The man is positioned on the elephant's back, seemingly enjoying the unique experience. The road is a relatively narrow one, and there is a truck driving alongside the elephant and the man, contributing to the scene's amusing and surreal nature. Additionally, a traffic light can be seen in the vicinity, providing further context to the location and its traffic patterns.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["truck", "bicycle", "elephant", "orange", "person"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 183571, "caption": " According to the picture, there is a man standing on a small, red boat next to a dog on a grassy area, with the boat occupying the majority of the image. The man appears to be holding the dog in his arms, with both the man and dog smiling. The dog appears to be enjoying the situation. There is another person also present in the scene, but they are not interacting with the man and dog. The boat is positioned on the edge of a body of water, possibly a canal or a river.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "The image features a motorcycle on display, surrounded by a crowd of people. There are several people in the scene, with a few near the motorcycle and the others scattered around the room. The display appears to be a private or exclusive event, given the presence of numerous attendees in the room. The motorcycle is likely a customized or unique design.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437910, "caption": "The image depicts a small bathroom with a sink. A toilet is located in the room, and the sink is situated near the toilet. The bathroom is cluttered with various items, including newspapers, cups, and food remnants. A chair can be seen in the room as well, but it is not clearly visible due to the clutter.", "mscoco_hallucinated_words": [["cup", "cup"], ["chair", "chair"]], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [38, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": "According to the image, the scene is a black and white photo of a harbor filled with boats. There are several boats of different sizes scattered throughout the scene. Some of these boats are closer to the shore, while others are further out in the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 467887, "caption": " A man is sitting at a table, using a laptop computer. He is wearing glasses and appears to be focused on working on the laptop. The table is adorned with a few vases, one located in the center, another near the left side, and a third closer to the right side. The man is seated in the middle of the table, and a chair is placed behind him. The scene showcases a living room setting with a sofa situated behind the table, and a potted plant on the right side of the room.", "mscoco_hallucinated_words": [["vase", "vase"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["dining table", "laptop", "mouse", "chair", "person", "couch"], "hallucination_idxs": [35, 94], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8333333333333334}}, {"image_id": 174402, "caption": "According to the picture, the scene is set on a tennis court with multiple people present, one of them being the main focus. The man wearing a yellow shirt is holding a tennis racket, presumably ready to play or practice. Another person is visible, though not in the foreground. There are also some other people in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 352734, "caption": "According to the picture, a young boy is sitting at a dining table eating chocolate cake. He is taking a bite out of the cake with a spoon, feeding himself the sweet treat. There is another person in the background, possibly a parent or family member watching over the boy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 346716, "caption": "According to the picture, there is a street with a red double-decker bus driving down it. The bus is a classic one with white accents. Another red double-decker bus is present, this time in the background, traveling in the opposite direction. There are several people on the sidewalk, some of them walking and others standing. A few cars are also present in the scene, with one on the left side of the street and the other one driving down the street towards the right side. The overall atmosphere is of a busy urban street with a mix of public transportation and traffic.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 451144, "caption": "The image features two men standing on top of a snow-covered mountain, looking out at the view. They are likely enjoying the scenery from a vantage point. The mountain has a lot of snow accumulation on its slopes, making it appear like a winter wonderland. There are also a few other people in the scene, spread out at different positions in the background, likely engaging in snow sports or simply enjoying the mountain view.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 295656, "caption": "According to the picture, there is a group of people, including a man and a woman, standing at a table and eating food together. They appear to be casual, friendly, and relaxed as they enjoy the dining experience. Additionally, there are various types of food items, such as pieces of cake and slices of pizza, on the table in front of them. Among the group, there are a few people with wine glasses and bow ties. It seems that they are having a fun time and possibly gathering for a meal or celebration.", "mscoco_hallucinated_words": [["table", "dining table"], ["cake", "cake"], ["table", "dining table"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["pizza", "clock", "tv", "tie", "car", "chair", "person"], "hallucination_idxs": [22, 59, 67, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.42857142857142855}}, {"image_id": 129886, "caption": " According to the picture, a small group of people is skiing down a snowy slope, enjoying their time on the mountains. In the scene, one person is in the foreground skiing down the slope, and there are several other individuals scattered around the area. Some people appear to be further up the slope while others are closer to the bottom. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557998, "caption": "According to the picture, there is a young woman sitting at a table with a blue liquid in front of her. She appears to be looking at the liquid as she might be studying it. Near the table, there are a few bottles and an empty bottle on the floor, suggesting different types of drinks available. Additionally, there is a notebook placed next to the bottles, possibly for taking notes during the study.", "mscoco_hallucinated_words": [["notebook", "laptop"]], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 401010, "caption": "The image features a bedroom with a neatly made bed. The bed is situated in the middle of the room, surrounded by a few other objects such as a pillow, a few bowls, and a remote control. The room is well-lit, giving the bedroom a pleasant atmosphere. There is another chair in the room positioned near the wall, and two other people can be seen. One person is sitting on a chair next to the bed, while the other is standing near the wall, looking towards the camera.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["remote", "remote"], ["person", "person"], ["person", "person"]], "mscoco_gt_words": ["sink", "chair", "bed", "tv"], "hallucination_idxs": [35, 39, 69, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.5}}, {"image_id": 567562, "caption": " The scene shows a woman and two children sitting at a dining table, enjoying a meal together. The woman has placed a variety of dishes in front of her children, including two bowls of slices pizza and two bottles of soda. They have a casual, relaxed atmosphere as they eat and spend time together. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "dining table", "bowl", "spoon", "bottle", "fork", "broccoli", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.45454545454545453}}, {"image_id": 313928, "caption": "The image features a small outdoor setting by the ocean, with a table and chairs set up on a balcony or patio area. There is a yellow surfboard placed on the table, creating a unique and fun setting for someone to enjoy their meal or time outdoors. Near the table, there's a chair positioned close by, allowing the person to sit down and experience the view and ambiance.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["surfboard", "dining table", "bottle", "wine glass", "chair"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 490284, "caption": "According to the picture, the main focus is a black dog sitting on a couch. The dog appears to be looking up at the camera, providing a relaxed and comforting expression. The background includes a room with two other couches positioned towards the left and right sides of the frame. There is a woman sitting on one of the couches, located in the center, while another person is situated towards the left edge of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346940, "caption": "The image captures a kitchen scene with an oven and tea kettle, both being heated by steam emanating from their spouts. A cup is placed on the countertop, likely containing some tea or coffee, which is positioned near the teapot. The sink is also present in the scene, both in the foreground and background. Various bottles can be seen throughout the kitchen, with some placed near the sink and the tea kettle. The countertop is the main focus of the scene, featuring multiple items in various locations, making it a busy kitchen setting.", "mscoco_hallucinated_words": [["cup", "cup"], ["sink", "sink"], ["bottle", "bottle"], ["sink", "sink"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [24, 46, 62, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.8, "Recall": 0.3333333333333333}}, {"image_id": 222782, "caption": "The image features an older photograph in black and white of a woman walking down a sidewalk past a gas station sign that reads \"Amoco Gas.\" There are several cars parked alongside the street, one at the end of the sidewalk and the others in front of the store. The scene is set in a suburban area, with houses and trees along the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 255934, "caption": "The image features a large elephant and a smaller baby elephant walking together in a dirt area. The baby elephant is next to the adult elephant, creating a unique and special bond between them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 278627, "caption": "The image is a collage of food and dining experiences, featuring multiple scenes with people enjoying their meals. The various food items in the image are rice, shrimp, squid, salad, tomato, and noodles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "potted plant", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 400851, "caption": "The image depicts a beach with a large crowd of people enjoying their time on the sand. People are scattered across the beach, some sitting on the sand, while others are standing or walking around. The beach is also filled with beach chairs, indicating that some people brought their belongings along for the day at the beach.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "umbrella", "chair", "backpack", "boat", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 472610, "caption": "According to the picture, a woman is walking her dog in a grassy field. The woman is standing with her dog, which is next to her. The dog is a little bit bigger than the woman, and it appears to be a pit bull. They are surrounded by a herd of cows, who are spread out across the field. In total, there are around six cows in the scene, with a few of them closer to the couple and others farther away. The field is covered in grass, providing a green background for the animals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": "The image depicts a group of people enjoying a day at the beach. There are at least [IDK] people scattered across the scene, with some in boats and others not. Some of the people are standing, while others are sitting on the beach. The atmosphere is lively and bustling, with people engaged in various activities on the beach.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388237, "caption": " According to the picture, a person is riding a skateboard down a street, surrounded by a group of people who are watching him. There are also two other skateboards nearby, as well as several people standing on the sidewalk. Some of the individuals are situated near a park bench and a car is parked further away. The scene features various street elements, such as traffic lights and parking spots, with a sense of activity and excitement.", "mscoco_hallucinated_words": [["bench", "bench"], ["car", "car"], ["traffic light", "traffic light"]], "mscoco_gt_words": ["skateboard", "chair", "person", "bottle"], "hallucination_idxs": [53, 56, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.5}}, {"image_id": 289640, "caption": "According to the picture, there is a man sitting at a table holding a Frisbee over his head. The Frisbee is placed on top of a bowl located at the center of the table, covering the man's face. A second person appears in the scene, standing behind the table. Two chairs are located in the room, with one at the left side of the table and another at the right side. A handbag is also visible in the scene, placed near the man who is holding the Frisbee.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["handbag", "handbag"]], "mscoco_gt_words": ["cup", "dining table", "frisbee", "orange", "wine glass", "chair", "person"], "hallucination_idxs": [28, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5714285714285714}}, {"image_id": 122602, "caption": "According to the picture, there is a desk covered with paper, including cards, a cutting mat, and scissors. A few items are placed on the desk, including a bottle, a knife, and a few pairs of scissors. Some scissors are located closer to the center of the image, while others are positioned towards the left and right sides of the desk. A few additional items are also scattered around, such as a spray bottle, a glue bottle, and a pair of glasses near the left side of the image.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 24104, "caption": " The image features a neatly organized home office setup with three computer screens and other office equipment. The desk holds two monitors placed side by side, while a laptop can be seen towards the right side of the image. There is also a keyboard and a mouse visible, along with a cell phone in the upper left corner of the image. A few books can be spotted scattered around the area, adding a cozy reading touch to the office environment.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "bed"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 183709, "caption": "According to the picture, there is a group of people, including a man in a black jacket, standing on the sidewalk near a street corner. They are waiting for a traffic light to change, and some of them are looking at the traffic light. There is another traffic light visible in the scene. The majority of the people are gathered around the intersection and near the traffic lights, suggesting that they are waiting for the traffic signal to turn green or red.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 59710, "caption": "The image displays a small black refrigerator sitting on the floor in a bedroom, open to reveal a neatly organized interior. There are several white pieces of clothing, such as dresses and tops, hanging and folded neatly in the refrigerator. The items appear to be tidily arranged, and the closet is empty, making the refrigerator the primary storage space for clothes. The overall setting and arrangement of the appliances and clothing suggest an organized and efficient use of the limited space available in a small bedroom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "refrigerator", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "In the image, a young boy wearing a baseball glove is in the spotlight. He is holding a baseball, and a baseball has been thrown off to the side, likely by the boy. There are a few other baseball gloves visible on the field, but they appear to be stationary. A few additional people can be seen in the background, possibly sitting or standing close by. Overall, the image captures the excitement of a young baseball player about to make a throw or catch the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 314729, "caption": "According to the picture, a vintage black and white photograph captures a baseball player in a baseball uniform, holding a baseball bat and getting ready to swing. The player is wearing an old-fashioned uniform and standing in a classic baseball pose. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": "According to the picture, there is a computer setup with a monitor, keyboard, and mouse sitting on a counter in a coworking space. The monitor is turned on, indicating that it is currently in use. The keyboard is placed closer to the center of the counter, while the mouse is situated near the left edge. The computer setup is sitting near the center of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 411564, "caption": "According to the picture, a man is sitting on a window sill a kitchen, holding a cell phone and looking out the window. The man is leaning against the window frame, with a cupboard located behind him. Another cupboard is positioned to the left of the scene. There are also a few bottles visible, placed in various locations around the kitchen. One bottle is near the right side of the scene, while another is towards the middle and yet another can be seen towards the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "dining table", "bottle", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 320350, "caption": "In the image, a couple is posing with their two dogs on a snowy field. They are both dressed up for the occasion, and their skis are clearly visible, indicating that they are about to embark on a skiing excursion together. The couple is standing close to each other, and their dogs are placed neatly between them, creating a harmonious scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 401857, "caption": " In the image, multiple people are gathered around a baseball field as a baseball player, possibly a batter, is up to bat during a game. The batter is holding a baseball bat, completing the swing, and there is an umpire and a catcher in the vicinity. Other players are positioned around the field, seemingly paying attention to the action. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bench", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 285302, "caption": "According to the picture, a young boy is sitting at a dining table, cutting paper scissors. He is focused on his task and appears to be enjoying himself. The table has several chairs surrounding it, providing seating options for other family members or guests. There is a variety of books placed on the table, likely for the child's entertainment and learning.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["person", "chair", "scissors", "dining table"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 61959, "caption": "The scene in the image is a busy intersection where a white bus, possibly a city bus, is traveling along the road. The bus is driving past a stop sign and appears to be on its route, traveling towards an intersection with multiple cars around it.", "mscoco_hallucinated_words": [["stop sign", "stop sign"]], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 229659, "caption": "The image features a laptop placed on a table in a restaurant, with chairs surrounding it. There are two people in the room: one person sitting closer to the laptop, and another person who seems to be farther away. There are multiple bottles, cups, and a spoon on the table, as well as a carafe situated near the middle of the scene. The restaurant has a modern and cozy atmosphere, with chairs positioned around a wooden table.", "mscoco_hallucinated_words": [["cup", "cup"], ["spoon", "spoon"]], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [49, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.7142857142857143}}, {"image_id": 473118, "caption": " In the image, a young man is riding a skateboard down a ramp, performing a trick in the park. The skateboard is located near the top of the ramp and the person is skillfully balancing while in motion. Several other people can be seen in the surrounding area, likely watching or waiting to skate. There are also some bicycles, including one parked in the middle of the park and another in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 391222, "caption": "According to the image, two women are standing in the snow, each holding a pair of skis. They are looking at a sign that is posted on the ground, possibly providing information about the trails. The women might be planning to embark on a skiing or snowshoeing excursion, as they are wearing backpacks, which could contain skiing equipment or other essentials for their activity. The scene features snow-covered landscape, as well as some trees and some benches nearby.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [85], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 416104, "caption": "In the image, a group of people is gathered around a table at an outdoor dining area, enjoying their time together. They are sitting on chairs under umbrellas, with some people standing around the table. It appears to be a lively and social scene, as they engage in conversation and enjoy their meals. The table is set with a variety of beverages and a few cups, providing refreshment for the patrons.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["umbrella", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 511852, "caption": " The image features two people skiing down a snowy slope. There is a man in front and a woman behind him. They are both wearing skis and have backpacks on their backs. One person is closer to the left side of the scene, and the other is closer to the right side. They appear to be in the process of skiing down the mountain, enjoying the winter activities. The scene captures the two individuals on their skis with the snow-covered mountain in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 59386, "caption": " In the image, a group of giraffes is gathered around a tree, feeding on leaves. The giraffes are placed on a grassy field with various sizes. Some are larger than others, and they appear to be eating the leaves from the tree, which is located in the center of the group. There are a total of six giraffes in the scene, all spread out across the grass field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "The image features a green traffic light with a smiley face on it, sitting at the side of a street. The street is located on the side of a city, with multiple cars parked or driving in various directions. There is a total of seven cars visible in the scene, two in the front of the street, one in the center, and four more parked in the background. The traffic light is situated close to these vehicles, with one car directly to its right and another one positioned towards the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": "According to the picture, a man is riding a bicycle down a street, with a dog following closely behind. The man is wearing a blue shirt, and the dog is wearing a black and grey shirt. The man and the dog are both crossing at the same intersection. There are two traffic lights visible in the scene, one at the left edge of the image and the other near the middle. A car is parked further away from the scene, possibly indicating a parking area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "dog", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 265628, "caption": "The image features a white toilet with the seat on the floor, accompanied by a man nearby. There are two men in total, one standing further back from the toilet, and the other standing closer to the toilet, both wearing shoes. The white toilet takes up the majority of the image, with the men standing in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499594, "caption": "According to the picture, a dinner plate features a variety of foods. The plate contains a large piece of meat, such as beef or pork, several bowties, and asparagus. The vegetables, including broccoli, are arranged on the plate in a visually appealing way.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bottle", "fork", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 5934, "caption": " The image showcases a baseball glove and a baseball sitting on a table. The baseball glove is positioned closer to the top of the image and the baseball is placed in the middle of the table, surrounded by the baseball glove. The ball appears to be on top of the glove, with a ball within it. The scene gives an impression of a baseball game in progress or a baseball player preparing for an upcoming game.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["player", "person"]], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [12, 36, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "The image features a city or urban setting with a river running through it. There are several boats visible along the river at night, possibly illuminated by lights, creating a scenic and festive atmosphere. Several people are walking along the riverbank, with a few individuals closer to the middle of the scene. They appear to be enjoying the evening and the view of the boats and bridge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": " In the image, a white dog and a brown dog are running together in a grassy field, chasing after a frisbee. The two dogs seem to be having fun playing with each other, enjoying their time in the grassy field. There are several other dogs present in the area, some running or milling about, while others are scattered throughout the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bench", "dog", "frisbee", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 255483, "caption": "According to the image, there is a person sitting at a desk, holding something in their hand. They appear to be reading or editing a piece of paper. There are several papers laid out around them, one of which is open and partially visible. It seems that the person is working on a newspaper article. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "person", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 131138, "caption": "The image showcases a neatly organized office computer setup featuring two screens and a keyboard on a desk. One monitor is positioned towards the front side, while the other monitor is closer to the middle of the desk. The keyboard is placed near the two monitors. A computer mouse is situated nearby, towards the back of the desk. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "spoon", "laptop", "potted plant", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 258209, "caption": " According to the picture, a man is sitting on a bench in a public area. He is dressed in a suit and tie and appears to be reading a book, relaxing and enjoying the moment. There are also several other benches in the area.", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["bicycle", "book", "bench", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 347453, "caption": "The scene takes place in a bustling city street with multiple red buses traveling down the road. The street is lined with tall buildings, creating a bustling urban environment. There are a total of five buses visible in the scene, with one slightly in front of the others. There are also a few cars on the street, and two trains can be seen in the background.", "mscoco_hallucinated_words": [["train", "train"]], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 285820, "caption": " The image depicts a small room containing a long couch, a table, and a refrigerator, all arranged around a large window. There are two televisions in the room, one of which is larger than the other. A dining area is positioned near the refrigerator and the other TV is against the wall. There are three cereal bowls on the table. The room appears well-lit, and the furniture appears to be black in color.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["bench", "tv", "dining table", "bowl", "cell phone", "book", "couch"], "hallucination_idxs": [16, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5714285714285714}}, {"image_id": 245664, "caption": "According to the picture, there are multiple electronic devices displayed on a wooden desk, creating a workplace atmosphere. The image includes two laptops, a laptop and a desktop, both open and actively being used. A keyboard and a mouse are positioned near the laptops, allowing for efficient computing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "wine glass"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 453731, "caption": "The image features a dog laying down on a wooden floor, surrounded by various bottles of different shapes and sizes. The bottles are placed in a large wooden cupboard or crate, and they seem to be a part of the setting. The dog is resting comfortably, possibly enjoying its surroundings.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": "In the image, four people are playing a video game together, with one person holding a Wii remote and the other three people holding Wii remotes as well. They are all standing in a living room, enjoying the game. In addition to the Wii remotes, there are two backpacks placed in the living room, possibly belonging to the players or stored in the room.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["remote", "refrigerator", "book", "chair", "person", "couch"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.3333333333333333}}, {"image_id": 253262, "caption": "The scene is set at an airport with a small plane parked on the tarmac. There are several people present, some of them standing closer to the airplane while others are farther away, possibly indicating they are in the process of boarding. There is also an umbrella nearby, which suggests that the atmosphere might be humid or rainy. Some of the people in the scene are wearing suitcases, likely indicating that they are going on a trip or returning from a trip.", "mscoco_hallucinated_words": [["umbrella", "umbrella"]], "mscoco_gt_words": ["truck", "suitcase", "tie", "airplane", "person"], "hallucination_idxs": [50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 87052, "caption": " According to the picture, two people are standing in a kitchen, preparing food together. The man and the woman are both looking at the refrigerator, which is located in the middle of the kitchen. The refrigerator appears to have various food items, such as a bottle on its shelves, fruit, and sandwiches. The people are holding different food items while they are standing in the kitchen, suggesting they are preparing a meal or snack together.", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["cup", "oven", "refrigerator", "bowl", "bottle", "vase", "person"], "hallucination_idxs": [59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.42857142857142855}}, {"image_id": 357578, "caption": " In the image, there is a blue fire truck parked on the side of a street with its red lights on. It's near some cars, including a white one next to it. There are multiple other vehicles present on the street, and they are parked in different positions. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " In the image, a man is sitting on a couch in a living room, enjoying the company of a woman. They appear to be relaxing and having a good time. The living room is well-furnished with a couch, a chair, and a table. There are also two ceiling fans, one on the right and another one on the left, providing lighting for the room. The atmosphere is warm and cozy, with a sofa on the right side, chairs on the left and at the back, and a table placed in the middle of the room.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["remote", "cell phone", "book", "chair", "person", "couch"], "hallucination_idxs": [48, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.5}}, {"image_id": 259345, "caption": " According to the picture, a man is sitting on a bench, holding an umbrella, and smiling. He is dressed in a colorful outfit, which includes a multicolored shirt and a long beard. He is also wearing a baseball cap. Several chairs and a potted plant are visible in the background.", "mscoco_hallucinated_words": [["bench", "bench"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [11, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 119709, "caption": "According to the picture, a woman is standing at a table covered in multiple layers of marble and surrounded by cookies. She is dressed in an apron and has headphones on. The table is lined with a mat, which likely serves as a surface for rolling and storing the cookies. There are numerous glazed donuts arranged across the table, each placed on top of the marble surface. The cookies appear to be a mix of small and large sizes, and they are likely being baked or assembled as a part of a cooking project or event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "donut", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, there are two people, a man and a woman, sitting on a bench in front of a wall. The man is sitting closer to the left side of the bench, while the woman sits to the right of him. Each of them is holding a cell phone, focusing on the device. The scene is set in a public area, possibly a waiting room or a bench area, as the bench is a popular seating option in such places.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": "According to the picture, there is a group of people gathered near a large sculpture of a smiling face, possibly under a sculpture with an oversized eye. They are standing or sitting nearby, observing the artwork. There are numerous umbrellas surrounding the area, providing shade for the visitors and creating a festive atmosphere. There are also a few chairs in the vicinity, likely for people to sit on. Overall, it appears to be a lively gathering with people enjoying the sculpture and their surroundings.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": " The scene is set on a tennis court where a man is playing tennis, holding a tennis racket and preparing to hit a ball. The ball is positioned near the left side of the court, close to where the player is standing. There is also another person standing in the background, not far from the left side of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "In the image, a woman is walking down a street in the city, possibly in China or an Asian city, possibly on vacation. She is carrying a handbag and walking past a parked bus and truck. There are several other people in the scene, some of whom are walking and others are standing around. A couple of cars also present, with one parked on the street and another further back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "handbag", "traffic light", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 276719, "caption": " According to the picture, a small dog is walking down the street, accompanied by a woman. The dog is wearing a bow tie, and several people are standing next to each other, possibly waiting for the dog to cross the street. There are several cars parked along the street, and a few people are walking around the area. A sign is visible in the image, indicating that there is a parade taking place.", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["truck", "handbag", "motorcycle", "dog", "car", "chair", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": "According to the image, a horse is standing in the snow, pulling a covered snow. The horse is wearing a blanket and there are several people the scene, some of whom are standing close to the horse while others are standing away. They may be watching or helping with the horse.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "snowboard", "car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 354559, "caption": "In the image, there is a woman standing against a white wall. She is wearing a black and white stripe dress, and the dress has straps on the top and bottom. The woman has long brown hair and is posing for a picture. She is holding a teddy bear with her right arm while standing close to the camera. The teddy bear is fairly large and takes up most of the woman's arm. She is smiling, enjoying the moment with the toy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": "The image features a large, beautifully decorated cake sitting on a table, surrounded by a group of people. There are at least four people visible in the image, with some of them close to the cake and others farther away. It appears that everyone is gathered around the cake, admiring its delicious appearance and preparing to enjoy it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 468005, "caption": " The image features a large elephant, possibly wearing a hat, walking down a dirt path or driveway near the ocean. The elephant is the centerpiece of the scene and appears to be the main attraction. There are several people standing nearby, some of whom seem to be tourists observing the elephant's progress.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 545594, "caption": " The image features a school bus with a group of people, including children, on board. One man can be seen standing on the bus, wearing a tie-dyed shirt, and holding onto handrails. There are several children on the bus, with one standing closer to the man, and another in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 219546, "caption": "The image features a dining table with various food items, including a chicken dish, a bowl, and various types of meat. There are several knives placed throughout the table, indicating that it's being used for preparing or serving food. In addition to the knives, there are also bottles and cups on the table, and a wine bottle and a wine glass are also on the dining table. ", "mscoco_hallucinated_words": [["cup", "cup"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["knife", "dining table", "bowl", "spoon", "bottle", "fork", "chair"], "hallucination_idxs": [57, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5714285714285714}}, {"image_id": 386645, "caption": "The image depicts a busy street scene with several people walking and some parked cars. A truck and a bus can be seen on the road, with the bus positioned towards the middle of the image. There are also other people walking around, some closer to the camera and others further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 155131, "caption": "The image features a woman wearing a white top and skirt, swinging her tennis racket to hit the ball during a match. She is in the middle of making a play, showcasing her athletic skill. The background features multiple people watching the game, which appears to be a sporting event or a tennis match. There is a crowd gathered around the court, observing and enjoying the game. The woman in the center of the action is focused on the sport, determined to make a play and possibly score a point.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tie", "tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 188752, "caption": "According to the picture, there is a man standing in front of a fruit stand, holding a bunch of blue bananas. The bananas are hanging from the stand, and there are several other bananas visible in the scene, including those on the left and right sides of the picture. The man appears to be the attraction and seller of the blue bananas.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": "The image depicts a bustling outdoor dining area with a large group of people enjoying their meals. There are many people seated at various tables, some of which are surrounded by umbrellas while others are more open. The dining area is situated near a building, with a potted plant visible in the vicinity. ", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["cup", "umbrella", "dining table", "bowl", "tie", "wine glass", "chair", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.375}}, {"image_id": 437720, "caption": "According to the image, there is a woman sitting at a desk in a room. She appears to be typing on a computer. There are several chairs in the room, including a red chair and some other furniture such as a desk. The woman is naked.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 378831, "caption": " The image shows a woman walking down the street, carrying a large grocery bag on her shoulder. She is wearing a blue shirt and appears to be walking along the roadway. There are two fire hydrants visible, one closer to the left side and the other closer to the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375503, "caption": "According to the image, there is a room with several pieces of furniture, such as chairs and a clock. The clock is situated in the middle of the room, and there are two chairs placed in front of it. Another chair is mounted on the wall above the first clock. The room is decorated with various wooden items, including two staircases. The staircases are ornate with decorative elements.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 330369, "caption": " According to the picture, the image features a red and white fishing boat docked at a pier in the ocean. The boat appears to be a large fishing vessel, with a white cabin and red paint on the lower part. It seems to be moored at a harbor or pier, possibly being used for recreational purposes or fishing. The boat is in the foreground of the image, taking up a considerable portion of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, a man is playing with a Frisbee in a park, throwing it into a tree stump. The man appears to be in a hurry and slightly off balance as he tries to make the throw. The location is a park, and there are a few other individuals scattered throughout the park as well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 32190, "caption": "The image features a small black horse toy sitting on the side of a road. The toy looks like it is being used as a traffic hazard. There are several cars parked on the side of the road, with a few closer to the curb and others further away. There is also a fire hydrant present near the curb.", "mscoco_hallucinated_words": [["fire hydrant", "fire hydrant"]], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 535403, "caption": "The image portrays a bathroom with two sinks, a pair of pants and a hairdryer placed on a counter. The bathroom features mirrors and multiple bottles, including one on the left sink and two on the right. In addition to the bottles, there is a comb visible in the scene, as well as a collection of hair dryers. The two sinks are located near each other and are positioned at the end of the bathroom, giving the space an open and spacious feel.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "potted plant", "bottle", "toilet", "sink", "hair drier"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 449798, "caption": "A computer desk is piled with a variety of items, such as toys and figurines, creating a fun atmosphere. The keyboard and mouse are neatly positioned on the desk, and there is a computer monitor on the right side. There are multiple figurines placed around the room, likely representing various characters and themes.", "mscoco_hallucinated_words": [["monitor", "tv"]], "mscoco_gt_words": ["keyboard", "dining table", "laptop", "mouse", "cell phone", "book"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 375769, "caption": "According to the picture, a young boy is sitting at a table, eating a sandwich while wearing a helmet. He appears to be dining at a caf\u00e9 or restaurant, with several dining tables set around a larger table and various chairs throughout the area. Several bicycles can be seen around the caf\u00e9, adding to the ambiance of a lively scene.", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["bicycle", "cup", "umbrella", "dining table", "hot dog", "car", "chair", "person"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 426172, "caption": "The image showcases an assortment of food products displayed in a store. A large wooden table is located in the center of the store, which holds different items like candy, bottles, and books. There are a few bottles positioned near the edge of the table, and several books are placed above it. The store seems to be promoting a selection of food items, including wine and candy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bottle", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 20650, "caption": "The image features a plate filled with various food items, including a delicious-looking sandwich, broccoli, and a couple of slices of tomato. The sandwich appears to be a grilled cheese and is served on a dinner plate, sitting atop a table or a counter. The broccoli is neatly arranged on the plate, and a couple of slices of tomato accompany the sandwich.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "broccoli", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 138078, "caption": "According to the picture, a young boy is standing on a baseball field, holding a baseball glove and waiting for the ball to be thrown. He is wearing a baseball uniform and appears to be ready to play. The baseball field is surrounded by a chain-link fence, indicating that it is a typical baseball field setting. A few other baseball players are dispersed around the scene, possibly getting ready for their turn to bat or playing their positions. The baseball field is well-maintained, with a dirt surface, and a variety of trees can be seen in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 152499, "caption": "According to the picture, a group of people is gathered around a swimming pool, enjoying a party. They are using pool toys to play in the water, with some people standing and others sitting on the pool's edge. The pool is the center of attention as they engage in various activities, such as playing games and socializing with one another. There are two umbrellas present in the scene, one of them close to the pool and the other further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "umbrella", "dining table", "potted plant", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 180764, "caption": "According to the picture, there is a man sitting at a table, enjoying a pink glazed doughnut. The man is in the process of biting the doughnut, with his mouth partially open as he takes a bite. The scene features a kitchen area, likely the location where the doughnut is being prepared or consumed. The doughnut is on the left side of the image and the man is on the right side, focusing on the food he's eating.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["microwave", "donut", "person", "bottle"], "hallucination_idxs": [12], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.5}}, {"image_id": 147762, "caption": "According to the picture, the image features a dessert on a plate. The dessert appears to be a cake, which has been decorated with a yellow topping, creating a pleasant and inviting appearance. The cake appears to have a unique texture, likely a combination of various ingredients.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "cake", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 113860, "caption": "According to the picture, a small barge boat is moving on the water, powered by a motor. The boat appears to be black in color and is traveling along a path of water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 460866, "caption": "The image features a collection of various toothbrushes neatly arranged on a table. There are at least six different toothbrushes of different sizes, shapes, and colors, placed together, creating an interesting scene of toothbrushes in a mug.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "dining table", "bowl", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 177015, "caption": "The image features a man sitting on a couch, holding a cat on his lap. The cat appears to be a kitten, and it is the focus of the picture as it lays comfortably on the man's lap while he uses a laptop computer. The man and the cat can be seen in the center of the frame, surrounded by various other furniture items such as chairs and a table. There is another person in the background, sitting on a chair to the right of the man holding the cat and laptop.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["refrigerator", "laptop", "cat", "apple", "chair", "person", "couch"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.0625, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "The image features a blue kitchen with wooden cabinets, countertops, and a refrigerator. The kitchen is relatively small, with the sink and oven placed in the middle, while the microwave is located on the left side of the kitchen. The refrigerator can be found on the right side, just above the countertop. A knife can be seen on the countertop next to the sink, and a bottle is placed near the microwave. The scene suggests a well-maintained and clean kitchen with the essentials needed for daily cooking and cleaning activities.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["cup", "suitcase", "oven", "refrigerator", "bowl", "bottle", "sink", "microwave", "person"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5555555555555556}}, {"image_id": 47131, "caption": "The image depicts a train on its side, surrounded by people. Some people are standing near the train, while others are closer to the front. There are six people in total, with two of them partially visible, and the rest visible to varying degrees. It appears that the train has derailed or been involved in an accident, and these people are inspecting the incident possibly assisting in the process.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425447, "caption": "The image features a truck parked on a street. The truck has a white color and is equipped with spraying water apparatus, possibly for firefighting purposes. There are several trees visible in the background, including one towards the left, another towards the right, and a couple more near the center of the scene. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "fire hydrant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 295105, "caption": "In a kitchen, a man is standing in front of an open refrigerator, looking inside. He appears to be contemplating what to eat or drink. The kitchen is equipped with a couple of ovens, one closer to the left side and the other further towards the right side. There is a sink in the left side of the kitchen, and a couple of bottles are placed at various heights in the refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "bowl", "bottle", "apple", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 443259, "caption": "According to the picture, it is a large and spacious modern kitchen with a white and blue color scheme. The kitchen is equipped with a sink, refrigerator, and a dishwasher, both of which are located on the counter. The kitchen also has a stove, also situated on the counter. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "dining table", "bowl", "potted plant", "bottle", "orange", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 537861, "caption": "In the image, there is a train traveling down the train tracks. The main part of the train occupies most of the image, with two side engines in close proximity to each other. It appears to be pulling into a train station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "The image features a large donald duck statue surrounded by people standing in front of a large tree decorated with Christmas decorations, creating a festive scene. The donald duck statue is placed at the center of attention, with multiple people in the foreground, some of them standing near the tree and others standing farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bird", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 249969, "caption": "The image features a beautifully prepared meat dish on a white plate, surrounded by a variety of vegetables such as broccoli, carrots, and potatoes. The dish appears to be cooked and ready to be served. The plate is placed on a dining table or a table setting, indicating that it's meant for consumption. Various utensils, including forks and knives, are arranged around the dish, ready for use during mealtime.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 146504, "caption": " In this image, a person is sitting at a desk using a computer mouse on a keyboard, likely working on a project. The keyboard is situated on the desk in front of the person, and the mouse is being used to maneuver around the keyboard. A cup can be seen nearby on the desk, adding a touch of comfort to the work environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 40037, "caption": "The image features a tall clock tower with a clock face, standing next to a church building. The clock tower is surrounded by a large, square area, which could be a city block or a park. The clock tower is a tall building, with the top of it reaching over 30 feet in height. The tall clock tower is a central attraction in the scene, with the clock face and its architectural features attracting attention. There are several people walking around or near the building, as it is a significant feature in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 343218, "caption": " The image depicts a man standing on a tennis court, holding a tennis racket in his hand, preparing to play a game of tennis. The court is surrounded by a net, and there are a few chairs placed near the court. Two cars can be seen parked behind the court, indicating that the court might be located in a public space.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person", "car"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 301837, "caption": "The image features a modern kitchen with a white and blue color scheme. The kitchen is equipped with a refrigerator, oven, and a microwave, all of which are white in color. There are also several knives on the countertop, adding to the kitchen's utility. The kitchen has multiple light sources, such as a sunlight and a set of recessed lights, creating a bright and welcoming space.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["knife", "knife"]], "mscoco_gt_words": ["clock", "oven", "refrigerator", "apple", "sink"], "hallucination_idxs": [26, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.4}}, {"image_id": 309071, "caption": " According to the picture, there is a man standing next to a bicycle, appearing to adjust or fix a flat tire. He is wearing a black shirt and is bending down to access the tire.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "potted plant", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 240681, "caption": " The scene depicts a man standing next to a blue motorcycle on the sidewalk, possibly taking a photo or video on his cell phone. He is wearing a backpack and holding his phone, which appears to be a cell phone or smartphone. In the background, there are a few cars parked along the street, indicating a presence of traffic in the area. There is also a person on the other side of the street, further away from the motorcycle and the main subject.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["phone", "cell phone"], ["cell phone", "cell phone"], ["smartphone", "cell phone"]], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [23, 34, 41, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 178753, "caption": " The scene features a young woman with braces, who is sitting on a bench and holding a cell phone in her hand. She looks at the camera as she answers the call. The setting appears to be a dining or waiting area, with multiple chairs surrounding her. There are some chairs in the foreground, with some positioned closer to the camera. In the background, there are other chairs and a bench, which adds to the crowd ambiance of the scene.", "mscoco_hallucinated_words": [["bench", "bench"], ["bench", "bench"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [14, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 278890, "caption": " In the image, a person is holding a cell phone in their hand, possibly taking a picture or using it for other purposes. A couple of other people are present in the scene as well. The main focus of the image is the cell phone, while the other individuals are in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": "According to the image, a modern bedroom features a large bed situated in the middle of the room. There are two people in the room \u2013 one is located near the bed and the other is positioned near the back. Various objects such as bottles, cups, and bowls are visible in the room, as well as a lamp positioned near the bed. The bedroom setting also includes a desk and chair, along with a closet.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["bowl", "bowl"], ["desk", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["tv", "vase", "book", "bed", "person"], "hallucination_idxs": [47, 49, 52, 75, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.4}}, {"image_id": 542792, "caption": "According to the picture, a large baseball bat is positioned on top of a building. The bat is sitting close to the edge of the building, taking up a considerable amount of space. The building appears to be in the middle of a city block with other buildings nearby. The bat is noticeable due to its size and its prominence in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 293703, "caption": "The image features a group of people at a table in a restaurant, enjoying their meal. They are eating pizza, with a man holding a slice in his hand. The table is set with various drinks and wine glasses. There are four wine glasses placed along the table, two on the left side and two on the right side. In addition to the pizza, there are plates of food on the table. There is a plate with spaghetti, another with meatballs, and another plate with slices of a cake-like dessert. All of these food items are displayed prominently at the dining table, suggesting that the diners are enjoying a casual and relaxed meal.", "mscoco_hallucinated_words": [["wine glass", "wine glass"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "chair", "person"], "hallucination_idxs": [41, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.3333333333333333}}, {"image_id": 318777, "caption": "According to the picture, a baseball game is in progress on a large field. The ball has just been thrown, and a baseball player is positioned near the center of the field, preparing to catch the ball. Several other players are scattered around the field, some of them standing and others in various positions. The field is filled with people, creating a lively atmosphere during the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 14773, "caption": "The image features a colorful, customized car made out of hot dogs, french fries, and other food items placed on top of a plate. The car has a pink and white color scheme, and various vegetables and drinks can be seen surrounding it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "fork", "hot dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 521863, "caption": "The image captures a skateboarder, a man, riding down a public sidewalk in a skateboarding down a busy street or public area while a group of people and a few bicycles move around. The crowd includes people in various locations throughout the scene, with some standing near the street and some gathered closer to the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": "In the image, a group of men on motorcycles is riding down a street in a seemingly abandoned area. There are five motorcycles visible, including two at the back, two in the middle, and one at the front. The riders are wearing helmets, possibly for safety and enjoyment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 211604, "caption": " The image captures a young woman lying on the floor, with a tennis racket nearby. She is wearing a baseball cap, a skirt, and black tennis shoes. Apart from the woman, there is a backpack nearby, possibly belonging to her or someone else in the room.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "The image features meat and a plate of food, including vegetables. The plate of food contains various pieces of chicken, some with dipping sauce, and there are also vegetables on the plate. There is a bottle of beer and another beer nearby in the image. The beer bottle is to the left and the beer is to the right of the plate of food. The total area of the images is around 700 pixels wide by 500 pixels tall.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "cup", "dining table", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 563938, "caption": " The image shows a group of people, including a man, a woman, and another man sitting on a couch, suggesting a friendly gathering. All three people are wearing sunglasses, creating a playful atmosphere. They are each holding cups, which could be drinks or snacks. The scene is likely set inside an enclosed space, such as a patio or a living room, with the furniture taking up most of the area.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["bench", "tie", "bottle", "person", "couch"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.4}}, {"image_id": 461311, "caption": "According to the picture, there is a large group of people gathered in a park, likely enjoying flying kites. The park is filled with numerous kites of various shapes and sizes, creating a lively atmosphere. The people are scattered throughout the park, standing on the grass, all engaged in the fun activity. Some kites are flying high in the sky, adding to the overall visual appeal of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": "According to the picture, a blue teddy bear is sitting on a bed next to a pile of books. The teddy bear appears to be the main focus of the scene, while the books are spread out around it, creating a cozy and playful atmosphere. In the background, a computer can be seen, further highlighting the mix of study materials and toys in the scene.", "mscoco_hallucinated_words": [["teddy bear", "teddy bear"], ["teddy bear", "teddy bear"]], "mscoco_gt_words": ["book", "laptop", "bed", "person"], "hallucination_idxs": [7, 22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 261318, "caption": " A man is seated on a chair, talking on his cell phone. The chair he is sitting on is one of several chairs in the scene. There are also two suitcases nearby, likely stored in the chairs. The man is wearing a red shirt and appears to be on a sidewalk or courtyard area. In the background, a trash can can be seen on the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "bench", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 134768, "caption": "The image features two items sitting on a desk in the foreground, a computer keyboard and a speaker. The keyboard appears to be in focus, while the speaker is partially behind it. The keys on the keyboard are visible, creating an interesting perspective of the device. The speaker has a pair of speakers, suggesting that it's not a single, portable speaker.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "laptop", "mouse", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 526414, "caption": " The image showcases a cozy home office with a bedroom in the background. There are three people in the room: one in the left side, one in the center, and another on the right side. The main focus of the room is the sewing machine placed on the large wooden dining table in the center of the space. There are also several items scattered around, including a couple of bottles, a laptop, and a television situated in the living room.", "mscoco_hallucinated_words": [["person", "person"], ["bottle", "bottle"]], "mscoco_gt_words": ["remote", "cup", "tv", "dining table", "bowl", "laptop", "cat", "book", "scissors", "couch"], "hallucination_idxs": [17, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.3}}, {"image_id": 273132, "caption": "In the image, an elderly woman is sitting on a bench, leaning against a fence. She is wearing a hat, and looks like she might be posing for a picture. The woman is situated on the left side of the bench, with the fence appearing on the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": " The scene depicts a man holding a hot dog in his hand, taking a big bite out of it. He is laughing and having a good time, enjoying his snack. Another man can be seen in the background, sitting on the bench. The hot dog is placed in the center of the image.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["sandwich", "hot dog", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": "According to the picture, there is a baseball player in a black uniform, waving to the audience with his glove. The player appears to be walking on a grassy field while holding the baseball glove, possibly preparing to throw the ball to a teammate. The overall scene depicts a baseball player actively engaging in the game, with the player standing on a field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 272728, "caption": "The image features a beautiful blue vase filled with various flowers, creating a charming arrangement. The flowers, likely a mixture of pink and purple varieties, are prominently displayed, adding color and vibrancy to the scene. The vase is positioned in the middle of a table or tray, creating a focal point in the room. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 509811, "caption": " According to the picture, a person is seated in the middle of a large room filled with numerous tables. There are several laptop computers arranged on the tables, some of which are open, others closed. The tables are placed in various positions, with some placed close to others on the floor. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "apple", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 412407, "caption": " According to the picture, there are several people standing in different locations throughout the scene. In the center of the image, a woman is holding a cell phone in her hand, possibly showing the screen to someone else in the room. Another cell phone is also visible in the upper-right corner of the image, further demonstrating the importance of cell phones in the scene. A few other cell phones are scattered around the background, suggesting multiple people in the room holding cell phones.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "According to the picture, there is a woman walking down a street with an umbrella, possibly to protect herself from the rain. She is accompanied by another woman, who is also carrying an umbrella. Both women appear to be walking together, possibly enjoying a day outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 441363, "caption": "The image features a man standing on a sports field with a red shirt, swinging a tennis racket and preparing to hit a tennis ball. There are several other people present in the scene, with some further away from the main action and others closer to the man with the tennis racket. There is also a clock in the picture, located in the background, suggesting it might be an official sports clock.", "mscoco_hallucinated_words": [["clock", "clock"], ["clock", "clock"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [60, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 331084, "caption": " The image displays a woman standing at a dining table with several pizzas prepared and ready to be served. She is working on cutting the pizzas and is using a knife placed on the cutting board. A variety of pizzas are spread out on the table, with some at the left edge, one in the middle, and another on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table", "bowl", "spoon", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 282553, "caption": "According to the picture, a man is walking down a dirt path near a pond. The dirt path appears to be near a park, as there are a few benches visible in the scene. The man is the only person present in the image. There are no other people walking in the vicinity or in the park.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, there is a woman standing next to a bench, leaning over it as if she is stretching or adjusting her shoe. She appears to be getting ready to ride a bicycle, which is parked nearby. The scene is set in a park-like environment, with a few trees and some pedestrians walking nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 3103, "caption": "According to the picture, there are several people in the image, including a man who is the primary focus. He is wearing a blue shirt and is performing a trick on his skateboard, possibly jumping over a ramp. The other people are observing the action or waiting their turn to skate.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "According to the picture, it depicts a large, empty kitchen with a dining room and a storage room. The kitchen is equipped with appliances, including a refrigerator, a sink, and a range. There are also bowls on the countertops, adding to the kitchen's setting. The space appears spacious and well-organized, with various items spread throughout the room.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["dining table", "bowl", "orange", "sink", "carrot", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 449675, "caption": " The image features food on a table, specifically a sandwich and a salad. The sandwich is cut in half, and the salad is placed on the side. There are also knives, ready to be used for dining. The scene takes place on a dining table, which appears to be made of wood, with chairs situated at the sides. This table is occupied by the sandwich and salad, as well as various utensils such as knives.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "knife", "dining table", "spoon", "bottle", "sandwich", "fork", "carrot"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.375}}, {"image_id": 191691, "caption": "According to the picture, the man is shaving his face in a bathroom, likely getting ready for the day. He is standing near the sink, focusing on the shaving process, while also being aware of his surroundings. There are multiple bottles in the bathroom, one of them being close to the sink and another placed towards the top left corner of the image.", "mscoco_hallucinated_words": [["sink", "sink"], ["sink", "sink"]], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [27, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 89378, "caption": "According to the picture, a man is standing next to a red motorcycle. He is posing next to the bike as if he is taking a photo, possibly in front of the building in the background. Additionally, there is another person in the scene, likely positioned in the foreground.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 99874, "caption": "According to the picture, a woman is sitting at a table in a restaurant, holding a cell phone to her ear and talking. She appears to be focused on the call and the food in front of her. In the background, there is another person at the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "bottle", "cell phone", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437895, "caption": "The image features a colorful and intricate four-poster bed with a mat covering it, situated in the middle of the room. There is another piece of carpet located above the bed. The room is decorated with various items such as a plant in one corner, a vase near the middle of the image, and a lamp on the right side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557447, "caption": "According to the picture, a man is sitting at a table, holding a document, possibly pointing out a location or an important detail to his co-workers. He is dressed in a suit, and there is another person sitting next to him, also in a suit. A few other chairs can be seen in the scene, but no people are occupying them at the moment. The man is focused on his work, with a look of interest or attention on his face.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["knife", "dining table", "tie", "spoon", "fork", "person"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 159979, "caption": "In the image, there is a group of people gathered near a body of water. They are sitting around tables and benches, enjoying their time on the water. Some people are seated under various types of umbrellas and shade, while others are standing nearby. There are several umbrellas surrounding the group, providing shade for those who seek it. In total, there are 11 umbrellas spread out, some larger and some smaller.", "mscoco_hallucinated_words": [["table", "dining table"], ["bench", "bench"]], "mscoco_gt_words": ["bowl", "boat", "umbrella", "person"], "hallucination_idxs": [21, 23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 357, "caption": "According to the picture, multiple people are participating in a baseball game, taking place on a field. There are 10 players visible, with some standing close to the batter during a potential swing, and others dispersed around the playing area. In the center of the image, a baseball is on the ground, likely ready to be caught by the catcher after the swing. The players are wearing baseball uniforms and gloves, indicating a professional setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 538891, "caption": " The image features a young woman holding a tennis racket, likely in the middle of a game on the court. She is focused on the task at hand and ready to hit the ball. A few other people are present in the scene, located further behind the woman, but they are not in the direct spotlight. The scene is set on a tennis court, which indicates that the woman is participating in a tennis match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "The image features a train traveling down a train track on a cloudy day. The train is pulling a long vehicle, likely a car or truck, which appears to be carrying several containers or packages. A few small trees can be spotted in the background along the track, adding to the scenery.", "mscoco_hallucinated_words": [["truck", "truck"], ["container", "bowl"]], "mscoco_gt_words": ["traffic light", "train", "car"], "hallucination_idxs": [27, 35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 565778, "caption": " The image features a blue train traveling down a track, passing through a tunnel. The train is in front of the scene and appears to be moving towards the right side of the image. There are two passenger cars on the train, one slightly behind and one in the middle, making up a total of three cars visible in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 7125, "caption": "According to the picture, there are several people walking and riding bicycles on a paved path near a body of water, possibly a river or a lake. A man can be seen riding a bicycle, with a woman close to him on a different bike. Additionally, there are several other people on the path, with a few walking and some riding bicycles. The path appears to be next to the water, providing a scenic view for the cyclists.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "bottle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": "A man is standing in front of a chest of drawers, with a red tie draped over his white shirt. He is holding the tie in his hand, revealing its unique and colorful design with hearts drawn on it. The image is set in a well-appointed dining room, with a couple of chairs on the right and two more on the left. The room is spacious, allowing ample room for various dining experiences.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "tie", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 280930, "caption": "The image features a young girl standing in a kitchen, smiling and holding food she's preparing to eat. The girl appears to be enjoying herself while she cooks in the kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 151081, "caption": "In the image, there is a tractor driving down a dirt road on the left side of the picture. An airplane is flying low overhead, seemingly on a path to fly over a field or a body of water. Another airplane is also visible in the sky, likely a small helicopter, flying close to the first airplane, possibly preparing for landing or taking off. The field in the image is quite large, covering a significant portion of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": "The image shows a man playing tennis, standing on a tennis court. He is holding a tennis racket and has placed several tennis balls on the court, which can be seen around him. The man is wearing a blue shirt and black shorts, and he appears to be playing a game of tennis, hitting the balls around him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": "According to the picture, there is a variety of food on a plate, including a hot dog, french fries, and vegetables. The hot dog is placed on a platter, and fries are placed in smaller dishes on the same plate. There are also a few carrots present on the plate. The food is set on a table, waiting to be served to diners.", "mscoco_hallucinated_words": [["carrot", "carrot"]], "mscoco_gt_words": ["bowl", "hot dog", "broccoli", "dining table"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 198163, "caption": "According to the picture, there is a man sitting at a table, surrounded by various dishes. He appears to be focused on something, possibly creating or painting something. The dishes are arranged all around the table, with some placed on the sides and others in the middle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 3742, "caption": " The image features a vase filled with various apples and oranges, along with a purple flower, placed on a table in front of a window. In addition to the fruits, there are a few apples scattered around the vase and a few more around the base of the window, creating a visually appealing arrangement of color and shape.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["bowl", "vase", "apple", "dining table"], "hallucination_idxs": [10], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 217593, "caption": "According to the picture, a young boy is standing on a baseball field, holding a baseball and getting ready to throw the ball to another player. He is wearing a baseball uniform and appears to be a catcher.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["handbag", "chair", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 279078, "caption": " In the image, a young girl is standing in front of an open refrigerator with bottles inside. The refrigerator is filled with a variety of beverages, with numerous bottles of different shapes and sizes scattered throughout the scene. The girl looks like she is either excited or curious about the contents of the refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": "The image depicts a tennis match taking place on a court. A man is playing tennis, holding a tennis racket and preparing to hit the ball. He is surrounded by a crowd of people, some of whom are watching the game closely, while others are standing further away. In the background, there are multiple chairs that have been set up, likely for spectators to sit during the match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "sports ball", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 477016, "caption": "The image captures a group of people gathered around an oven with a miniature kitchen inside. A woman is standing in front of the oven, and she appears to be the center of attention. There are at least two other people, both standing behind her. They might be her colleagues or friends, and they seem to be enjoying the miniature kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "oven", "backpack", "potted plant", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 301266, "caption": "The image depicts a group of people gathered around a kitchen table, working together on food preparation. There are four people visible, with one in the background and three in the foreground, all actively engaged in the task.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "book", "fork", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 475813, "caption": "The image features a small child standing in a room, holding a cell phone in her hand and looking towards the cell phone. There are several chairs scattered around the room, including one near the center and a few others located closer to the edges of the image. The young girl appears to be standing near the left side of the room, with two other chairs situated nearby, one towards the middle and the other on the right side. The scene suggests that the child is either playing or interacting with her surroundings.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["remote", "handbag", "person"], "hallucination_idxs": [13, 22, 28, 70], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.3333333333333333}}, {"image_id": 558784, "caption": "According to the picture, a man is standing on a grassy field, holding a golf club and preparing to hit a golf ball. He is wearing a backpack, which appears to be quite small.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["backpack", "frisbee", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 308543, "caption": " The image captures a person holding two silver spoons in their hand, possibly preparing to eat or clean them. The spoons are large, appearing to be held together with both hands. The background shows a sink, and a few other items can be seen in the scene. The left side of the image features a chair, and a cup is positioned to the right of the sink.", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"]], "mscoco_gt_words": ["sink", "spoon", "person"], "hallucination_idxs": [61, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 359164, "caption": "According to the picture, there are three dogs in a grassy area. Two dogs, a brown and a black one, are chasing the third one, a brown dog. The two dogs are running towards the third one, which is running away from them. The scene takes place in a grassy area, with several chairs placed around it, and a bottle of juice is visible in the background.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["car", "chair", "dog", "dining table"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 539951, "caption": "The image showcases a room that has a checkerboard floor and a kitchen set up inside, complete with a refrigerator, a stove, and a sink. There are multiple ovens present, possibly reflecting the car theme in the room, as well as a microwave oven. The kitchen also has a sink, creating a more realistic scene for the lego replica.", "mscoco_hallucinated_words": [["car", "car"], ["microwave", "microwave"]], "mscoco_gt_words": ["sink", "clock", "oven", "refrigerator"], "hallucination_idxs": [38, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 143479, "caption": " The scene features a man holding a pan full of various hot dogs. He is standing in front of a bbq grill, possibly preparing food for an outdoor gathering or a party. Another person is nearby, possibly offering advice or support.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": "The image features a blue vase filled with a variety of red and orange flowers. The vase is placed on a table, which is positioned next to a large mirror, creating a decorative display. The flowers are arranged in the vase in various ways, creating a pleasing visual effect.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 238065, "caption": "According to the image, there is a young man riding a skateboard in the air on a cement ramp. He is wearing a white shirt and appears to be enjoying the moment. The ramp is on the left side of the image, with the skateboarder performing a trick on the edge of the ramp. There are also other skateboards visible in the scene, but it is unclear if they belong to the person in the image or are simply in the background. The setting suggests an outdoor environment, perhaps at a skate park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "The image features a bathroom with various items on display. There is a toilet located in the center of the room, and on the wall, shelves house a collection of books, bottles, and other objects. A sign with text, \"magic eye\", can be seen on the shelf. Among the books, there are a couple of bottles and other items. The arrangement of objects creates an interesting and unique setup for the bathroom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "book", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 322163, "caption": "In the image, a man wearing a white shirt and apron is preparing a meal, surrounded by other people who are watching him. There are several other people in the kitchen, some of whom are standing and others are sitting. A few of them are closer to the preparing chef, possibly waiting for the food to be ready. The kitchen appears to be well-equipped with a variety of food ingredients, such as bowls, bottles, and a pizza placed on a table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "pizza", "oven", "dining table", "bowl", "potted plant", "bottle", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 230983, "caption": "According to the picture, a young man is seen performing a skateboarding trick in a park or sidewalk area. He is in the process of jumping up in the air on his skateboard, likely landing on a green object such as a small drainpipe. Another person is present in the scene, possibly watching or participating in the action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "According to the image, there is a very large, brown wooden kitchen area with multiple wood surfaces. The kitchen is equipped with several appliances, including a stove and sink, while a sink is also present in the room. A black motorhome is parked inside the room, and there are various brown and black knives throughout the scene.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["sink", "microwave", "oven", "refrigerator"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 53431, "caption": " A man is sitting on a bench, using a laptop. He is wearing a sweater and appears to be enjoying his time outdoors. A dog is also in the scene, sitting near the man. The bench is placed near some pillars, possibly in a courtyard or outdoor area.", "mscoco_hallucinated_words": [["bench", "bench"], ["dog", "dog"], ["bench", "bench"]], "mscoco_gt_words": ["laptop", "chair", "person"], "hallucination_idxs": [6, 27, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 441840, "caption": " The image depicts a young boy playing baseball with a baseball bat in a grassy field. The boy is holding the bat, getting ready to swing at a ball as an adult looks on. Several other people are present in the scene, with some standing in the background, while others are scattered around the field.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["truck", "car", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 53465, "caption": " The image features a large cake with personalized inscriptions on it, such as \u201cWelcome to our meeting\u201d and \u201cTo John\u201d. The cake appears to have been placed on a tray, sitting on a table along with other dishes and cups. There are multiple people standing in the background around the table, and it seems like they are either enjoying a gathering or possibly part of the event where the cake was served.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 381031, "caption": "The image features various food items, including a couple of slices of apple, peanut butter, bread, ketchup, and a container of juice. There are also cups, bottles, and a spoon on the table, indicating a breakfast or snack setup.", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "apple", "microwave", "donut"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.7142857142857143}}, {"image_id": 530520, "caption": "According to the picture, there is a group of people gathered on the grassy lawn, sitting on the ground, with one man and one woman in particular engaged in an activity. The man is using a laptop, while the woman is also using a laptop. They are working together, possibly collaborating or sharing the laptop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "motorcycle", "laptop", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 539555, "caption": "A man is performing an impressive jumping trick with a yellow frisbee, caught mid-air. There are a couple of chairs visible in the background, but most people are seated to the right of the image, surrounding the man doing the trick.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["skateboard", "frisbee", "person"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": "In the image, two men are standing on the sidewalk next to a street train. They are engaged in a conversation and appear to be waiting for the street train to arrive. The train is positioned in the foreground of the image, covering a large portion of the scene. The men are both wearing backpacks and carrying items as they engage in their conversation. A couple of handbags can also be seen nearby. Overall, the scene suggests that the two men are discussing something while waiting for the street train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 560439, "caption": " According to the picture, a group of people, including three people, are enjoying a snowy day by skiing down a snow-covered slope. They are all bundled up and wearing snowshoes, walking side by side and passing some time together in the snow.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "snowboard", "backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": "According to the picture, there is a ski slope with a snowy ground, and a group of skiers are visible on the slope. A man and a woman are among the group, and they seem to be skiing down the slope together. Other skiers can be seen scattered around the slope, enjoying the activity on this sunny day. There are also a couple of sets of skis in the scene, including one set of skis just below the snowy slope, and another set positioned closer to the right side of the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 9628, "caption": "According to the picture, it appears to be a man in his late teens or early twenties performing an impressive trick while riding on a wakeboard in the water. The man is in the air, soaring through the air, as he's enjoying the ride. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 330050, "caption": "According to the picture, the image features a man and a woman standing on a tennis court, holding tennis rackets and smiling at each other. They seem to be enjoying playing tennis together on the court. The court is not particularly crowded, as there are no other people in the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": "In the image, a man stands on a brick courtyard surrounded by tall buildings. He is looking up at a kite possibly taking a picture or flying it. The kite can be seen in the sky, positioned in the upper left portion of the image, and the person appears to be enjoying the activity in the middle of the city. The setting creates a lively atmosphere, with numerous buildings visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "According to the picture, there is a group of people in a room, likely a social gathering. They are standing around, talking and having a good time. Some of them are wearing ties. The room features multiple bottles, cups, and computer screens, indicating that the event may be centered around these items. There is a bottle near the right side of the scene, another close to the middle, and two more towards the left side. Additionally, there are cups placed near the computer monitors and the right side of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "mouse", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 163518, "caption": " According to the picture, there is a woman standing in a kitchen, working on preparing food in a dining table. She appears to be an employee at a restaurant, possibly a chef or cook, busy with tasks and making sure everything is organized and ready for service.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 156696, "caption": "The image features a black and white scene of a train station with a large, old-fashioned locomotive parked at a platform. There are two people standing near the front of the train, one on the right side and one on the left. There are two more individuals in the scene, one closer to the front of the train and one at the end of the train on the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "train", "handbag", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 525202, "caption": "According to the picture, the image features a large pizza on a table, with various toppings cut and ready to be served. The pizza is placed on a plate, and there are several people around the table, enjoying the food. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 423984, "caption": " The image features a plate filled with a variety of food items. The main ingredients include carrots, cooked pasta, and a bowl of salad. The carrots are arranged on the plate in a circular pattern around the pasta, with some towards the top and others at the bottom. The salad bowl contains a mixture of vegetables, including several broccoli florets and some more carrots, all on the left side of the plate. ", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [24, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 499486, "caption": "According to the picture, there are two cats sitting on a bed in a room. One cat is positioned on the left side of the bed, while the other cat is sitting on the right side. The bed is covered with sheets, and there is another sheet placed on the ground close to the bed. There are also other items on the bed, such as two pillows and a cell phone.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["book", "cat", "bed"], "hallucination_idxs": [76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": "The image features a young man sitting in the back seat of a car. He is wearing glasses and a tie, while also sporting a black shirt. The tie appears to be a purple color, adding a pop of color to the overall outfit. The man appears to be smiling for the camera, giving a joyful expression.", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [10], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 288729, "caption": "The image features a cake decorated as a circus-themed tent, with a white, yellow, and red cake shaped like a tent. The cake has a red base and a yellow canopy, and there are small dolls inside, sitting at a table and enjoying the circus atmosphere. Additionally, there are some figurines placed around the cake, contributing to the circus-themed appearance.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["teddy bear", "bear", "elephant", "cake", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.2}}, {"image_id": 177721, "caption": "The image captures a young man performing a skateboard trick on a ramp. He is actively balancing on the skateboard and showcasing his skills. The ramp seems to be a popular spot for skateboarding, with a few people nearby watching the performance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "In the image, a man with a beard and a cap, wearing a black outfit, is using his cell phone while standing in a public place. He appears to be focused on the device, possibly texting or checking his messages. There are a few other people in the scene, with one person standing in the background, another on the left side, and a third one on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "horse", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 252625, "caption": "The image depicts a city street with several people and vehicles. There are various shops along the sidewalk, and a few people are walking down the street. A few cars are also present, including one at the top of the picture and another further down. The street is lined with buildings and a person is visible in the distance, indicating a bicyclist. Several bicycles can be seen on the street, with one closer to the bottom of the picture and a couple more further up. Some bicycles are parked and others are in motion, adding to the scene's vibrancy.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["traffic light", "potted plant", "person", "bicycle"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 493610, "caption": "According to the picture, there is a large truck filled with people, including children, as it drives down a dirt road. The truck is surrounded by a large crowd of people, some of whom are standing and others are sitting on benches or on the ground. The scene appears to be crowded and engaging, with the people enjoying the outdoor gathering.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["truck", "person", "bus", "car"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 302789, "caption": " The image features a room with a small bed, a suitcase, and a backpack. The suitcase is placed on the floor next to the bed, while the backpack is situated on the left side of the room. The room appears to be a small kitchen area with a sink, a refrigerator, and an oven. Various utensils, including a knife, a spoon, and a bowl, are also present in the room. The items are neatly organized, with a focus on the backpack and suitcase in the scene.", "mscoco_hallucinated_words": [["oven", "oven"], ["knife", "knife"], ["spoon", "spoon"], ["bowl", "bowl"]], "mscoco_gt_words": ["suitcase", "handbag", "refrigerator", "backpack", "bottle", "sink", "bed"], "hallucination_idxs": [60, 67, 70, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.7142857142857143}}, {"image_id": 30667, "caption": "The image shows a black dog sitting on a person's lap, enjoying the presence of someone or something. Another person can be seen sitting next to the first person, and both of them appear to be on a boat or a large green seat. The people and the dog are surrounded by various books, suggesting that they might be engaging in a reading activity or just have books nearby.", "mscoco_hallucinated_words": [["seat", "chair"], ["book", "book"], ["book", "book"]], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [47, 58, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 1.0}}, {"image_id": 114108, "caption": "The image features a large pizza with a variety of toppings, cheese, mushrooms, and possibly vegetables, sitting on a dining table. The pizza is ready to be served to a group of people, who are gathered around the table, enjoying their food. The table is filled with various cups, forks, and knives, which are likely to be used for serving or eating the pizza.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 328110, "caption": "In the image, a woman is sitting on a bench with her baby, who is holding a doll. There are other people in the scene, including a man nearby. Numerous handbags are scattered around, some placed on the bench and others hanging from the hands of various individuals in the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 366111, "caption": "The image portrays a group of people riding their bicycles down a city street. There are multiple bicycles in the scene with multiple men visible riding them. Some of the riders are wearing hats, indicating the outfit may be formal attire.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 532481, "caption": "In the image, there is a man enjoying kite flying and riding a surfboard. The surfer is riding in the ocean, with the kite flown above in the sky. Another person can be seen on the beach, watching the action unfold. This scene appears to take place near the coast, likely in a beach town or near a residential area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "surfboard", "kite", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 291634, "caption": "The image features several motorcycles parked along a street, creating a unique sight. The motorcycles are lined up in a row, with some parked closer to the street and others further back. There are various makes and models on display, adding to the interest of the scene. The motorcycles are parked side by side, creating a visually interesting arrangement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 463678, "caption": "According to the picture, there is a man standing on a tennis court, playing tennis. He is wearing a white shirt and black shorts, and holds a tennis racket in his hand, preparing to hit the tennis ball. The image captures a moment during the man's tennis match, as he readies himself to make a play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "According to the black and white image, a dog is sitting on the sidewalk, waiting for its owner to return from wherever they might have gone. The dog is wearing a leash and is positioned in the middle of the sidewalk. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "person", "truck", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 367872, "caption": "According to the picture, an ongoing baseball game is taking place in the image. There is a baseball player in the scene, swinging a bat as he attempts to hit the ball. A crowd of people, likely teammates and supporters, surrounds the player. The crowd includes several individuals in various positions around the area, perhaps watching the game or waiting for their turn to play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 353898, "caption": " A person is sitting on a park bench surrounded by a large group of white birds, likely pigeons, in various positions. They are scattered throughout the area, some closer to the person and others further away. Some birds are sitting on the ground while others are flying above. The person appears to be reading a book, seemingly enjoying the surroundings.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["bench", "cell phone", "backpack", "bird", "person"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 88269, "caption": "The image features a delicious meal on a plate consisting of two slices of grilled cheese sandwiches topped with tomato soup, with a spoon and a knife nearby. The sandwiches are located near the edges of the plate, and the soup seems to be in the middle. There is a bottle of tomato sauce placed near the knife, possibly to be used as a condiment or as a dip for the sandwiches.", "mscoco_hallucinated_words": [["knife", "knife"], ["bottle", "bottle"], ["knife", "knife"]], "mscoco_gt_words": ["sandwich", "spoon", "bowl"], "hallucination_idxs": [27, 54, 61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6666666666666666}}, {"image_id": 463211, "caption": " The image shows a young girl standing next to a school bus, possibly waiting for it to pick her up or get on it. There is another girl, maybe her younger sister, in the picture. The kids are both wearing backpacks, one of them larger and the other one smaller.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": "The image features a beautiful blue vase filled with a variety of different flowers. The vase is surrounded by a large floral arrangement, creating a visually appealing and lively scene. There is a gold-painted chair located next to the vase, while another chair is placed in the room further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 98979, "caption": "According to the picture, there is a small, silver microwave oven sitting on a countertop in a kitchen. It is positioned near the left side of the kitchen, taking up a significant portion of the counter space. In the background, there are various kitchen appliances, including a refrigerator located in the center of the kitchen and two more bottles near the refrigerator.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["bottle", "bottle"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["microwave", "knife", "oven"], "hallucination_idxs": [54, 65, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.6666666666666666}}, {"image_id": 244455, "caption": "The image features two police officers riding motorcycles down a street. One motorcycle is closer to the right side of the street, while the other is closer to the left. They appear to be wearing helmets, indicating that they are riding with caution.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 379443, "caption": "The image features a pair of scissors in a unique position. The scissors are held in place on a black mesh sheet, possibly for craft or sewing purposes. It appears to be a pair of household scissors, with the handles and blades creating a symmetrical shape. The scissors are both near each other and occupy a significant portion of the image's area, creating a distinct visual aspect.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 536154, "caption": "The image depicts a young boy sitting on the floor surrounded by a large pile of stuffed animals. The boy is holding a teddy bear, which is the centerpiece of the pile. There are many other stuffed animals of various sizes and types around the young boy, creating a large collection of toys.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 374061, "caption": "According to the picture, there is a black cat sleeping on a bed, with a teddy bear, possibly a stuffed animal, resting on top of the cushion next to the cat. The cat and the teddy bear appear to be in a comfortable position while cuddled together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "cat", "bed", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 560018, "caption": "According to the picture, it depicts a group of people enjoying the ocean waves, some riding surfboards, while others are standing in the water. The people are spread out across the ocean, with some closer to the shore and others further out. The scene captures the beauty of the ocean and the skillful navigating of the surfers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353320, "caption": " The image depicts a bench with a guitar on it, sitting on the sidewalk outside a parking garage. The bench is surrounded by a number of luggage bags, creating a scenery where various people have placed their belongings. There are at least three backpacks and two handbags placed around the bench, with one of them being closer to the guitar. The presence of multiple suitcases may indicate a busy location or travelers who have temporarily stored their luggage nearby.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 256628, "caption": "According to the picture, a large, blue dog is laying on a wooden table. The dog is enjoying its time on the table and appears to be relaxed. There are two chairs nearby, one on the left side and another on the right side, offering seating options for people or additional seating for the dog. Additionally, there is a bowl positioned towards the left edge of the table, possibly holding a beverage or a treat for the dog.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["person", "person"], ["table", "dining table"]], "mscoco_gt_words": ["bowl", "chair", "dog"], "hallucination_idxs": [15, 25, 54, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 158806, "caption": "In the image, there is a person in the background, and a large brown and black dog is the main focus of the scene. The dog is sitting on a blue plate that has two slices of toasted sandwiches on it. The person in the background appears to be holding a sandwich as well, suggesting that there may be a connection between them and the dog. There is another person slightly off-center to the left side of the image. The people and the dog seem to be engaging in a social event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 424270, "caption": " The image shows a young girl sitting on a bed in a room. She is positioned in the center of the bed, looking directly at the camera with a smile on her face. The bed is a large size and takes up a significant portion of the room. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": "The scene features a horse standing in the middle of a parking lot surrounded by multiple vehicles of various types, including cars and trucks. Some of the cars are parked next to each other, while others are spread across the parking lot. The horse is standing out due to its brown color amidst the cars and trucks of various sizes and colors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 483227, "caption": " The image shows a group of young children gathered on a tennis court, holding tennis rackets and smiling. There are ten children in total, ranging in age and positioning themselves in various spots on the tennis court. Some children are standing closer to the center of the court, while others are positioned towards the sides or back of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228306, "caption": "The image features a blue double-decker bus driving down the street, close to the sidewalk. This bus is a public transportation, with various people visible on the street, waiting at bus stops. There are several bus stops scattered throughout the scene, with some located near the bus itself. The bus is traveling towards the right side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 416384, "caption": "In the image, a woman is sitting on a large white chair, likely a picnic chair, holding a cell phone in her hand while she is taking a break. There are multiple other chairs and benches surrounding her, with some chairs appearing on either side of the woman, and another bench in the background. ", "mscoco_hallucinated_words": [["bench", "bench"], ["bench", "bench"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [39, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 364783, "caption": "The image depicts a bathroom with red walls, a white toilet, and a sink. The bathroom is clean and well-maintained, featuring a minimalist design. The white toilet sits under the sink and is situated on the left side of the image, while the sink is on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "potted plant", "bottle", "toilet", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 335041, "caption": "In the image, a person is walking down a snow-covered street wearing a red jacket and carrying a pair of skis on his back. The snow piled high along the sides of the street highlights the efforts of the snow removal crews. Several cars are parked on the street, adding an element of urban scenery. Some of the cars appear closer to the front of the image, while others are further back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 531327, "caption": "According to the picture, the image features a bowl filled with a variety of vegetables, such as broccoli, carrots, and green beans. The bowl is placed on a dining table, and there are several knives around it. The table is set for a meal, with a fork and a spoon present.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot"], "hallucination_idxs": [9, 28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.8571428571428571}}, {"image_id": 451228, "caption": "According to the image, there is a busy city street filled with people. The street is lined with tall buildings, and the city is decorated with various lights and wreathes, creating a festive atmosphere. There are multiple red double-decker buses on the street, as well as some pedestrians, including a couple in the middle of the image. The street has a mix of day and nighttime lighting, creating an interesting scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256365, "caption": "According to the picture, there are several cupcakes on a plate. The cupcakes are decorated with white frosting, creating a visually appealing display. Each cupcake is neatly placed on the plate, and they are arranged in an alternating pattern. There are eight cupcakes in total, with five cupcakes on the left side, two in the middle, and three on the right side. The cupcakes are different, creating a visually interesting display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "cup", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": " According to the picture, a man is playing tennis on a court. He is holding a tennis racket and preparing to hit the ball with a tennis swing. He is wearing a white shirt and black shorts, and he is standing on the tennis court. A few other people can be seen in the vicinity, likely participating in the event as well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, a small dog is lying on top of books in a bed, surrounded by various books and other objects. The books are spread out in different parts of the scene, both in the background and the foreground, providing a cluttered and interesting environment for the dog to sleep in.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 51054, "caption": "A woman is sitting on a bed, talking on her cell phone. She is wearing pajamas and appears to be in a bedroom. Behind her, there is another bed with two bedposts, and in the room, there are two bottles. The image appears to be in black and white, as it features the woman sitting in a more blurry background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bottle", "cell phone", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 1268, "caption": "According to the picture, there is a group of people standing around, sitting, and walking along the side of a large water body. Among the crowd, a woman appears to be taking a picture of the water. There are a few other individuals in the scene, including a man sitting on the ground towards the left, and two people on the right side. All the individuals seem to be enjoying their time, appreciating the beauty of the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "backpack", "bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 62692, "caption": "The scene takes place on a tennis court where a young woman is playing a game of tennis. She is holding a tennis racket and has just started to swing the racket to hit the tennis ball. The ball is located near the center of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": " According to the picture, there is a black and white photograph depicting a woman sitting at a table. Surrounding her, there are various chairs, arranged in different positions. In the space, there are several vases, one in the center and others scattered around the room. Additionally, a bottle and a book can be seen, possibly belonging to the woman. The room has an artistic, vintage look with various decorations and furnishings.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["dining table", "potted plant", "vase", "book", "chair", "person"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.8333333333333334}}, {"image_id": 575406, "caption": "In the image, a blue and white bus-style vehicle is parked on the side of a road, with several people walking around it. There are multiple people visible in the scene, with some standing near the bus and others walking further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bus", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 244834, "caption": "According to the picture, a woman is walking down the street, holding an umbrella to protect herself from the rain. She is accompanied by a man, who is also carrying an umbrella. Both individuals are dressed in blue and yellow attire, and are walking down the sidewalk, hand in hand. Another woman is also visible in the scene, holding an umbrella and wearing a hat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "clock", "handbag", "umbrella", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 151159, "caption": " The image depicts a park setting with a skateboard park located within it. Two men are present in the scene, both riding skateboards, with one of them wearing a hat. There are also several benches placed around the skateboard area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 277073, "caption": "In the image, a man and a woman are riding a motor scooter down a busy street. The man is driving the scooter, while the woman is sitting behind him. They are surrounded by various vehicles, including cars and trucks, which are also traveling on the street. Some of the cars are parked, while others are moving. The street appears to be quite busy, with many vehicles and pedestrians present.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["bicycle", "motorcycle", "car", "backpack", "person"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 377879, "caption": "According to the picture, the scene features a large number of tanks, which are being transported on a flatbed trailer. There are many tanks parked on the train, occupying different spots and positions along the length of the train. Some of the tanks are positioned in the middle, while others are closer to the front and the back of the train. The train appears to be moving, likely on a track.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": "According to the picture, the room features a desk with a small laptop and a few other items arranged neatly on it. The desk has a potted plant positioned to the right of the laptop, as well as various notes and documents scattered throughout the workspace. The room is also equipped with a chair near the desk, giving the impression of a small office set-up.", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["laptop", "mouse", "chair", "dining table"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 63934, "caption": "According to the picture, a nun is skateboarding down a street in the city. She is wearing all black and riding her skateboard just outside the car parked nearby. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 275611, "caption": "The image features a slice of pizza on a plate, ready to be enjoyed. The pizza is placed on a dining table, giving it an inviting atmosphere for people to gather and eat.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["pizza", "knife", "dining table"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "The image features a person sitting at a desk with two keyboards and a mouse. The person is in the process of using a computer mouse. There is another mouse sitting on the keyboard, suggesting that the individual might be working on a computer or engaged in some other activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": " The scene captures a man driving a bus along a winding road while herding a group of sheep. A group of at least 10 to 15 large sheep can be seen scattered across the image, with some in the front of the bus and others following behind. One of the sheep appears to be standing on a car, as if the animal were somehow on top of the vehicle. The bus appears to be driving on the road while the man is controlling the vehicle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sheep", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 422326, "caption": " The image features a parking lot with several motorcycles parked on the pavement. There is a motorcycle on the right side of the parking lot, and a motorcycle with a sidecar parked next to it. In addition to these motorcycles, there are at least six other motorcycles parked around the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 110449, "caption": "In the image, two men are standing next to each other behind a counter, holding remote controls in their hands. They both wear headphones, indicating that they might be involved in music-related activities or entertainment. One man is standing closer to the camera, while the other man is positioned next to him. A cell phone can be seen on the counter in front of the men.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["remote", "tv", "dining table", "laptop", "person"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 45550, "caption": " The image features a man who appears to be taking a selfie, smiling and posing with food in his hand. He is holding a piece of fried fish, likely [IDK] fried, and has a cup of soup alongside the fried fish. ", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["sandwich", "clock", "person", "bowl"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 444913, "caption": "According to the picture, the scene depicts a man standing in a grassy field holding a baseball and preparing to swing at it. He is wearing a baseball cap, appears to be enjoying his time in the park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": " In the image, a black and white dog is jumping up into the air while trying to catch a frisbee in its mouth. The dog is surrounded by people who are either watching the event or possibly getting ready to throw frisbees themselves. There are several people standing throughout the area, with some near the frisbee, others more towards the left side, and a few scattered throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle", "frisbee", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 132796, "caption": " In the scene, two people \u2013 a man and a woman \u2013 are sitting on elephants, which appear to be comfortable seats. They are enjoying their time together as they ride through a grassy area. There is another elephant in the background, but it doesn't have any people on it at this moment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "elephant", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 303099, "caption": "The image shows two cats sitting on a toilet, with one cat on the left side and another cat on the right. They are placed on top of the toilet seat, looking down. The bathroom setting is adorned with various decorative patterns, and there is a toilet nearby the cats, likely to the left of them. Additionally, a shower can be seen in the background of the scene, on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": "According to the picture, a group of people is riding motorcycles down a street. There are at least four motorcycles visible in the image, with some of the riders wearing helmets. The motorcycles are parked on the side of the road, with some of them closer to the center of the street, and others positioned closer to the edges. The riders seem to be travelling through the road, possibly enjoying a ride together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 205193, "caption": " The image depicts a kitchen countertop with a tea kettle and some tea on one side, and a toaster and some oranges on the other. The tea kettle is positioned close to the toaster and seems to be next to the oranges.", "mscoco_hallucinated_words": [["orange", "orange"], ["orange", "orange"]], "mscoco_gt_words": ["toaster", "microwave", "oven", "person"], "hallucination_idxs": [22, 43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 272566, "caption": " The scene features a man sitting outside on the ground in front of a fruit display, surrounded by bananas. He is holding a banana in his hand, seemingly enjoying the fruit. The fruit display includes multiple bunches of bananas, and some are placed on the ground while others are in baskets. There is also a chair in the background of the scene.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "apple", "person", "banana"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 170099, "caption": "In the image, a man, possibly a politician, is standing in front of a couch and giving a speech or addressing a crowd. He is wearing a red tie and holding a sign, gesturing as he speaks. There is a large picture or a poster visible in the background, likely featuring the man or his message.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "orange", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 14940, "caption": " According to the picture, the scene features an orange cat and a small stuffed dog lying together on a couch. The cat is curled up around the stuffed dog, giving the appearance that they are cuddled up. There is a blue pillow on the couch, and another cat is situated in the background, closer to the top of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "cat", "orange", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 408449, "caption": "According to the picture, there is a man riding a skateboard down the sidewalk in a city area. The man is skateboarding alone, without any companions in sight. The scene is mostly empty except for the man and the skateboard he is riding.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "bus", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 140702, "caption": " A baby elephant is walking through a grassy field, accompanied by its mother. The baby elephant is small and cute, while the mother is larger and more prominent. There are also several birds present in the scene, scattered throughout the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 354540, "caption": "According to the picture, there is a man sitting at a kitchen counter, working on a laptop computer. The laptop is connected to a toaster oven, which is plugged into an outlet on the counter. The man is also holding a pencil in his hand. There are two other individuals in the kitchen, but they are not as prominent in the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toaster", "oven", "laptop", "cell phone", "microwave", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 19763, "caption": " The image features a piece of chocolate cake on a plate, sitting on a restaurant table. The cake has multiple toppings, likely including chocolate and other ingredients, and is accompanied by two beers. One beer is placed closer to the center of the table, and the other is placed more towards the left side. The table is set with silverware, including spoons, which are strategically placed around the table, allowing for easy access during the meal.", "mscoco_hallucinated_words": [["cake", "cake"], ["cake", "cake"], ["spoon", "spoon"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "chair", "person"], "hallucination_idxs": [7, 19, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.14285714285714285}}, {"image_id": 191301, "caption": "The image features two pieces of chocolate cake, accompanied by ice cream, placed on a white table. There are two spoons present, one near the left side of the plate and the other slightly to the right. The table is placed on a countertop or a dining area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124236, "caption": "The image features a kitchen with various cooking appliances and objects. A stove and oven are placed in the room, along with a refrigerator and a sink. A table sits against the far left side of the room, filled with bowls, bottles, and other kitchen items. There are several chairs positioned in the kitchen, with one located at the front left corner, another near the middle left side, and a third situated towards the back.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["sink", "sink"], ["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["toaster", "oven", "dining table", "bowl", "spoon"], "hallucination_idxs": [25, 28, 46, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6}}, {"image_id": 344930, "caption": "According to the picture, there is a red fire hydrant sitting on the grassy area, surrounded by trees nearby. A car, possibly a truck, can be seen parked in the distance on the right side of the image. The car is parked on the side of the road, which is also grassy. A few other vehicles are visible along the street, including a truck closer to the left side of the image, and another car on the right side, behind the fire hydrant.", "mscoco_hallucinated_words": [["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["fire hydrant", "orange", "car"], "hallucination_idxs": [27, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": "According to the image, a cat is laying on a bed in a bedroom, surrounded by books and other personal items as decorations. The cat appears to be enjoying its time in the bedroom, looking at the camera. The bedroom has several furnishings, including a few books and a television located in the back corner of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "cat", "chair", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 82488, "caption": "The image features two beds in a hotel room, each one with a single pillow, and they are positioned next to each other. The beds are neatly made, with blankets and sheets arranged nicely. There are no people in the room. The walls and ceiling are painted with a warm, orange color. The hotel room is relatively clean and organized.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["remote", "clock", "dining table", "orange", "book", "bed"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 348251, "caption": "A woman is skiing down a snowy slope, wearing a backpack and holding a pair of skis. She is smiling and enjoying the experience. In the background, multiple other skiers can be seen skiing on the slope. One person is at the top of the hill and another is farther down, both skiing in different directions. The scene conveys the fun and excitement of skiing during wintertime.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 150703, "caption": "According to the picture, a hot dog is being served alongside a beer, which is placed in a cup. The hot dog is the main dish and is sitting in the middle of the table. The beer and the hot dog are two popular food items that can be commonly found in many cafes and restaurants. The image gives the impression that it is a casual and relaxed setting with a tasty treat of a hot dog and a beer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "cup", "hot dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 381615, "caption": "According to the picture, the image features a room filled with various plants. There is a beautiful garden visible through the window, showcasing a multitude of plants arranged throughout the space. The room is further furnished with a couch and a chair, making it a comfortable spot to relax and enjoy the greenery. Plants are present on both the wall and floor, further enhancing the natural appeal of the room.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["cup", "potted plant", "vase", "book", "chair"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 352703, "caption": "According to the picture, there is a woman in the image who is preparing food in a wok over a grill. She is wearing an apron and is surrounded by several other people, who are standing in different positions around her. There are also a few tables placed on the outdoor scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "handbag", "oven", "dining table", "bowl", "spoon", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 111109, "caption": "This image features a young boy standing on a stool, possibly posing for a photograph, wearing a suit and tie. The black-and-white photo captures the boy smiling and enjoying the moment. The scene is set in a room with a couch, situated at the right side of the image, and a chair located in the lower left corner. The room also has a TV in the upper left corner. A few books are scattered around the room, with one close to the couch and another near the chair. ", "mscoco_hallucinated_words": [["couch", "couch"], ["book", "book"], ["couch", "couch"]], "mscoco_gt_words": ["tv", "tie", "motorcycle", "chair", "person"], "hallucination_idxs": [44, 79, 91], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.8}}, {"image_id": 226802, "caption": "According to the picture, the scene is of a busy city street on a wet day. There are a number of buses traveling along the road, as well as a few cars. People are walking and crossing the street in large numbers. The scene appears to be lively and busy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 49005, "caption": "The image features a baseball game in progress. Several players can be seen spread out across the field, both playing baseball and cheering on their teammates. One player stands out, as he is about to swing a baseball bat during a game. Other players in the field include baseball players, with one player standing closer to the batter and another player a bit further back. They are all focused on the upcoming play, enjoying the sportsmanship, and eagerly anticipating the outcome of the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 410114, "caption": "The image features a group of cows walking down a dirt road, with a horse in front of them. The cows are spread out along the road, with some of them closer to the horse and others further away. The horse appears to be leading the group of cows, while they are all walking along the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 100274, "caption": "According to the picture, it depicts an extensive rail yard filled with multiple trains. The trains are of various sizes and are positioned throughout the area. Some trains are positioned in the foreground, while others are positioned in the background. There are eight trains visible in the image, with some arranged in a row and others scattered in different parts of the rail yard. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 317254, "caption": "In the image, a person wearing a helmet is riding a motorcycle down a city street. He appears to be in motion, turning in either direction. There is a car behind the motorcycle, possibly parked near the curb or traveling in the same direction. In the background, another car can be seen, moving along with the first vehicle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": "The image features a large, messy kitchen with various silver items across the countertops. The kitchen is filled with utensils, including knives, bowls, and spray bottles, creating a cluttered and busy scene. The silver items are positioned throughout the space, including various sizes of metal spoons, several bowls, a spray bottle, and multiple utensils, such as a knife and a spatula. The mix of metal and utensils gives the image a professional and industrial feel.", "mscoco_hallucinated_words": [["knife", "knife"], ["bottle", "bottle"], ["spoon", "spoon"], ["bottle", "bottle"], ["knife", "knife"]], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [24, 30, 53, 60, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.7142857142857143, "Recall": 0.3333333333333333}}, {"image_id": 28842, "caption": "According to the picture, it appears to be a sunny day at a skate park, as a skateboarder is enjoying their time there. There are several other people scattered around the park, some of whom are watching the skateboarder as they perform their trick. In total, there are six people in the image, with some standing closer to the middle and others further along the park's edges. The park itself is surrounded by a fence, providing a sense of containment and safety for the skateboarders.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 319607, "caption": "According to the picture, a traffic light in a city street is red, indicating that there is a restriction. There are several cars in the area, some parked in the street and others in the distance, as well as people on the sidewalks. In addition to the traffic light, there is also a stop sign. Among the people present, one person is wearing a helmet while riding a bike, green, indicating that it is safe for cyclists to proceed.", "mscoco_hallucinated_words": [["stop sign", "stop sign"]], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "car", "backpack", "person"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 210372, "caption": "The image displays a small kitchen with red walls, cabinets, and a sink. The sink is located in the middle of the kitchen, near the left side, and appears to be in working condition. The kitchen has an open layout, making it feel spacious and airy. There is also a window on the left side of the kitchen, letting in natural light. The kitchen is quite empty, with no appliances or other items to be seen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 156915, "caption": "According to the picture, there is a bouquet of flowers placed in a vase on a table. The bouquet features a red rose, which is the focus of the arrangement, as well as a mix of various flowers. Among these flowers are several daisies, contributing to the overall colorful composition. The arrangement is set against a darker background, creating a visually appealing scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 349642, "caption": "According to the image, a man and his dog are sitting on a bench, with the dog located closer to the right side of the scene. The man and the dog appear to be enjoying each other's company while sitting on the bench.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": " The image depicts a scenic waterfront with numerous boats, including small boats and larger ones, lined up along the edge of a body of water. The boats are docked in various locations, possibly waiting to board or engage in various activities. People are spread across the scene, engaged in tasks, such as talking and walking.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "umbrella", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 45435, "caption": "The image depicts a person's hand holding a cell phone, which is positioned in the center of the frame. The phone appears to be a small smartphone. The person's hand is quite prominent, making the phone the main focus of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": " According to the picture, there are two computers in the room, one laptop and one desktop. A man is sitting at the desktop, working on a laptop computer, which is placed on the right side of the room. He appears to be focused on the laptop and is likely using it to write, work, or perform other computer-related tasks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 72656, "caption": " The image features three young boys, two of them sitting on chairs at a small table and one standing near them. They are enjoying their time together, engaging in conversation while eating food and drinking from a bottle. The food appears to be pizza, sliced and placed on plates. There is also a fork placed on the table.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["fork", "fork"]], "mscoco_gt_words": ["pizza", "suitcase", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [40, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 372087, "caption": "The image features a green and white train traveling down the tracks in the middle of a city, surrounded by a platform for passengers. The train is passing by a station, and there are several people standing on the platform, waiting for the train to arrive. Some of the people are standing on the left side, while others are positioned in the middle or right side of the platform. Additionally, there is a bicycle visible near the left edge of the image, perhaps belonging to one of the people waiting for the train.", "mscoco_hallucinated_words": [["bicycle", "bicycle"]], "mscoco_gt_words": ["train", "suitcase", "bench", "person"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5}}, {"image_id": 336209, "caption": "According to the picture, a young man is performing a trick on a skateboard, jumping over a small ramp. He is wearing a black shirt and appears to be enjoying the activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 46031, "caption": " The image features a laptop computer, with its keyboard sitting on a desk next to it. The laptop appears to be turned on, with an image or photo displaying on its screen. In the vicinity of the laptop, there are several other items on the table, such as a mouse, multiple keyboards, and a phone. The phone is placed further down towards the left side of the desk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 283118, "caption": "According to the picture, there is a family of four sitting at a table, enjoying a meal together. The family includes two children and two adults. All of the individuals are smiling and appear to be having a good time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "handbag", "dining table", "bottle", "cell phone", "car", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 429598, "caption": "In the image, a kitchen has been constructed using brick and has a refrigerator in it. A small refrigerator is positioned to the right, partially visible from the front. There is a stove in the kitchen area, also visible from the front. On the right side, there is an oven with a sink next to it. The sink is positioned slightly above the oven.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 333365, "caption": "According to the picture, there are several airplanes parked on the tarmac at an airport. There are at least four airplanes visible in the scene, with one of them being a red plane. The airplanes are parked in various positions, some closer to the others further away. The airport appears to be a busy terminal, with a lot of activity happening around it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 424174, "caption": "The image displays a large wooden table filled with an array of delicious fruits, primarily apples and bananas. A variety of apples can be seen scattered throughout the table, with some placed close together, while others are positioned further apart. There are many bananas of different sizes, placed both in the center of the table and near its edges. Some bananas are stacked on top of each other or positioned next to other fruits. This abundance of fruit creates a festive and appetizing scene, showcasing the variety of food available.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "apple", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 3035, "caption": "The image depicts a large airplane parked on the tarmac at an airport. The airplane is surrounded by numerous people, some of whom are standing near the airplane, while others are walking around it. The scene appears to be an ordinary day with people going about their business at the airport.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 177213, "caption": " The image features a delicious-looking pizza placed on a plate, which appears to be topped with various cheeses and possibly meat, sitting on a table. There are three forks on the table, one near the edge, another in the middle, and the last one close to the pizza. The forks might be used to eat the pizza or for serving it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 438535, "caption": "According to the picture, there is a large clock located in the middle of a city, surrounded by a large number of people and tall buildings. The clock appears to be black and has two clock faces, one on top and the other on the side. The city scene is likely a popular spot for events or gatherings, as several people are milling about the area, going about their daily business. Some individuals are holding cell phones, while others are standing closer together or further apart, creating a diverse crowd.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 490711, "caption": "This image features a clean white kitchen with various appliances. A white refrigerator is situated on the left side of the room, while a sink is located on the right side. A microwave oven is also visible, positioned closer to the refrigerator on the left side of the kitchen.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["oven", "oven"]], "mscoco_gt_words": ["sink", "refrigerator", "banana"], "hallucination_idxs": [35, 36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "According to the picture, there is a small white cat sleeping on a table. The cat is lying down and resting its head on the edge of the table. The cat appears to be very comfortable in its sleep.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "laptop", "mouse", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 456756, "caption": "The image features a large giraffe standing in a zoo or enclosed area, surrounded by several zebras. The giraffe is the main attraction, towering over the other animals. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": "The image features a large room with various seats and chairs arranged throughout. There are two chairs positioned in front of the middle of the room, a couple of blue seats with armrests near the wall, and a few more chairs towards the back. Several people can be seen occupying some of the seats, making the scene feel somewhat busy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "chair", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 343937, "caption": "In the image, a woman is snowboarding down a snow-covered slope. She is wearing a white top and is positioned on the snowboard, looking to the right. The snowboard is in the foreground, with the woman in the background. The image captures the excitement of the woman snowboarding on the snow-covered slope.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": " The image depicts a small bird, likely a couple of sparrows, sitting on a dining table next to a piece of pizza, surrounded by various bottles of liquid, possibly wine or sauce. The bird appears to be enjoying its meal, or just taking a rest in the vicinity of food and drinks. The scene suggests a casual, informal setting, where pizza and wine might be served for a social gathering.", "mscoco_hallucinated_words": [["pizza", "pizza"], ["pizza", "pizza"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "bird", "chair"], "hallucination_idxs": [23, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6}}, {"image_id": 145666, "caption": "The image features a pan filled with a variety of food, specifically broccoli and carrots. There are numerous broccoli florets scattered throughout the pan, along with several carrots of different sizes. The food appears to be fully cooked, as it is being prepared to be served. The pan is positioned on a stove top, likely in a kitchen, as food preparation and cooking takes place on this appliance.", "mscoco_hallucinated_words": [["stove", "oven"]], "mscoco_gt_words": ["bowl", "carrot", "broccoli", "dining table"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 304984, "caption": "The image displays a plate of food on a table, which consists of sandwiches, pickles, and other items. There are two sandwiches present on the plate, with one sandwich situated closer to the right edge of the plate, and the other sandwich placed near the left side. In addition, there are a few pickles located on the plate, arranged in a mixed fashion.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "sandwich", "cake", "fork", "orange", "wine glass"], "hallucination_idxs": [9], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.16666666666666666}}, {"image_id": 122467, "caption": "A person is holding a Nintendo Wii controller in their hand, likely playing a game. The image features a white Wii remote, which has been placed on the hand of the person. The location of the hand is near the center of the picture. The person's arm can be seen on the left side of the frame, and a chair can be spotted in the background.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["remote", "dining table", "person", "couch"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 518287, "caption": "According to the picture, the image features a smiling face on a toothbrush, sitting in an orange surface. The smiling face is created by two holes drilled into the toothbrush, one on each side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 145448, "caption": "The image shows a kitten sitting on a large dining table, presumably placed there for comfort, as it lays on top of the dining table's blanket. A motorcycle and a car are also present in the scene. The motorcycle is closer to the foreground, while the car is further in the background. Several other people can be seen in the scene, some of them closer to the motorcycle, while others appear to be standing further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "motorcycle", "cat", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 352900, "caption": " In the image, a plate of food can be seen on the dining table. The food appears to consist primarily of pasta and vegetables, with broccoli being prominently displayed on the plate. There is a close-up of the pasta and broccoli in the center of the dish, highlighting the various noodles and greenery. There is also a bottle of a drink or a wine nearby, likely accompanying the meal.", "mscoco_hallucinated_words": [["table", "dining table"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "fork", "knife", "broccoli"], "hallucination_idxs": [14, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 153607, "caption": " The image consists of a collage of pictures featuring blue vases. One of the vases has a bright yellow flower floating in the water, creating an interesting visual effect. Additionally, there are four vases in the picture, each containing water and placed on a table. Some of the vases appear closer to the top of the table, while others are more central or towards the bottom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 157581, "caption": "The image showcases a variety of wine bottles arranged on a dining table. There are at least 10 wine bottles of different sizes lined up, with some closer to the foreground and others farther back in the image. The bottles are arranged in various positions, showcasing the different kinds of wines available.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 227438, "caption": "The image features a large, well-decorated bed in a room. The bed is situated in the middle of the room, surrounded by a few chairs and decorative elements. The bed is well-lit, with lamps providing a warm atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "couch", "bed", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 116037, "caption": "In the image, a young girl swinging a baseball bat, preparing to hit the ball during a game. She is surrounded by a group of people, including a baseball player, who are watching her. Several people are in the scene, including an additional player, a sports coach, and others who are potentially teammates or family members.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 202658, "caption": "A toilet is sitting on a street, with an accompanying image on the toilet bowl, depicting a cute smiling face. This image has been placed on the toilet seat, possibly as a fun touch or to create a decorative feature. The toilet is surrounded by a metal cover on the ground, which likely serves as a manhole cover.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 18150, "caption": "According to the picture, there is a man sitting on a bed with a young boy, and they are both enjoying a slice of pizza. The man appears to be feeding the pizza to the boy, making it a fun and enjoyable bonding experience. The two individuals are sitting close to each other, with the man on the right side and the child on the left. The food item is on the right side of the bed, with two slices visible, one closer to the left and another closer to the right side. ", "mscoco_hallucinated_words": [["bed", "bed"], ["bed", "bed"]], "mscoco_gt_words": ["pizza", "bottle", "backpack", "person", "couch"], "hallucination_idxs": [12, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 418711, "caption": " In the image, a person riding a bicycle down a busy street, transporting a large quantity of bananas on the back of his bike. The bananas are visible in the back of the bike, and the person is wearing a helmet, indicating a safe mode of transportation. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "car", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image features a green bowl filled with a broccoli soup. The soup is topped with a generous amount of broccoli, adding to the flavor and texture of the dish. There are several pieces of bread in the bowl.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 491000, "caption": "In the image, a busy street is filled with people walking and riding bicycles. There are several individuals walking down the sidewalk, some being followed by others. Some of the bicyclists are riding side by side, creating a lively atmosphere. The street is lined with trees and buildings, and a car can be seen parked or driving by. This creates a typical urban setting with people and bicycles.", "mscoco_hallucinated_words": [["bicycle", "bicycle"], ["bicycle", "bicycle"]], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "car", "person"], "hallucination_idxs": [14, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 401163, "caption": "According to the picture, there is a man sitting on a park bench in a park. Another man is standing nearby, enjoying the outdoor scene and observing the area. The bench appears to be located in a nicely landscaped area with a variety of trees in view. In addition, a couple of potted plants are placed close to the park bench where the man is sitting. The potted plants and park bench are central to the scene, surrounded by multiple trees and other park furniture in the background.", "mscoco_hallucinated_words": [["potted plant", "potted plant"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [58, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 39663, "caption": " The image shows an array of various items arranged across the floor in an organized manner. There is a black backpack on the ground, laying on its side. The backpack contains different items, including a laptop, a few books, and several scattered items such as a cell phone, a small box, and a couple of handbags. ", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["laptop", "book", "cell phone", "backpack", "scissors"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8}}, {"image_id": 268293, "caption": "According to the picture, a man is playing tennis on a court, holding a tennis racket and swinging it to hit a tennis ball. He is wearing a yellow shirt and tie, and appears focused on the ball. The tennis court is surrounded by chairs, some of which are occupied by people who could be spectators or other players.", "mscoco_hallucinated_words": [["tie", "tie"], ["chair", "chair"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [34, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 474499, "caption": "The image features a blue kitchen with a sink and two ovens. The kitchen is filled with appliances, including a refrigerator, an oven, a sink, and a dishwasher. There are also two cats visible in the scene. The kitchen is well-equipped and ready to be used.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["cat", "cat"]], "mscoco_gt_words": ["clock", "oven", "spoon", "bottle", "vase", "apple", "sink"], "hallucination_idxs": [22, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2857142857142857}}, {"image_id": 358086, "caption": " In the image, a group of people is playing a game of ultimate frisbee on a field. They are actively engaged in the game, with some of them jumping in the air to catch the frisbee. People are spread out across the field, with some standing closer to others and others further back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "frisbee", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 89541, "caption": "The image showcases a plastic container filled with various types of food. Among the different items, there are some strawberries and muffins arranged in different areas of the container. Some strawberries are on the right side, while others can be seen towards the left side. A few muffins are scattered throughout the container. This diverse selection of foods creates a delightful visual display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "cake", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 239445, "caption": "The image captures a man wearing a white shirt and holding up a frisbee in each hand. He is smiling and standing on a grassy field, posing for the picture. The frisbees he is holding seem to be identical, and he is showcasing them as a part of his smile or pose. The man appears to be enjoying the moment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": "According to the picture, a giraffe and some cars are seen in a park, surrounded by a vast green field. The giraffe is standing in the grass, on its hind legs, while its kite is flying high in the sky, capturing attention in various locations around the field. A few cars are parked outside the grassy area, providing a unique scene for both the giraffe and the park setting.", "mscoco_hallucinated_words": [["giraffe", "giraffe"], ["giraffe", "giraffe"], ["giraffe", "giraffe"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [6, 24, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 487990, "caption": "According to the image, there is a person sitting at a table, holding a guitar and enjoying it. This person is surrounded by various items, including a variety of knives, forks, and spoons, spread out on the table. There is a bowl and a spoon in the middle of the table, and several other bowls scattered throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cake", "fork", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 138834, "caption": "In the image, a man is walking down a snowy street while carrying two pairs of skis, with one pair attached to the back of his backpack and the other on his head. He appears to be getting into a truck, possibly to start a trip or return home. The truck is parked on the right side of the street, and another car can be seen on the left side. The scene creates a winter atmosphere, with snow covering the ground and the man walking through it.", "mscoco_hallucinated_words": [["backpack", "backpack"], ["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [28, 43, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 1.0}}, {"image_id": 492057, "caption": "According to the picture, the image features a man standing in front of a mirror, holding a cell phone to his face and smiling. The man is wearing a jacket, a tie, and a baseball cap, which is likely a part of his outfit. The man is standing in the center of the image and the mirror reflects his appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388215, "caption": "In the image, a man is playing tennis on a court, holding a tennis racket and preparing to hit the ball. He is wearing a white shirt and black shorts. The court is surrounded by safety fencing, possibly indicating the location of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 111409, "caption": "According to the image, there is a man riding a motorcycle down a cement walkway, possibly on the sidewalk. The man is wearing a helmet and appears to be sitting on the back of the motorcycle. In the background, there are other people in the image, but they appear to be standing and not riding motorcycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": "According to the picture, the image features a bouquet of green flowers, which is being held by a person. The bouquet is made up of a combination of flowers, including broccoli, cabbage, and cauliflower. The person holding the bouquet is standing in the middle of a market or store.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92462, "caption": " The image features a line of cows walking down the street, forming a herd as they make their way to the other side. There are multiple cows spread out along the street, with some near the center and others closer to the left and right sides. The scene is set in the midst of a city environment, with a car visible near the back of the herd.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cow", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 397268, "caption": "According to the picture, there is a bicycle shop in front of a building with several bicycles of different types on display in the window. In total, there are three bikes displayed, including a single bike, a bicycle pair, and a single bike. There are also two benches, one at the left side of the store and another one towards the middle, creating an inviting atmosphere for potential customers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "chair", "potted plant", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 127659, "caption": "According to the picture, it is a sunny day, and a large train is traveling along the train tracks. The train appears to be a passenger train, with a colorful front car. There is also a person sitting on a bench near a train station, waiting for the train to arrive. She is accompanied by a dog, which is sitting next to her. The train is visible in the background, and there are several other people around the station, some of whom are standing and others are sitting on benches.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["train", "person", "bench", "dog"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 1.0}}, {"image_id": 251857, "caption": " In the image, a man is seen skateboarding in a park, while another person is watching him from the sidelines. There are three people visible in the park: the skateboarder, the observer, and another person further back, perhaps just enjoying the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 243212, "caption": " In the image, a young boy is standing in a field, holding a blue frisbee in his hands. The frisbee is positioned in front of the boy, and he is likely waiting for someone to throw it or catch it during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "frisbee", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 444304, "caption": " According to the picture, a man is sitting on a bed, playing with a video game controller. Another person, possibly his friend, is in the room with him. The scene suggests the two people are enjoying a night in together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 305000, "caption": " According to the picture, there is a woman standing in front of a produce stand with various fruits and vegetables. She seems to be browsing the selection and looking at the offerings. In the image, there are numerous fruits such as a few apples, bananas, carrots, a few potatoes, and a melon. The vegetables include some broccoli and a handful of other fruits. The woman is wearing a black shirt and a headband.", "mscoco_hallucinated_words": [["carrot", "carrot"], ["broccoli", "broccoli"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [51, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 44136, "caption": " In the image, there are two men dressed in tuxedos sitting at a table, surrounded by a series of chairs. One of the men is posing, likely in an attempt to appear on camera. One man is holding a glass in his hand, possibly filled with an alcoholic beverage, while the other man appears to be wearing a tie. Both men are smiling, suggesting that they might be enjoying themselves at a party or event.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 154846, "caption": " The image features a pizza filled with various toppings, such as cheese and mushrooms. It is placed on a dining table, and there are several people in the scene. Some are seated and others are standing around the table. There are four people in total, each holding a wine glass. One person is standing on the right side of the table, while another is standing on the left side. Another person is seated further back on the table, and the last one is standing at the bottom left corner. The scene gives an impression of a gathering where the pizza is being shared among friends.", "mscoco_hallucinated_words": [["wine glass", "wine glass"]], "mscoco_gt_words": ["pizza", "person", "dining table"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 1.0}}, {"image_id": 567254, "caption": " The image shows a young child, likely a toddler or preschooler, standing in front of a television in their living room. They appear to be holding a remote control in their hand, possibly playing with it or trying to change the channel.", "mscoco_hallucinated_words": [["remote", "remote"]], "mscoco_gt_words": ["person", "vase", "tv", "bottle"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 79261, "caption": "According to the picture, a man is standing in a large, open area filled with water, cleaning the surface with a broom. The man is wearing a blue shirt and appears to be pushing the broom across the wet floor. The water is shallow, suggesting that it might be a part of a fountain or a pool. There are several people in the scene, some standing closer to the man, and others scattered around the periphery of the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": " The image features a young man sitting in a chair with a laptop open in front of him. He is wearing a hat and sunglasses, and appears to be using the laptop. The chair is positioned at a dining table, which is surrounded by several chairs. There are also a few books on the table, perhaps used as references for the man's work or hobby. The dining table and chairs suggest that the setting is a living room or a study area.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["dining table", "laptop", "potted plant", "bottle", "chair", "person"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 415828, "caption": "In the image, a group of people is gathered on a street standing under a blue umbrella. They are engaged in conversation and appear to be enjoying each other's company. The street is lined with trees, and there are a few cars parked nearby. There are people carrying handbags, and some people holding umbrellas. This scene is typical of an urban setting, where people are interacting with each other.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "traffic light", "stop sign", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 543231, "caption": " In the image, a woman is standing in a living room, surrounded by other people who are also present in the scene. The woman is holding a video game controller, likely playing a video game, and she is the main focus of the scene. In the background, there are several chairs and a vase in different areas of the room, as well as some other people, including a person standing closer to the right side, another one standing near the back of the room, and another person sitting down.", "mscoco_hallucinated_words": [["chair", "chair"], ["vase", "vase"]], "mscoco_gt_words": ["remote", "bowl", "book", "bed", "person"], "hallucination_idxs": [57, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2}}, {"image_id": 495357, "caption": "According to the picture, there is a motorcycle down in the street, surrounded by several people. Some of the people are walking, while others are sitting on the sidewalk. There are also a few cars in the background, with one parked on the side of the road and another driving down the street. The motorcyclist appears to be the central figure as he navigates through the crowd.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["motorcycle", "person", "handbag", "dog"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 523175, "caption": "The image features a bowl of soup with a spoon in it, sitting on a table. The soup is green in color, giving it a unique appearance. There are numerous visible carrots dispersed throughout the bowl, which adds to the vegetable-based nature of the soup. Additionally, the soup has a visible small spoon in the image, suggesting it is being served with food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 184945, "caption": "The image features several motorcycles and bicycles parked on a sidewalk, with a few motorcycles and bicycles parked on the street. Some of the motorcycles are in the front row and a few others are scattered towards the back. The bicycles are more spread out, with a group towards the back and a smaller group closer to the front.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 49942, "caption": "The image features a man standing under an umbrella with his eyes open, looking straight at the camera. The umbrella is held up over his head, covering him partially. He is wearing a tie, and another tie can be seen near the bottom left corner of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "The image features a bright blue train, parked in a station. Several people are standing around the train, some of them possibly waiting to board or disembark from the train. There are a few people at the front of the train, a few more in the middle, and a few at the back. Some of the individuals are carrying bags, which could indicate they are getting off the train or are simply traveling with luggage. The scene suggests a busy train station with people moving around the train as part of their daily routine or travels.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 42889, "caption": "The image features a teddy bear that has been positioned in front of a desk. The teddy bear is wearing headphones and holding a remote control. Near the teddy bear, there is a cell phone positioned to the left and a computer keyboard with a mouse close by, on the right side. There is another computer keyboard near the teddy bear, situated slightly above the first keyboard.", "mscoco_hallucinated_words": [["desk", "dining table"], ["remote", "remote"], ["cell phone", "cell phone"], ["computer", "laptop"], ["mouse", "mouse"], ["computer", "laptop"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear"], "hallucination_idxs": [14, 25, 36, 44, 48, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.46153846153846156, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": " According to the picture, it features multiple items, including a pair of jars with fresh grass in them, sitting on a wooden surface. The jars are placed on both the left and right sides of the image. There are also some other items situated around them, possibly for display purposes or as part of a collection. Some of these items are spread across the image, including one closer to the top left corner, another near the bottom center, and a third on the right side. Overall, it appears to be an interesting display of items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 247587, "caption": "According to the picture, it appears to be a scene of a man standing on a surfboard in a sandy beach. The sand is distributed evenly across the entire scene, suggesting the beach may be a popular spot for water sports.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "According to the image, a large green bus and a small bus are parked on a street. The green bus is closer to the camera and appears to be the main bus in the scene. The smaller bus is parked behind it, partially hidden from view. There are several bicycles visible in the scene, with one bike parked underneath the smaller bus and another bike parked to the right of the green bus.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 545056, "caption": "The image showcases a large field filled with numerous cows grazing. The cows can be seen scattered throughout the grassy field, with some of them lying down, some standing, and others in various positions in between. The cows appear to be of different sizes, ranging from larger black cows to smaller ones. The green field provides an ideal environment for the cows to roam around and graze.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "sheep", "cow", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": " There is a man standing in a kitchen, possibly a chef, surrounded by various food items and bottles of liquid. The man is wearing a chef's apron and appears to be the main focus of the scene. He is holding a knife and standing near the food he is preparing.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["knife", "knife"]], "mscoco_gt_words": ["cup", "oven", "dining table", "bowl", "book", "carrot", "person"], "hallucination_idxs": [19, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": " According to the picture, a group of people, including a man and a woman, are standing together on a snowy slope, preparing to ski. They are holding their skis and discussing something. There are two other individuals who appear to be skiing in the distance, one close to the left side of the image and the other closer to the middle of the slope.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 246963, "caption": " The image features a group of motorcyclists riding down the street, enjoying their leisure time together. There are a few motorcycles in the scene, some of them with riders, and others without. Among the motorcyclists, there is a rider wearing a stop sign, indicating a fun ride with a stop sign in the middle of the group. The riders are gathered near the center of the image, creating a lively scene on the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "stop sign", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 498225, "caption": " A group of people, including a man and two women, are standing outside a cake shop, looking at the display cases. They are likely admiring the cakes, possibly discussing their choice. A few other people are visible in the scene, but they are not as close to the cake shop as the main group. One person is wearing a blue top, and another person has a handbag with them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cake", "cell phone", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124601, "caption": "The image features a kitchen filled with appliances such as a microwave, a laptop with a screen turned off, a sink, and several cups. The kitchen has a single cup on the countertop and multiple cups on a shelf. A laptop is also present on a table, with the keyboard and mouse positioned next to it. There are no people in the scene, but various food items can be seen in the kitchen, including several bottles with a variety of beverages inside, as well as a bowl placed at the countertop.", "mscoco_hallucinated_words": [["mouse", "mouse"], ["person", "person"], ["bowl", "bowl"]], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [57, 66, 96], "metrics": {"CHAIRs": 1, "CHAIRi": 0.23076923076923078, "Recall": 0.875}}, {"image_id": 581749, "caption": "The image shows a table filled with various types of fruits and vegetables, including broccoli, oranges, and apples. The broccoli is spread across the table, covering a significant portion of the scene. Some oranges are scattered around the table, placed close to each other in different locations. Additionally, a few apples are visible, positioned alongside the oranges. ", "mscoco_hallucinated_words": [["apple", "apple"], ["apple", "apple"]], "mscoco_gt_words": ["broccoli", "orange", "dining table"], "hallucination_idxs": [20, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 227960, "caption": "The image features a dining table with various food items on it, including a dish filled with potatoes and a bowl of salad on the table. The dish is positioned at the center of the table, while the salad bowl is located near the left side of the table. In addition to the main dishes, there are multiple carrots, with some placed near the edges and others closer to the center of the table. There are also some cups placed around the table, likely for drinking or serving food. A spoon is situated in the salad bowl, possibly for serving or mixing the contents.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["table", "dining table"], ["table", "dining table"], ["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["cup", "bowl", "spoon", "wine glass", "carrot"], "hallucination_idxs": [5, 26, 37, 51, 79, 89], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.8}}, {"image_id": 286503, "caption": "According to the picture, an adult elephant is standing in a large enclosure, surrounded by multiple people. The elephant appears to be taking interest in the people and is looking towards them. There are various handbags and purses being carried by the visitors in the enclosure, including some placed close to the elephant and others scattered throughout the area.", "mscoco_hallucinated_words": [["handbag", "handbag"], ["purse", "handbag"]], "mscoco_gt_words": ["person", "sports ball", "elephant"], "hallucination_idxs": [39, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": "According to the picture, a large, modern kitchen is being showcased, complete with stainless steel appliances. The kitchen is well-equipped with a sink, a refrigerator, an oven, and a microwave. The kitchen is spacious and has a dining area, which is likely to be used for entertaining and meals. ", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["oven", "tv", "bowl", "potted plant", "sink", "microwave", "chair", "couch"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.375}}, {"image_id": 142108, "caption": "In the image, a group of people, consisting of five individuals, is gathered around a dining table playing a game that involves blindfolded participants. The game is centered around trying to eat blindfolded, and bananas are being used as the food items in the game. Multiple bananas can be seen on the table, scattered around and placed in different locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": "The image features a baseball game with a player up to bat, ready to swing at a ball. There are several people in the scene, some of whom are sitting on the benches, and others standing around the field. The crowd of people observing the player adds to the excitement and energy of the game. A baseball glove can also be seen, indicating that there are multiple players actively participating in the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475043, "caption": "In the picture, there is a woman sitting at a laptop, drinking while working on the device. Surrounding her are other people in the room, with one man in the right corner and another woman in the top center of the image. Both individuals are also using laptops and appear to be engrossed in their work.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "laptop", "bottle", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 456784, "caption": "According to the picture, a man is cutting a cake with a knife. The cake is white and covered with chocolate icing. The man is holding the knife with both hands, as he is in the process of slicing the cake.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": "According to the picture, there is a remote control sitting on a wooden bench in a park, possibly waiting to be picked up. The remote control is placed close to the middle of the bench, with one part of the remote visible. The rest of the park surrounding the bench is filled with various other wooden benches, possibly forming part of a larger seating area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "bench", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 448694, "caption": "In the image, a group of young men is playing a game of frisbee in a park. They are actively engaged in the game, with one man in mid-air the action, reaching for the frisbee. The other players are scattered around the field, ready to catch the frisbee or assist their teammate in getting it. There is a crowd of people watching the game, some of them closely observing the action while others are further away from the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": "The image features a bowl filled with a variety of food, including pasta and vegetables. The pasta and vegetables can be seen in the middle of the bowl, surrounded by meat, including beef, and multiple pieces of broccoli. A fork can be spotted in the bowl, likely being used to eat the food inside. The dining table or other surface on which the bowl is placed seems to have a tablecloth or another covering on top.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 503210, "caption": "The image features a spacious living room with a blue couch, a blue chair, and a piano. The room is decorated with various paintings the wall and a potted plant. There are also two tables in the living room, one on the left side and another on the right side. The living room is well-furnished and inviting, creating a warm and inviting atmosphere.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["clock", "potted plant", "vase", "orange", "book", "chair", "couch"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.42857142857142855}}, {"image_id": 142257, "caption": "The scene captures a busy intersection with various traffic lights and people walking around. One car is positioned in the foreground, partially obscured by the other vehicles and traffic lights in the scene. A blue car is parked to the side, with a few other cars around it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}], "overall_metrics": {"CHAIRs": 0.352, "CHAIRi": 0.10688920454545454, "Recall": 0.5517084282460136, "num_hallucinated_caps": 176.0, "num_caps": 500.0, "hallucinated_word_count": 301.0, "coco_word_count": 2816.0, "length_response": 71.182, "hallucinated_caps_ls": [144305, 324008, 455610, 401027, 253630, 326021, 317320, 267537, 195645, 32625, 343534, 254368, 231534, 227031, 523292, 13550, 146837, 209925, 2154, 274931, 300784, 409346, 275929, 490491, 133146, 498747, 185327, 348654, 482432, 144723, 379842, 556478, 451943, 495513, 529256, 37907, 278365, 69757, 432213, 368367, 201555, 92131, 333712, 148977, 399058, 437910, 467887, 295656, 557998, 401010, 313928, 346940, 388237, 289640, 122602, 24104, 285302, 61959, 229659, 391222, 416104, 5934, 258209, 347453, 285820, 524601, 253262, 87052, 418692, 259345, 120776, 276719, 219546, 228350, 32190, 449798, 375769, 180764, 177015, 333998, 343218, 301837, 240681, 178753, 47781, 293703, 211604, 563938, 47940, 526414, 505213, 441363, 282553, 188136, 449675, 191691, 557447, 159979, 561967, 134223, 3742, 217593, 475813, 558784, 308543, 359164, 539951, 564629, 53431, 441840, 381031, 539555, 423984, 499486, 118124, 288729, 252625, 493610, 302789, 30667, 353898, 88269, 98979, 353320, 256628, 416384, 531327, 542771, 277073, 436814, 275611, 110449, 45550, 205193, 272566, 19763, 124236, 344930, 82488, 381615, 111109, 425569, 319607, 72656, 372087, 438535, 490711, 277051, 145666, 304984, 122467, 352900, 18150, 491000, 401163, 39663, 268293, 474499, 105782, 138834, 127659, 305000, 44136, 154846, 567254, 301402, 543231, 495357, 42889, 401411, 124601, 581749, 227960, 286503, 315740, 503210]}}}