{"overall_metrics": {"CHAIRs": 0.3616161616161616, "CHAIRi": 0.11395912712157949, "Recall": 0.5455381784728611, "num_hallucinated_caps": 179.0, "num_caps": 495.0, "hallucinated_word_count": 329.0, "coco_word_count": 2887.0, "length_response": 71.13535353535353, "hallucinated_caps_ls": [144305, 308003, 54870, 324008, 577912, 234433, 39053, 493862, 369547, 401027, 253630, 326021, 25812, 195645, 155671, 32625, 343534, 398753, 254368, 209347, 523292, 12375, 146837, 209925, 2154, 274931, 104326, 233888, 183766, 169169, 344942, 275929, 490491, 175878, 53744, 185327, 482432, 129707, 556478, 451943, 37907, 506401, 406611, 278365, 432213, 92131, 21779, 235595, 467887, 295656, 401010, 313928, 490284, 388237, 122602, 59710, 411564, 285302, 265628, 5934, 131138, 258209, 347453, 285820, 245664, 259345, 422328, 354559, 468005, 545594, 155131, 437720, 378831, 375769, 20650, 177015, 295105, 443259, 537861, 249969, 301837, 240681, 178753, 47781, 542792, 293703, 250540, 298067, 563938, 526414, 331084, 282553, 328818, 188136, 191691, 557447, 151081, 134223, 198163, 217593, 477016, 558784, 359164, 230983, 564629, 53431, 441840, 381031, 530520, 9628, 163518, 118124, 288729, 177721, 372220, 302789, 30667, 328110, 463678, 70815, 88269, 98979, 379443, 374061, 560018, 353320, 483227, 531327, 277073, 436814, 110449, 45550, 303099, 205193, 170099, 14940, 354540, 191301, 124236, 344930, 14285, 381615, 352703, 111109, 49005, 317254, 425569, 45435, 35827, 72656, 372087, 283118, 490711, 163728, 281028, 277051, 304984, 145448, 352900, 227438, 116037, 92107, 474499, 105782, 492057, 444304, 305000, 567254, 301402, 543231, 184945, 49942, 42889, 401411, 124601, 227960, 33753, 99053, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "There are a few computer devices displayed in the image. A laptop, microphone, and keyboard are placed on top of a laptop with a television set nearby. A cellphone is also visible, placed closer to the bottom of the scene. The various computer components are likely to be connected or in use together.", "mscoco_hallucinated_words": [["television", "tv"], ["cellphone", "cell phone"]], "mscoco_gt_words": ["keyboard", "laptop", "dining table"], "hallucination_idxs": [27, 32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 164091, "caption": "In a scene involving two dolls and a baby, one doll is seated in a baby carriage, while the other dolls are positioned on a bed. The baby carriage has a smaller bed inside, indicating it's intended for a baby. The baby carriage is positioned behind the right doll, and the other two dolls are closer to the left side of the bed. The scene gives a feeling of a child's room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 308003, "caption": "The image captures a large wooden park bench with weathered paint peeling off, situated on the sidewalk near a dirt field. The park bench takes up the majority of the scene and is situated close to the foreground. The background features dirt and dry grass, with various pine trees scattered throughout the area. The wooden bench appears to be in disrepair, with peeling paint and a slightly distressed look. The park bench serves as a seating area for people and a place to sit and enjoy the outdoors.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [85], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": "According to the image, the scene is a plate with a sandwich and salad. The sandwich, likely a hamburger or cheeseburger, is on the right side of the plate, and the salad is placed on the left side. There is a knife nearby, possibly used to cut the sandwich. The plate is placed on a table.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "fork", "knife"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 95908, "caption": "The image features a mouse pad, keyboard, and two computer mice on a desk. The mouse pads have a silver coffee cup on them, suggesting that the office setting might also be used for personal enjoyment. There are two computer mice, one close to the mousepad and the other towards the bottom of the scene. A keyboard is situated on the left side, further away from the mousepad and coffee cup.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 329614, "caption": "There is a large street in the image with a yellow church-style building on the left side and a car parked in front of the building. On the right side, there are several yellow church-style buildings as well as a yellow school bus, with one bus closer to the front of the street and the other further away. There are a few more cars, positioned throughout the scene. The clouds in the sky suggest that the setting is possibly outdoors and the weather may be gloomy or stormy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 324008, "caption": "The image displays a tray filled with a variety of food items, including bananas and yogurt. The bananas are spread throughout the tray, with some closer to the edge and others towards the middle. There are also a couple of bottles, possibly containing yogurt or other beverages, placed on the tray. The food items seem to be ready for a meal, or a snack, and are arranged in an organized way on the tray.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["cup", "dining table", "train", "orange", "airplane", "person", "banana"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.14285714285714285}}, {"image_id": 52700, "caption": " In the image, there is a long dining table filled with various types of cakes and cupcakes arranged in lines and placed neatly on the table. There are multiple plates covering the tabletop, each holding a different cake or cupcake. Some plates are closer to the edges of the table, while others are more centrally located. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 501311, "caption": "The image captures a baseball game happening on a field. A man in a baseball uniform is on the pitcher's mound, holding a baseball as he is pitching. A few other players are also visible on the field, positioned close to the mound, perhaps as a part of the infield.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": " There are two elephants in the scene, with people standing and sitting on them as well as in between. The people are wearing different types of clothing, including a few with white attire and a few with black attire. There is also a person standing nearby, perhaps watching the parade or waiting for the event to begin.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "clock", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": "According to the image, there is a small, beige-colored bathroom with a tub sitting in the center. A sink is placed next to the tub, and there are a toilet and a bowl close by. The toilet is positioned to the right of the sink, and the bowl is located underneath it. Several glass bottles can be seen in various places around the bathroom, such as near the sink, under the bowl, and on a shelf towards the right side. A chair can also be spotted in the left side of the image, possibly for someone to sit during their bathroom experience.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"], ["bowl", "bowl"], ["chair", "chair"]], "mscoco_gt_words": ["sink", "toothbrush", "toilet", "bottle"], "hallucination_idxs": [36, 53, 80, 92], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.75}}, {"image_id": 196341, "caption": "In the image, there is a large clock located inside the building, with people walking around the clock near its location. The clock has round black numbers with white highlights, and appears to be the focal point of the scene. A group of people are walking in a clockwise direction, likely enjoying the building's architecture and taking advantage of the location.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "The scene features an old street sign left right turn with an arrow pointing up and only painted in black. There are several vehicles in various states of movement, including a car parked on the right side of the image, a truck further back and another car parked near the center. There is also a police car in the scene, positioned towards the middle of the image.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 455610, "caption": "The image showcases a red motorcycle parked outside a building, possibly in a parking lot or a parking spot. There are two people standing near the motorcycle; one person is closer to the motorcycle and another is a bit further away. Additionally, a car is parked next to the red motorcycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 39053, "caption": " There is a table in the image with several pieces of food on it. Among the food items are a couple of pita breads placed in the center of the table, a bowl filled with sauce near one side, and a few cups of beverages, including a wine glass and a bowl, scattered around the table. There are various utensils, such as a spoon and a fork placed near the food, as well as a knife on the right side of the table.", "mscoco_hallucinated_words": [["wine glass", "wine glass"], ["fork", "fork"], ["knife", "knife"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "cell phone", "apple"], "hallucination_idxs": [51, 73, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.5714285714285714}}, {"image_id": 493862, "caption": " The image features a man in a grassy field standing with a tennis racket in his hand. He is holding the racket while standing in a position that suggests he is waiting for an opportunity to hit a ball. The field is vast and lush, providing a scenic setting for the man to enjoy his leisure time.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["person", "tennis racket", "dog"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 269873, "caption": " The image captures a busy sidewalk scene where a large group of people are standing and watching a trolley go by. They are on the sidewalk, near a street intersection, possibly waiting for the streetcar to pass by. The trolley is also prominent in the image, in the middle of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 524656, "caption": "The image features a small, well-organized kitchen with wooden cabinets. There are ovens and microwaves present in the kitchen, located at the top part of the picture. A sink is also present, located near the bottom of the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toaster", "cup", "knife", "oven", "refrigerator", "bowl", "spoon", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2727272727272727}}, {"image_id": 369547, "caption": " The image features a beach scene, with a dog and a bird sharing the same space. The dog is located mostly in the middle of the beach, while the bird can be seen closer to the right side of the scene. There is another bird towards the left, closer to the water. The beach is filled with pebbles and rocks. In the background, a person can be seen standing near the right side, partially obscured. Several bottles and a bottle are also visible near the person, but the contents are not indicated.", "mscoco_hallucinated_words": [["person", "person"], ["bottle", "bottle"], ["bottle", "bottle"], ["person", "person"]], "mscoco_gt_words": ["bird", "dog", "cat"], "hallucination_idxs": [72, 86, 89, 95], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": " The image depicts a man reaching into an open refrigerator while standing inside a kitchen. The man is leaning in from the left side of the scene, peering into the refrigerator to get something. The kitchen itself is quite large, featuring multiple people in the room. Several people can be spotted around the refrigerator, some standing close to it and others situated in different parts of the room. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bowl", "bottle", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 128119, "caption": "A large, white city bus is making its way down a street in front of an intersection, with a parking lane on the right side of the bus. There are two bicycles placed prominently on the front of the bus, one near the left side of the bus and the other closer to the center.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "handbag", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 401027, "caption": "The image features a plate on a table with a sandwich cut in half and a bowl of sauce in front of it. The sandwich appears to be made with bread, cheese, and tomato slices, which makes it a typical sandwich. There are also a few carrots in the scene, one close to the left side of the plate and another one closer to the middle. The sandwich is placed in the middle of the plate, surrounded by various condiments and vegetables. The atmosphere seems to be of a relaxed dining setting.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["carrot", "carrot"]], "mscoco_gt_words": ["cup", "knife", "dining table", "sandwich", "fork"], "hallucination_idxs": [16, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.4}}, {"image_id": 253630, "caption": "The image showcases a living room with a coffee table located to the left, next to the wall. The coffee table is rectangular in shape and has a glass top. A flat screen television is positioned near the center of the room and is turned on. There are two sofas in the room, one closer to the television and the other further towards the left side of the image.", "mscoco_hallucinated_words": [["sofa", "couch"]], "mscoco_gt_words": ["potted plant", "chair", "tv", "dining table"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 326021, "caption": "According to the picture, a woman is sitting at a dining table with a teddy bear. There is a dining table placed right in the middle of the scene, with two chairs to its left and two more on the right. The woman seems to be sitting at the table while the teddy bear is on a table in front of her. There is another teddy bear visible in the background, closer to the left edge of the image.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "person", "dining table"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 317320, "caption": "There are a group of people riding bicycles in the middle of a street. The bicycles are going in different directions, with one going right, another going left, and a third one going further left. A man is on the right side of the street riding a bicycle, while a woman is also on the right side of the street, riding another bicycle. A man is on the left side of the street, also riding a bicycle. A couple of bicycles are in the background, and a person is standing nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 507171, "caption": " The image captures a surfer riding a wave on a surfboard, likely enjoying the sport in a body of water. The surfer is located on the right side of the scene, with the water forming a background, as they are enjoying the waves.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 161227, "caption": "In the image, there is a black leather chair occupied by a yellow and white pillow with a decorative pattern, and a laptop sitting on top of the pillow. The laptop and remote control are placed close to the middle of the chair, which has the cushion on it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "keyboard", "laptop", "cell phone", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 25812, "caption": "The image captures a person in ski gear, snowboarding down a hill. They appear to be wearing a white hat and a jacket, and are likely enjoying the snowy landscape. A snowboard, possibly belonging to the snowboarder, is visible towards the left side of the image. The hill is near a ski slope and is surrounded by trees. There are at least three other snowboards positioned throughout the area, two of which are relatively close to the snowboarder. A set of chairs can be seen in the background, possibly at the ski resort.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [91], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 267537, "caption": " The image shows a group of women sitting together on a bench, possibly at a park or outside, while holding umbrellas to protect themselves from the weather. The women appear to be dressed in black and are seated on the bench, holding their umbrellas.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 195645, "caption": " A gathering of people, including a baby, is standing around a table. They seem to be enjoying the gathering or event as some are smiling and others are engaged in conversation. Around the table, various food and drink items are located, such as a wine glass and cups. Several people are gathered in a circle, with some sitting while others are standing. A few cups are dispersed throughout the table, as well as a bottle of water in one of the cups. A chair can also be seen, possibly for someone seated or standing by the table. In the background, two more chairs can be seen.", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"], ["cup", "cup"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["handbag", "bench", "dining table", "bottle", "wine glass", "person"], "hallucination_idxs": [53, 73, 91, 94, 116], "metrics": {"CHAIRs": 1, "CHAIRi": 0.35714285714285715, "Recall": 0.6666666666666666}}, {"image_id": 155671, "caption": "According to the picture, a small car is traveling down a city street with several traffic lights. A second car is also present on the street, sitting at a red traffic light further down the road. The street is mostly empty of other vehicles, and there are a few people walking nearby. A few trees can be seen in the background, providing some greenery to the scene.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 412894, "caption": " The scene takes place in front of a building with several people standing and walking near it, forming a large group. Several pedestrians can be seen dispersed throughout the area, with some closer to the camera and others on the outskirts of the group. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "handbag", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 32625, "caption": "The image features a table filled with various food items, including two main dishes, cereal, soup, and two cups of beverage. On the table, you can see a white plate with two pieces of food, one containing a fork, and a bowl filled with soup. There is a pair of spoons, one located near the bowl and another closer to the cup on the right side of the table.", "mscoco_hallucinated_words": [["spoon", "spoon"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "fork", "book"], "hallucination_idxs": [59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.8}}, {"image_id": 18688, "caption": "The image captures a baseball game in progress, with a focus on the players on the field. There are several people in the scene, some of whom are actively participating in the game while others are watching or standing in the background. The players appear to be scattered across various areas of the field, showcasing their baseball gear.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 343534, "caption": "According to the image, there is a small girl doll, dressed in a pink outfit, standing next to two stuffed bears, one brown and one black. The doll is holding a pink hat. There is also another stuffed bear, a brown one, on the left side of the image, slightly behind the main doll and the pink hat. The image provides an impression of the playroom setting, with several dolls and bears on display.", "mscoco_hallucinated_words": [["girl", "person"], ["bear", "bear"], ["bear", "bear"], ["bear", "bear"]], "mscoco_gt_words": ["teddy bear", "potted plant", "dog", "book", "chair"], "hallucination_idxs": [9, 23, 44, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 370657, "caption": "In the image, there is a group of people riding on the back of motorcycles, with a total of six people visible. One person is on the back of a motorcycle with a child and appears to be driving, while two people are behind them on the same motorcycle, seemingly riding as passengers. Two other individuals are riding motorcycles on their own, with one person in front and another behind. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": "In the image, a man wearing a baseball cap is standing at a sports stadium, surrounded by numerous people in various spots throughout the crowd. The man is holding a couple of hot dogs in one hand, with one hand in his pocket. The scene shows a busy environment, with numerous people in the crowd, creating an energetic atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460347, "caption": "The image depicts a long street with a city bus in the middle, traveling down two lanes. There are several cars around the bus, some of which are driving in the same direction as the bus, while others are traveling in a perpendicular direction.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 398753, "caption": "The scene is a busy baseball field where a baseball game is in progress. Several people are scattered across the field, with some standing closer to the batter and others farther away. A group of spectators is also watching the game, sitting on various benches surrounding the field. ", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 67833, "caption": "The image features a dinner table with a couple of pizzas on it. There is a bottle of red wine, an empty wine glass, and a bowl of olives on the table. Two people, one standing near the center of the table and the other near the left side, are also present in the scene. The table seems to be set for a casual meal with friends or family, possibly enjoying homemade pizzas and sharing some appetizers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 272880, "caption": " The image features a plate filled with a delicious slice of pizza placed on the side, awaiting someone to enjoy it. There is a sign on top of the pizza, with the word \"mystic\" written on it. The plate sits on a table, and there are a few forks surrounding the food. Some utensils, including a knife and a fork, can be seen on the table as well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254368, "caption": "According to the picture, there is a man sitting in a bed eating a piece of food. There are two bowls placed near the bed, one located on the left side and the other one on the right. The man is holding a plate in his hands. The bowls are placed closer to the left edge of the image, while the plate is held closer to the center. There are also three drinking glasses placed at different positions in the image, one on the right side, one on the left side, and another closer to the middle.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["pizza", "bed", "person"], "hallucination_idxs": [22, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": "According to the picture, there are two baseball players in the scene, both wearing baseball helmets, and they are preparing to play a game. One of the players, on the right side of the image, is holding a baseball bat and getting ready to swing at an oncoming ball, while the other player, on the left side, is in a defensive position. A large glove can be observed near the baseball player on the left side, likely used for catching.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 459141, "caption": "According to the image, a person wearing a straw hat is riding a bicycle. The person is holding a bag or basket on the front of the bicycle. There are also two other bicycles in the scene, one of them is slightly further from the camera. The overall scene is set in a residential or urban area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "There are several trucks lined up in front of an airplane on an airport runway, including a truck in the foreground and three more trucks further back. Some of the trucks appear to be moving and unloading while others are stationary. The large airplane, with one propeller visible, is parked near the trucks, suggesting a loading or unloading process. The airplane's landing gear is also clearly visible.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "suitcase", "handbag", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 227031, "caption": "In the image, a man and a woman are standing in front of a white cake, presumably being cut into for a party or celebration. They appear to be getting close to each other, likely to get a better view or position for the event. Another person is visible in the background, possibly waiting in line to get close to the cake.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "tie", "potted plant", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 209347, "caption": " The image depicts a woman sitting on a park bench, surrounded by potted plants. She is holding a cell phone in her left hand, looking down at it. The woman is wearing a jacket, likely a black leather one. ", "mscoco_hallucinated_words": [["bench", "bench"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["handbag", "tie", "train", "cell phone", "person"], "hallucination_idxs": [9, 13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 523292, "caption": " The scene features a cat sitting on top of a table, positioned in the middle, close to a laptop that is open on a table. The cat appears to be interested in the laptop, looking at it from a close distance. The open laptop and the table are situated in the center of the image, with various objects, such as books and chairs, surrounding them. Some of the objects are positioned close to the cat and the laptop, while others are more spread out around the table. The couch is located on the left side of the image, partially visible at the edge.", "mscoco_hallucinated_words": [["book", "book"], ["couch", "couch"]], "mscoco_gt_words": ["tv", "dining table", "laptop", "cat", "chair"], "hallucination_idxs": [67, 99], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.8}}, {"image_id": 12375, "caption": " The image features a plate filled with a variety of foods, including meat, potatoes, carrots, and other vegetables. The meat appears to be either chicken or turkey, while the potatoes are either mashed or boiled. The carrots and other vegetables seem to be part of the side dishes or a complement to the meal. ", "mscoco_hallucinated_words": [["turkey", "bird"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 13550, "caption": " The image depicts a woman standing on a rock, near a body of water, and feeding an elephant. The elephant is the focal point of the scene, with several others present in the background. The woman appears to be engaging with the animals and interacting with them. There are a few other people in the scene, standing at different positions or walking.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": "In the image, a young man is shown skateboarding, likely practicing or perfecting a trick. He is performing a trick on a rail, with the skateboard close to the top edge, while also balancing on it. The background includes numerous plants, which are spread throughout the scene, adding a green and natural touch to the image. There are several bushes present, as well as a few small trees. Additionally, a bench can be seen near the bottom left corner of the image, which may be part of a park or public area.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["motorcycle", "skateboard", "person"], "hallucination_idxs": [82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": "According to the picture, a man in a white shirt is standing in a building, possibly an archway. He is looking at his cell phone, checking it while standing in this location. Another man, wearing a tie, is in the same area, but further back and to the right. Both individuals are slightly leaning to the left, as they appear to be capturing information on their cell phones.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209925, "caption": "According to the picture, a baseball player is getting ready to throw a baseball in the outfield. He is standing on a pitcher's mound, wearing an orange baseball uniform. The player is on the left side of the image, holding a baseball in his hand and getting into the pitching motion. He is focused on the game and poised to execute the pitch.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 2154, "caption": "In the scene, a shopping cart is sitting outside a large parking area with many cars parked. The parking area is adjacent to the street, and there are several dining tables scattered in different places. In the middle of the scene, there are two chairs placed close together, and a bench is located near the cars parked in the parking lot.", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["car", "umbrella", "bench", "person"], "hallucination_idxs": [33, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 274931, "caption": " The image features a black and white view of a side street, with multiple cars parked on it. The cars are parked in a row, and some of them appear to be older models. The street is a bit empty, with only a few people visible on it. The scene looks like it was taken during the day, as there are trees and sunlight visible in the background. There are some traffic signs, including a stop sign, and one of them appears to be placed closer to the left of the image. The sidewalk is also visible, with a small plant near the street.", "mscoco_hallucinated_words": [["person", "person"], ["stop sign", "stop sign"]], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [49, 84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "According to the image, there is a long green-colored kitchen located in the middle of a room. A wooden dining table stands near the kitchen area, with two long benches placed at the sides. A few chairs are also scattered around the kitchen and the dining table. Two clocks can be seen: one closer to the left side of the room and the other on the right side, near the table. Several bottles and cups are present throughout the kitchen, likely placed for serving food or drinks. The kitchen and dining room appear clean and well-maintained.", "mscoco_hallucinated_words": [["bench", "bench"], ["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["clock", "knife", "oven", "dining table", "potted plant", "sink", "microwave", "chair"], "hallucination_idxs": [32, 80, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.375}}, {"image_id": 233888, "caption": " According to the picture, a little boy is playing baseball in a room while a baseball game is on television. The boy is holding a bat and the ball is in the room as well, just in front of the TV.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["tv", "potted plant", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 213905, "caption": "The image features two women standing on a tennis court together, both holding tennis rackets in each hand, as they seem to be discussing something together. There are various people in the background, some seated and others standing in different parts of the court, likely engaged in different activities or watching the tennis players. The scene conveys a relaxed and casual atmosphere, as the women share their conversation and time together on the tennis court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "tennis racket", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 277479, "caption": " The image features a toddler, a baby and an open refrigerator. The little girl is standing by the refrigerator door, while the baby is on the left side and closer to the fridge. The refrigerator door is also partially open, indicating a person reaching into it. The arrangement of the people around the refrigerator appears to be a family scene with multiple people interacting with the child. There is an adult in the background on the right side, and another person is in the center of the frame, closer to the toddler. Overall, the image captures a family moment in a kitchen with the baby and the toddler exploring the contents of an open refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 300784, "caption": " The image portrays a group of young children sitting together in a sitting area or on a couch, surrounded by a bunch of stuffed animals, possibly a collection or a museum display. The children are sitting close to one another, looking at the animals and enjoying their time together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 183766, "caption": " The image features a dish with cherries and wine glasses on a dining table. The dish, possibly a dessert, is surrounded by small plates holding cake and macarons. There are two wine glasses next to the dish, one to the left and the other to the right, making a total of three glasses in the scene.", "mscoco_hallucinated_words": [["wine glass", "wine glass"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["cake", "fork", "knife", "dining table"], "hallucination_idxs": [8, 35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 169169, "caption": " In the scene, there is a long, blue bench with pillows sitting near a building. A few people can be seen walking on the sidewalk in the vicinity, but there's no indication of the time of day or their purpose.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["traffic light", "person", "potted plant", "car"], "hallucination_idxs": [10], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 409346, "caption": " In the image, there are two women standing near a table with a variety of foods and desserts displayed. They are possibly sharing a meal or event together. The food seems to be arranged in different sections, with some pieces placed towards the top of the table and others closer to the bottom. The two women seem to be in the process of handing out the food to those around them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 522262, "caption": "In the image, there are numerous bikes parked on the sidewalk next to a railing. They are parked in a row, with some bikes positioned more closely together and others further apart. The bikes appear to be locked to the railings and the railing is likely used to prevent them from falling over.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "The image captures a man's reflection in a glass window, with a yellow object placed in front of his face. The glass surface seems to be a door, as there is a door handle visible on the left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 344942, "caption": "A person is sitting at a computer with a laptop placed on a table. On the screen, a brown box is visible on the top part, possibly representing the profile picture of someone on a social networking site. There is another box located near the left side of the screen, also possibly representing the profile picture of someone else. This gives the appearance of a social networking page with multiple images present.", "mscoco_hallucinated_words": [["person", "person"], ["table", "dining table"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [1, 13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": " The image features several white toilets sitting on the front steps of a building, which is covered with a staircase. The toilets are placed side by side on the steps, creating an unusual display. There are also some bottles and bowls placed near the stairs, adding to the clutter. The bottles are closer to the stairs, while the bowls are further away. Overall, the scene presents an image of various white toilets arranged on the steps and some additional items scattered around them.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["sink", "bowl", "toilet"], "hallucination_idxs": [42, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 465601, "caption": " According to the picture, a group of people is standing outside under umbrellas. Some of the people are holding umbrellas, while others are not. There are several umbrellas visible in the scene, with some people holding them and others positioned nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 300124, "caption": "The image features a large chair with a broken frame and a small microwave sitting on top of it. The microwave is positioned on the edge of the broken couch, creating a unique scene that draws attention. The chair appears to be green and located outdoors, likely in a sidewalk or street setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "couch", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 46919, "caption": "According to the image, there is a man snowshoeing down a snowy slope. He is surrounded by snow-covered mountains. Two more individuals are visible in the background, one to the left and one to the right, possibly skiing or hiking on the same mountain trail. The image is taken from a somewhat elevated perspective.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 434022, "caption": "A large train is traveling down a long train track, passing over a small river near the coast. The train is filled with an abundance of people onboard, with some gathered at the front of the train, and others spread throughout its length. The train seems to be carrying a significant number of passengers on this journey, with the majority positioned near the front of the train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 490491, "caption": "According to the picture, there is a street with a brick wall in the middle of it. The wall has the word \"S,\" possibly a sign for a street, on it. There is a traffic light next to the wall, possibly indicating the end of a block. A few cars are parked around the area, including one in front of the brick wall and another closer to the traffic light. Various street signs, including stop signs and directional signs, can be seen in the scene. In addition to these signs, there is a cell phone on the ground close to the sidewalk.", "mscoco_hallucinated_words": [["traffic light", "traffic light"], ["traffic light", "traffic light"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [41, 78, 108], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.6666666666666666}}, {"image_id": 175878, "caption": "According to the picture, a group of four people are standing near each other in a building. They are holding or holding a cell phone, and one of them appears to be taking a picture with their cell phone. The cell phone can be seen in both the left and right areas of the scene, and a bottle is placed near one of the individuals. There is a spacious atmosphere in the room, and everyone is enjoying their time together.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["backpack", "cell phone", "person"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 12669, "caption": "According to the image, a group of people is standing inside a building or room, with one of them holding a glass of wine. A young woman wearing white and pink is standing near the left side of the room, looking at the wine glass. Other people can be seen in the background, with one person in the right side of the room and another one standing in the center of the room. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "handbag", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 192496, "caption": " In the image, two train cars can be seen, one behind the other, as they are traveling through a city. The front train car is more visible, with some of its detail on the right side of the image. The second train car is positioned further back, partially obscured by the first train car. There is a sense of movement and motion as the train is passing through the city.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 133146, "caption": "The image presents a room with a large collection of furniture in a well-organized manner. There are two couches, one towards the left side and another closer to the center, both having a distinct layout. A few chairs are positioned around the room, giving a comfortable seating arrangement for occupants.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "vase", "book", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53744, "caption": "The image features various food items on a cutting board, including several slices of tomato and lettuce. These vegetables appear to be cut into different pieces, creating a colorful and appetizing scene. A knife can be seen, placed near the vegetables, likely used for cutting the tomato and lettuce into different pieces. In the background, a refrigerator can be observed.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["sandwich", "knife", "dining table"], "hallucination_idxs": [64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 375902, "caption": "In this image, there is a person wearing skis who has just crossed the finish line, possibly completing a skiing event. He is holding his arms up triumphantly and stands out in the scene. The person is situated near the center of the image, surrounded by other individuals, who could be the competition or spectators. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": " In the image, a man wearing a sweater is holding a cell phone up to his ear, while the people in the background are focused on something else. There are several people in the scene, with some standing closer to the foreground and others in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bird", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 52435, "caption": "A large group of people is at the beach, with several people surfing in the water. There are many people on the beach, ranging from a few in the background to a handful closer to the water's edge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "umbrella", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": " The image features a cow walking across a road near a store, likely a market or shop, in the middle of a busy street. There are a few bottles located around the cow, some closer to it and others farther away, including ones that resemble sodas.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 261097, "caption": " The scene takes place on a tennis court with a group of people, including a man holding a tennis racket, walking or standing around it. Several other individuals are scattered throughout the court, some closer to the man, and others on opposite sides of the court. The players appear to be moving around, possibly waiting for a match to begin or simply taking a break during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 227204, "caption": " In the image, three women of Asian descent are sitting around a dining table, enjoying a meal together. They appear to be having breakfast, and there are various food items on the table. There are multiple bottles, likely containing liquids like juice or tea, positioned around the dining area. Some of the food items include bananas, likely part of a breakfast spread, along with other fruits and vegetables.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cell phone", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4444444444444444}}, {"image_id": 353981, "caption": "There is an image of an airplane on a runway in the image. The airplane appears to be a large plane with multiple air tanks attached to it, likely used to transport and store various substances such as fuel, oil, or weapons. The runway is likely an airport, and the image is likely a black and white photo, taken during a different era.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 185327, "caption": "In the image, there is a car driving down a city street, surrounded by a herd of walking elephants. The elephants are in a line, with some walking closer to the car and others farther away. There are also several other people walking on the street, but they are further away from the car and the elephants. The scene appears to be happening in front of a building, likely a restaurant or business of some sort.", "mscoco_hallucinated_words": [["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"]], "mscoco_gt_words": ["horse", "bus", "cow", "car", "person"], "hallucination_idxs": [20, 23, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.4}}, {"image_id": 348654, "caption": "In the image, there is a cell phone sitting in a holder on a wooden desk. It appears to be black and placed in the center of the desk. Near the cell phone, there are two monitors - one placed on the left and the other on the right side of the desk. The monitors are positioned somewhat close to the cell phone, with one monitor a bit further towards the back, and the other closer to the front of the desk. Overall, the scene features the combination of a cell phone and monitors in a home or office setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 482432, "caption": "The image is a close-up view of a bathroom sink. It features a bottle of skincare product on the left side, a toothbrush sitting next to it, and a tube of dental toothpaste nearby. The toothbrush is situated close to the toothpaste tube, with some space between them. There is also a cellphone, possibly for use while in the bathroom, placed further away from the sink.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cellphone", "cell phone"]], "mscoco_gt_words": ["toothbrush", "sink", "bowl"], "hallucination_idxs": [14, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": " The image features a piece of pizza placed on a plate, which is the main focus of the image. It appears to be a classic Italian-style pizza with multiple slices of meat inside. The pizza is accompanied by a wine glass on the side, suggesting that the meal may be accompanied by a drink. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "pizza", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 144723, "caption": "The image depicts a dog leaping into the air with a frisbee in its mouth as it tries to catch the disc. The dog is mid-jump, and its head and front paws are close to the frisbee, while its back legs are further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "frisbee", "car", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "A person is holding a cell phone in their hand, showing the front side of the phone. The cell phone has multiple buttons, including buttons on the front and back of the device. There are also two chairs visible in the background, indicating that the person might be in a living room or a meeting space.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["book", "cell phone", "person"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "A picture of a book on top of a table is displaying two video games, a Wii game and a Nintendo 3DS game titled \"super mario bros. wii\". The Wii game is positioned slightly above the 3DS game and both are side by side. They appear to be in great condition with minimal wear.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "remote", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 203110, "caption": "The scene takes place on a street with several people walking around and a train car traveling down the tracks. The train car is orange and appears to be a retro streetcar, as it runs alongside a modern street and the surrounding buildings. There are a couple of people walking in close proximity to each other, and several more individuals are scattered throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bus", "train", "orange", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 556478, "caption": "The image features two vegetarian pizzas placed side by side on a stove top. They are cooked on pans, with one pan located further right and the other one on the left. The pizzas are covered with toppings such as cheese, peas, broccoli, onions, and a vinaigrette. The image captures the process of preparing these pizzas, showcasing the kitchen as a place for cooking and creating delicious meals.", "mscoco_hallucinated_words": [["broccoli", "broccoli"]], "mscoco_gt_words": ["pizza", "cup", "oven", "bottle"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 103815, "caption": "The image features a baseball game in progress. There are multiple players on the field, and two baseball players are visible, one pitching and the other swinging at the ball. The pitcher is preparing to pitch the ball, holding a baseball in his hand, while the batter is swinging a baseball bat with the intention of hitting the ball. The players in the scene are scattered throughout the field, and a pile of baseballs can be seen in the area, indicating that the game has been going on for some time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": "According to the picture, the scene is a bedroom with a double bed. The bed is neatly made with a comfortable bedspread and a pillow underneath it. There are two lamps to the left and right of the bed. Near the bed, a large area has been sectioned off with a bench and a foot stool, creating a seating area for a desk or other furniture.", "mscoco_hallucinated_words": [["bench", "bench"], ["stool", "chair"]], "mscoco_gt_words": ["backpack", "suitcase", "bed", "dining table"], "hallucination_idxs": [56, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 268854, "caption": " The scene features a group of people standing around on a sandy beach near a body of water, with several boats in the background. The person closest to the water is carrying a small handcart filled with a pumpkin-shaped fruit. The other people appear to be gathered around her as she transports the goods to the boats. The boats, likely fishing boats, are parked in various positions near the shore.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "person", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 34464, "caption": "The image depicts a beach scene where a red surfboard is partially submerged in the ocean water. Several people can be seen in the background, standing on the shore, with some of them closer to the water and others farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 495513, "caption": "According to the picture, there are two main people in the scene: a man and a woman standing next to each other. The couple is wearing coats and holding up hot dogs on sticks. The hot dogs are arranged side by side. There are a few other people in the background, but they appear to be in different positions, as if they are not as focused on the hot dog sticks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": "According to the picture, there are two dogs lying on a bed. One dog is situated near the left side of the bed, covering most of it, while the other dog lies on the right side of the bed, occupying a smaller area. The two dogs are positioned close together, with the one closer to the left side sleeping on top of the other. Both dogs are relaxing, seemingly enjoying their restful sleep.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": "The image shows a cat sitting on top of a television in a room. The TV is placed to the right of the image, and the cat is sitting on the top edge of the television. The cat appears to be looking at the camera or at something in front of the TV. There is a DVD player to the left of the TV, and a few remotes can be seen on the right side of the image, near the TV and the cat. Additionally, a vase is placed near the television, adding some decorative flair to the room.", "mscoco_hallucinated_words": [["player", "person"], ["remote", "remote"], ["vase", "vase"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [61, 72, 94], "metrics": {"CHAIRs": 1, "CHAIRi": 0.21428571428571427, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "According to the picture, it is a sunny day as various street lights, including a traffic light, illuminate a small intersection in a city setting. The intersection is bustling with activities, with a red car driving towards the camera from the left side, and another car parked at the right edge. In the center of the scene, there is a statue of three dogs, surrounded by a rock or water feature.", "mscoco_hallucinated_words": [["dog", "dog"]], "mscoco_gt_words": ["truck", "traffic light", "cat", "car", "person"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 406611, "caption": " The image features a group of people gathered around a ski slope on a sunny day. Most of the crowd is standing, with people of all ages and sizes, including children and adults. Some people are holding snowboards, while others are carrying skis, showcasing the sports enthusiasm in the area.", "mscoco_hallucinated_words": [["snowboard", "snowboard"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": "In the image, there are several vehicles and people gathered around them, creating a scene of a transport truck and a few smaller vehicles. The truck, with large satellite dishes mounted on its sides, dominates the scene. Several people are walking around the vehicles, interacting and likely having a conversation. They are positioned around the truck and smaller vehicles, giving the scene a somewhat festive vibe. A few cars are also present in the scene, one towards the left side, and another two towards the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "remote", "tv", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 278365, "caption": "According to the picture, there is a man and a woman sitting at a dining table eating pizza in the evening. The scene seems to be taking place outside, possibly at a patio or balcony. They are holding their food using forks. Additionally, there are two bottles, one close to the man and another close to the woman. The bottles seem to contain beverages. The image includes a total of four cups on the table, two closer to the man and the other two towards the woman.", "mscoco_hallucinated_words": [["fork", "fork"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["pizza", "cup", "dining table", "cell phone", "chair", "person"], "hallucination_idxs": [45, 52, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.23076923076923078, "Recall": 0.6666666666666666}}, {"image_id": 69757, "caption": "According to the picture, the scene features a group of people gathered around a dining table. One person in particular, a woman, is wearing a striped shirt and seems to be the center of attention as she cuts a cake. There are two knives placed in front of the cake, one closer to the right side and another near the left side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "cake", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 432213, "caption": " The image features a plate filled with a variety of food items, including a sandwich and French fries. The sandwich appears to be a breakfast sandwich, with an egg and a variety of ingredients. The French fries are arranged on the plate with several around the sandwich. There is a mix of foods on the table, which includes the sandwich, French fries, a bottle, and an empty plate.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["knife", "dining table", "bowl", "sandwich", "wine glass", "person"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.3333333333333333}}, {"image_id": 368367, "caption": "In a scene featuring several people, a woman is smiling as she talks on a cell phone, holding it up to her ear. She is positioned near the center of the image. A few others are also present in the background, some closer to the camera and some farther away, including a couple near the right side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "A dirt road has been constructed on the beach to support the new highway bridge, creating a significant construction site for the area. There are a couple of traffic lights along the street, one near the top of the dirt hill and the other at the bottom. Several cars are parked or passing by on the highway, including one to the left, two in the middle, and one near the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92131, "caption": " The image features a green bus with people outside waiting for it. There are several people in the scene, with one person carrying a bag and another wearing a backpack. The majority of the crowd is waiting in front of the bus, some of them standing and others sitting on the ground. The bus is parked on the street, and the group of people appear to be waiting to board the bus.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["bus", "person", "handbag"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": " The image depicts a young woman lying down on the floor, propping herself up with her head resting on a large suitcase. She is smiling and seems to be exhausted from a busy day, possibly traveling. There are a couple of bottles and a bag located near her, but the focus is on the woman and her situation.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "This scene takes place in a baseball game, as a batter stands at home plate, ready to hit. A catcher is also present, waiting for the ball to be thrown. The batter has a baseball bat in hand, and there are several people around the field, including a umpire, likely observing the batter's action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 160183, "caption": " The image features a fire truck parked near a small building. The fire truck is primarily red and has white accents. The truck is parked in a parking lot, which is surrounded by cars in various positions throughout the lot. There are a few parked cars in the background, as well as a building, creating a diverse scene with the fire truck at its center.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 555456, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 235595, "caption": " The scene features several parked motorcycles lined up in a row along the sidewalk on the right side of the street. They are next to a couple of cars, one located in front of the parked motorcycles and another farther back. A truck can be seen in the background, parked further down the street. ", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["cup", "motorcycle", "potted plant", "car", "parking meter", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 489909, "caption": " The image features a basket filled with various types of vegetables and fruits, including tomatoes, oranges, bananas, and cucumbers. The oranges are located in the center of the basket, with a few of them spread throughout the space. The bananas are located towards the bottom and to the right, while the cucumbers can be seen closer to the edges and bottom of the basket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": " The image features a potted flower arrangement sitting on a table. It consists of pink flowers, which are located towards the top and middle of the arrangement. The arrangement looks beautiful and lush.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": "The scene is a living room where a group of people are gathered around a TV. A man is standing next to a woman, holding a remote control, and he is enjoying a Wii video game. Another person is located on the left side of the TV, while two more individuals are placed on the right side of the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "tv", "potted plant", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 409138, "caption": "The image captures a large white bus driving down a road past buildings, likely in a city environment. The bus can be seen in the center of the scene, positioned between the two sides of the road. Several cars are visible along the street, one closer to the bus, and the others on the other side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 411840, "caption": "The image depicts a tennis court with a group of tennis players walking on the court. There are two main players, both women, walking on the left side of the court. They are holding tennis rackets as they walk, possibly heading towards the court's service area or a change of side area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "backpack", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": " In the image, there's a piece of pizza that is covered in toppings. The pizza has several slices cut into it, and some appear to be on a plate or dish. The toppings include bacon and onions, giving the pizza an appetizing flavor. There are additional slices that are not immediately visible, but are likely present in the pizza dish as well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": " In the image, a teddy bear is sitting on a table, wearing headphones. The teddy bear seems to be enjoying the music or audio it's listening to.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "teddy bear", "bear", "dining table", "bottle", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 399058, "caption": " In the image, there are two trucks driving on the road, one behind the other, with an elephant standing in the middle of the road. The man is riding on the back of the elephant as it walks down the street. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "elephant", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 183571, "caption": " According to the picture, there is a man standing on a red boat while a dog is located next to him on the left side of the boat. The man, possibly in a red shirt, is wearing a baseball cap and seems to be leaning over the boat. There is also a person standing near the left edge of the image, and another individual can be spotted near the right edge. The background of the image features some water and a variety of people in various locations. Overall, the scene appears to be a man enjoying a boat ride with his dog, while also interacting with others around the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": " There is a display of multiple motorcycles on exhibition at a gathering, likely a trade show or expo. These custom-painted motorcycles are parked and displayed on a red carpet, showcasing the custom paint designs and decorations. In addition to the motorcycles, there are several people standing or walking around the room, enjoying the event and examining the exhibits. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437910, "caption": "The image depicts a small bathroom with a sink and toilet. The bathroom has a door that opens towards the left side, leading to a sink area, which is located near a window. The toilet is situated in the right side of the bathroom, occupying a significant portion of the room. The sink is placed near the right edge of the room, and the toilet appears to be white. This unique bathroom setup is not very common, but it serves its purpose.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": "The image shows a black and white photo of a bay with boats in it. There are multiple boats of different sizes scattered around the bay. Some are closer to the water's edge while others are further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 467887, "caption": "The image depicts a man sitting at a dining table in his home, using a laptop computer. He is positioned in front of the table and his hand is on the laptop's keyboard. There is another person, possibly a friend or family member, sitting nearby. In the room, there are two couches, one of them closer to the dining table, while the other one is on the other side of the table. The couches seem to be decorated with a vase and a book, with the vase located towards the center and the book on the right side.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["vase", "vase"], ["book", "book"], ["vase", "vase"], ["book", "book"]], "mscoco_gt_words": ["dining table", "laptop", "mouse", "chair", "person", "couch"], "hallucination_idxs": [35, 91, 94, 98, 105], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 174402, "caption": " The image shows a man standing on a tennis court holding a tennis racket in his right hand. He appears to be concentrating on the incoming tennis ball. The ball is located closer to the left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 352734, "caption": " The image depicts a little boy, seated at a dining table, eating a piece of chocolate cake. The cake is sliced and served on a fork. There is a spoon nearby, which the boy is using to feed himself with the cake. Additionally, there is a plate placed at the center of the table, possibly with the cake on it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346716, "caption": " There is a city street scene with two red double-decker buses driving down the road next to each other. The buses are moving in opposite directions, creating a unique sight. One of the buses is closer to the right edge of the image, while the other is positioned to the left. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": " In this snowy outdoor scene, a pair of snowboarders is standing on a snow-covered slope. One of the snowboarders is positioned further down the slope, while the other is closer to the top. They are wearing backpacks, which can be seen on their backs. The overall atmosphere is relaxed as they enjoy their day out in the snowy terrain.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295656, "caption": " According to the picture, there are three people in the scene, one of them is a man and the other two are women. They appear to be dressed up, perhaps for a special event or gathering. All three individuals are eating pizza together, with each person holding a slice on a plate. The people are smiling and seem to be enjoying their meal, posing for the camera in a group setting. A cake can also be seen, which could be a dessert or part of the meal.", "mscoco_hallucinated_words": [["cake", "cake"]], "mscoco_gt_words": ["pizza", "clock", "tv", "tie", "car", "chair", "person"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.2857142857142857}}, {"image_id": 129886, "caption": "In the image, there are several people standing on the snowy slope in skiing attire. Two main ski lifts are visible, one near the left side of the image and the other near the center of the scene. Some of the people are positioned closer to the left ski lift, while others are nearer the center ski lift.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, a woman is sitting at a table with bottles on it. There are three bottles in total, with one bottle in the center of the table and the other two on the sides. The woman is sitting close to the center bottle and appears to be holding her hand to her chin. The background features a combination of wood and metal elements, which may be part of the dining table or decorative elements.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 401010, "caption": "The image features a room with a large, neatly made bed and two windows. The bed is situated in the center of the room and is surrounded by various items, including a bottle, a clock, a cup, and a television. There is also a cell phone on the bed, possibly belonging to the person who slept there. The room appears to be neatly organized, and there is ample space to move around the bed and enjoy the surrounding views.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["clock", "clock"], ["cup", "cup"], ["cell phone", "cell phone"], ["person", "person"]], "mscoco_gt_words": ["sink", "chair", "bed", "tv"], "hallucination_idxs": [35, 38, 41, 51, 61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 567562, "caption": " In the image, there is a woman sitting at a table surrounded by three children. The table is full of food, and the woman is smiling as she eats. The children are also eating, with several plates filled with various dishes. There are bowls and a cup placed on the table, indicating that they are enjoying a meal together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "dining table", "bowl", "spoon", "bottle", "fork", "broccoli", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.36363636363636365}}, {"image_id": 313928, "caption": "In the picture, there is a large outdoor area overlooking the ocean, featuring a table and chairs set up for people to enjoy their beach vacation. The table is positioned near a railing and a surfboard can be seen sitting on the ground next to the table.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["surfboard", "dining table", "bottle", "wine glass", "chair"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 490284, "caption": "A black dog is lying on a couch, partially covering a person seated beside it. The dog's head is near the left side of the couch, while its back end is on the right side. A few chairs are visible in the background, but only one is occupied by someone, seated closer to the left side of the scene. The dog appears to be relaxing and enjoying its position on the couch.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["couch", "person", "dog"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 346940, "caption": "The image features a stove with a pot, skillet and an electric kettle. The pot and the skillet are filled with water, which appears to be simmering. A silver tea kettle is also present on the stove, its spout emitting a fine mist of water. The steam coming from the pot and kettle creates an interesting effect, adding a steamy atmosphere to the kitchen. The stove is located in a room surrounded by windows, providing a light and airy setting for the cooking scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 222782, "caption": "The image is a black and white photo of a street scene with people walking on the sidewalk near a gas station. The sign above the gas station reads, 'american amoco gas' and is displayed in front of the store. There are two people walking on the sidewalk, one closer to the store and the other farther away. The street is empty except for the two people and the gas station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 255934, "caption": "In the image, two elephants are walking, one of which is a baby. The mother elephant is slightly larger, with a noticeable size difference. They appear to be walking in an enclosed area, such as a park or a zoo. There is also a person in the image, likely a parent or guardian accompanying the child. The other elephant and the person are positioned near the mother elephant and baby, creating a close family atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 278627, "caption": "According to the image, the following descriptions are associated with the food presented in the picture:", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "potted plant", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 400851, "caption": " The scene is an outdoor sandy beach, filled with a large group of people gathered near the water. There are various seating options, such as a few benches and a few chairs dotted around the area. Several boats are also visible near the waterline, providing additional ambience to the beach setting. People are spread across the beach, engaging in conversations, enjoying the sand and sun, and enjoying their time together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "umbrella", "chair", "backpack", "boat", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 472610, "caption": " The image features a woman standing near a large field surrounded by cows. The cows are scattered throughout the grassy area, appearing in various locations. The woman is likely petting the white dog or playing with it. There are two main areas where the cows are present in the image, one towards the upper part of the scene and the other at the bottom. The cows seem to be enjoying their time in the pasture, while the woman enjoys the company of the dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": "The image is an old photo of people and boats in the water. There are 13 people in total, scattered throughout the scene. Nine of these people are standing at the shore, some closer to the water and others at a distance, while the other four are closer to the boats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388237, "caption": " In the scene, a boy is skateboarding down a street while riding on a bench. There are several other people present in the background, some closer to the skateboarder and others more distant. The skateboarder is wearing a baseball cap and gray shorts.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["skateboard", "chair", "person", "bottle"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.25}}, {"image_id": 289640, "caption": " In the scene, there are four people sitting at dining tables or chairs at a dining area. One person is holding a red frisbee in their hand, which they are about to throw. Another person is sitting across from them, while two more people are in different parts of the room. The room appears to be a public setting, perhaps at a bar or restaurant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "frisbee", "orange", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 122602, "caption": " The image features a table with an assortment of supplies, including books, scissors, and craft paper. The table is filled with various items, such as a glue bottle and multiple paper cutouts. There are two books placed on the table, one closer to the left side and another closer to the center.", "mscoco_hallucinated_words": [["book", "book"], ["book", "book"]], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [12, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 24104, "caption": "The image showcases a desk with multiple electronic devices arranged in front of a wall. On the desk, there are two computer monitors placed side by side, one of which is a 20-inch monitor and the other one is a 14-inch monitor. In addition, a mouse is situated close to the left monitor.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 183709, "caption": " The image features a busy intersection with numerous people standing at various locations. Some of the people are walking around, while others appear to be waiting or hanging out. There is a large crowd of people gathered in the middle of the street, and a few individuals are on the sidewalk near the side of the street. A traffic light is visible in the middle of the intersection, and two more traffic lights are positioned further away from the center.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 59710, "caption": " The image depicts a small refrigerator with a black color and two doors. One of the doors is completely open, revealing the interior of the refrigerator, while the other door is closed. Inside the fridge, there are a few items visible, including a few glasses on a shelf and a couple of cups on the lower part of the refrigerator. A mirror is also placed close by to the left of the refrigerator, creating a reflection of the inside of the appliance.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["chair", "refrigerator", "dining table"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": " The scene features a young boy in a baseball uniform, holding a baseball ball, dressed in green and white colors, and wearing a baseball glove. He is on the field, possibly playing baseball or practicing for a game. In the background, there are a few other players on the field, with one player close to the right side of the image and two others scattered across the scene, some near the right side of the boy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 314729, "caption": "The image depicts a black and white baseball scene with an old baseball player holding a bat, standing in a ready-to-hit stance. The player is dressed in vintage clothing, and there is a ball on the ground, suggesting that he may be practicing for a game or an upcoming competition. The overall composition of the image gives it an antique, vintage feel.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": "According to the picture, there is a computer in the workplace that has a keyboard, mouse, and a TV monitor on a desk. The TV monitor is placed in the center of the table, while the keyboard and mouse are positioned closer to the left side. In the background, there are a few other computer carts, creating a typical office layout.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 411564, "caption": "According to the picture, a man is sitting in a window, looking at his cell phone. He is surrounded by various objects, including a refrigerator, several bottles, and a table or bench. ", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["refrigerator", "dining table", "bottle", "cell phone", "chair", "person"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8333333333333334}}, {"image_id": 320350, "caption": " According to the image, there is a group of three people standing together on snow skis in a large snow-covered field. The three people appear to be skiing with two dogs, as there are two dogs clearly visible in the scene. One of the dogs is positioned directly in front of the two skiers, while the other dog is located further back, possibly leading them. The people and their dogs are enjoying their skiing experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": "The image features a baseball game happening on a field. Several people are present in the scene, with some playing baseball, while others serve as the catcher and umpire. The game appears to be in action as a batter is holding a baseball bat, standing on the home plate, and getting ready to swing at an incoming pitch. The baseball players are wearing matching baseball uniforms and are focused on the sport.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bench", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 285302, "caption": "A little boy is seated at a dining table, sitting in a chair and actively working on a craft project. The table is set with various materials like paper and scissors. The boy is surrounded by several pieces of artwork displayed on the table, including one with a large elephant. There are also multiple cups and a bottle of water placed on the table, along with a knife and a fork.", "mscoco_hallucinated_words": [["elephant", "elephant"], ["cup", "cup"], ["bottle", "bottle"], ["knife", "knife"], ["fork", "fork"]], "mscoco_gt_words": ["person", "chair", "scissors", "dining table"], "hallucination_idxs": [53, 59, 62, 73, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.38461538461538464, "Recall": 1.0}}, {"image_id": 61959, "caption": "The image captures a city street scene, showcasing a white, double-decker bus making its way down the road. There are several cars around it, parked in various positions, such as one in front of the bus and others farther back on the street. A traffic signal is also visible near the bus, indicating the potential presence of a traffic light.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 229659, "caption": " The image depicts a laptop computer sitting on a counter in a restaurant, which is open for business. In the vicinity of the laptop, several chairs are arranged around two tables, some more closely around the counter, others scattered in the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 473118, "caption": "The image shows a man on a skateboard riding a ramp, with a blue background. The skateboarder is in the middle of a trick, possibly doing a grind. The ramp extends across the scene, with the skateboarder positioned right above the middle of the ramp. There is another person visible in the background, likely watching the skateboarder perform the trick.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 391222, "caption": "In the image, a group of three people, bundled up in coats and hats, is standing in the snow. They are wearing skis and they are looking at a sign displayed on the snow. The sign appears to have some information on it, such as directions or an attraction. The people in the image are standing near a trail path, possibly checking the display before proceeding with their skiing adventure.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416104, "caption": "According to the picture, there is a group of people sitting at outdoor tables while others walk by. A large number of tables with blue chairs and umbrellas are set up, providing shade for those enjoying their meals. Several bottles are visible at different locations in the scene, indicating that people are enjoying beverages.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 511852, "caption": "In the image, there are two skiers descending a snowy hill, with the main focus being on the person on the right, skiing down the slope with skis on their backpacks. The person on the left is skiing with their skis on their backpacks, too, but slightly further away from the right skier. Both skiers are wearing backpacks with their skis attached.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 59386, "caption": "In the image, a group of giraffes stands together in a field eating from a tree, which is located near several other trees. There are several giraffes present, with some of them closer to the tree while others are further away. One giraffe appears to be standing to the left of the group, while another is situated at the bottom of the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "According to the picture, there is a street scene with traffic lights and a few cars on the road. The image captures a street corner with a traffic light mounted on a pole in the center, surrounded by a few vehicles. The traffic light appears to be green and has a glow-in-the-dark emblem on it. Cars can be seen at various positions around the intersection, including one towards the left, two towards the right, and a third one in the middle. The traffic lights and cars create a sense of activity and movement in the urban scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": "In the image, a man is riding a bike accompanied by his dog, while they are crossing a street. They are both wearing backpacks and appear to be on their way somewhere. The street is filled with various traffic lights and signals, creating a sense of activity in the scene. There are four traffic lights in the image, three on the left side and one on the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "dog", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 265628, "caption": " The image showcases a toilet, with its bowl sitting atop a sink. A person is standing in front of the toilet, wearing shoes. The toilet and sink are located in a room, which has tiled flooring. This scene is viewed from a bird's eye view, with the person and toilet in the center, and the sink on the left. Several other people can be seen in the room as well, with one standing in the background, and another on the right side of the image.", "mscoco_hallucinated_words": [["sink", "sink"], ["sink", "sink"], ["bird", "bird"], ["sink", "sink"]], "mscoco_gt_words": ["bowl", "person", "toilet"], "hallucination_idxs": [12, 30, 48, 64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 499594, "caption": "In the image, a piece of meat is served on a plate along with a variety of vegetables, including broccoli. The vegetables can be found in different locations on the plate, such as towards the right and bottom of the plate. The plate also includes a side dish of pasta or rice.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bottle", "fork", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 5934, "caption": " The image showcases a baseball glove and a baseball inside it, sitting on top of a dining table or bench. The baseball is placed inside the glove, appearing to be relatively small in size compared to the size of the glove itself. The glove and the baseball are both prominent in the image, taking up most of the space. ", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "The image depicts a river with a variety of blue lights, likely illuminating the bridge over the water. There is a water boat visible at the left side of the river, underneath the bridge. Several people are walking on a sidewalk along the river, with some standing around or closer to the boat. They appear to be enjoying the ambiance and view of the night lights.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": " In the image, there are several dogs present in a grassy field, some of which are playing with a frisbee. One of the dogs has a frisbee in its mouth, while another is close to it in the left corner of the image. A few more dogs are visible in the background, positioned variously across the field. The grassy field seems to be a fun place for these dogs to play and socialize.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bench", "dog", "frisbee", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 255483, "caption": "This image shows a person sitting at a table with a lot of paper in front of them. They are holding a stapler or some other office object in their hand, possibly to read the documents on the table. There are also multiple sheets of paper on the table in front of the person.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "person", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 131138, "caption": "The image shows a desk with various computer-related items, highlighting a laptop and computer monitor on top. There are also a keyboard, a cup, and a cell phone on the desk. The laptop and computer monitor are side by side, providing a workspace. ", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "spoon", "laptop", "potted plant", "mouse", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.5555555555555556}}, {"image_id": 258209, "caption": "According to the image, a man is sitting on a bench in front of a building. He appears to be resting or waiting for something. There are two benches in the scene \u2013 one near the man and the other one a little further away, towards the right side of the image. A cell phone is also visible in the image, which the man is likely using while sitting on the bench. ", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["bicycle", "book", "bench", "person"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 347453, "caption": "The image captures a busy city street with multiple buses and a couple of red double-decker buses parked and driving on the street. There are several vehicles, including a truck on the left, a car in the middle, and another car on the right. In addition, a traffic light can be seen in the center of the street.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 285820, "caption": "According to the picture, the image shows a living room with a couch, featuring a white, black, and brown interior design. There is a kitchen in the room with a table and a dining room table, and the couch is facing towards the dining table. There are a few bottles and cups placed around the room, and a cup is situated in the kitchen. A few people can be seen in the living room, including one in the foreground, two in the middle, and a third in the background.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["cup", "cup"], ["person", "person"]], "mscoco_gt_words": ["bench", "tv", "dining table", "bowl", "cell phone", "book", "couch"], "hallucination_idxs": [56, 58, 66, 75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.2857142857142857}}, {"image_id": 245664, "caption": "The image features a desk with several electronic devices, including a laptop sitting on the right side, a computer monitor and keyboard placed near the laptop, and a mouse on the left side of the desk. The laptop is closed, and the computer monitor is partially visible, showing its screen. Additionally, there is a cell phone on the desk towards the center.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "wine glass"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07692307692307693, "Recall": 0.625}}, {"image_id": 453731, "caption": "According to the image, there is a sleeping dog resting on a wooden floor in front of an apothecary. The dog appears to be sleeping and is surrounded by various bottles, likely containing wine. There is another dog near the background of the image, which adds to the overall atmosphere of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": "A group of people is gathered in a living room playing a video game. There are four people in the scene: one standing on the left, another standing slightly behind him, one in the middle, and another standing on the right. Each person is holding a remote controller as they play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "refrigerator", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 253262, "caption": "In the image, a small white propeller plane is parked on the airport tarmac, and two people are getting off of it, likely carrying their luggage with them. One person is standing next to the plane on the right side, while the other person is standing slightly closer to the middle of the tarmac.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "suitcase", "tie", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 87052, "caption": "The image depicts a kitchen area, with three people standing in it. They appear to be two adults and a child, possibly a family or a group of friends. The people are standing in various positions and are wearing different clothes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "oven", "refrigerator", "bowl", "bottle", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 357578, "caption": "There is an image on a city street, featuring a fire truck driving past several cars parked next to them. The fire truck is parked on the right side of the street, with the fire engine and lights prominent on its rear side. The cars are lined up in a row, parked on the left side of the street. They include one white car, two black cars, and another car with a light green color.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " There is a living room in the image with a large sofa and a couple of chairs. One person, possibly a woman, can be seen sitting on the sofa. The room has a lamp, which adds some light, and several books on shelves, including two stacked on top of each other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "cell phone", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 259345, "caption": "According to the picture, there is a bearded man sitting on a bench, possibly outdoors, with a pink umbrella held over his head. The man is dressed in a colorful, flowing robe.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 119709, "caption": "According to the picture, the scene features a woman dressed in an apron, standing in front of a table covered with donuts. The donuts are dispersed in different sections of the table, and there are several pieces of mats on the table as well. A red dining table is visible in the background, and there are two chairs near the table, one towards the left and another towards the right side. The scene seems to take place in a kitchen or dining area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "donut", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 443299, "caption": "In the image, two people are sitting on a white bench next to a wall. One of them appears to be a man, wearing a jacket, while the other is a woman. They are both sitting in a relaxed position, looking at their cell phones.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": " In the image, there is a large outdoor sculpture in the shape of an 'S' that is being viewed by a group of people standing on various outdoor surfaces. The sculpture has several umbrellas positioned around it to provide shade. Some of the umbrellas are small, while others are larger in size, providing protection from the sun or rain.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": " The image features a tennis court surrounded by a chain-link fence, and a man in a white and blue shirt is holding a tennis racket, preparing to hit a tennis ball. He appears to be in the middle of a tennis match. Two other individuals can be seen in the background, one near the back of the court and the other closer to the right side. They might be part of a doubles team or just observing the game. The tennis ball is on the left side of the court, closer to the man swinging the racket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "The image features a city street with a van parked near the curb and a woman standing on the sidewalk next to it. She appears to be looking at her phone while holding it with one hand. A few cars are present in the scene, with one located further ahead of the van and two cars closer to the woman. The scene captures the bustling nature of a city street and the woman's casual stance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "handbag", "traffic light", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 276719, "caption": "A small dog is running around an intersection near two parked cars. The dog appears to be enjoying the space and its surroundings. There are several people present in the scene, including two women and four men, with one of them standing near a car's hood. The individuals seem to be engaging with each other and the dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "handbag", "motorcycle", "dog", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": "The scene depicts a horse pulling a snow plow while traveling down a snow-covered road. There are multiple people around the horse and a snowman in the background. One of the people is standing next to the horse, while the others are located on the left side and further in the background. A car is parked near the left edge of the image, with another vehicle on the right side of the picture. Two benches are positioned at the side of the road, near the people and the horse.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["horse", "snowboard", "car", "skis", "person"], "hallucination_idxs": [80], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 354559, "caption": "The image shows a woman dressed in a black and white stripped bustier with long hair, and she is posing with a stuffed animal. The teddy bear is quite large, covering much of her upper body. The woman is standing close to a wall and appears to be smiling. There is a single handbag visible in the scene, situated to the right side of the woman.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": "The image features two pieces of cake on a dining table, with a knife cutting one of them. There is another knife located close to the knife used to cut the cake. Two people are present in the scene, one sitting closer to the camera and the other at a distance. They both appear to be engaged in the act of preparing or consuming the cake.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 468005, "caption": " The image features an elephant walking on a cement path near the water, with many people standing around the animal, likely enjoying the scene or waiting for a ride. The elephant is walking near the shoreline, with its massive tusks out, attracting the attention of the people. There are also several cars in the scene, likely used as transportation for some of the people in the area.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["backpack", "person", "elephant"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 545594, "caption": "The image captures a group of people sitting on a bus. Several passengers can be seen, including a young girl with colorful hair in the center. The bus has multiple seating areas with two benches and a chair onboard, providing spots for the passengers to sit. There are also a couple of handbags visible in the scene, and one is located towards the right side of the image, while the other is closer to the middle.", "mscoco_hallucinated_words": [["bench", "bench"], ["handbag", "handbag"]], "mscoco_gt_words": ["backpack", "chair", "orange", "person"], "hallucination_idxs": [37, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 219546, "caption": "A large, elaborate dining table is filled with an array of different foods, including meats, cheese, and vegetables. There are many bowls of food placed across the table, some containing meat, others vegetables, and still others with cheese. In addition, there are a few knives scattered across the table, likely used to slice and portion the food. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "spoon", "bottle", "fork", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 386645, "caption": "In the image, a truck has toppled over on the roof of a building, potentially causing some damage. The toppled truck is in close proximity to the building, likely creating a dangerous situation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 155131, "caption": "In the image, a woman dressed in white is jumping high into the air, arms fully extended, as she tries to hit a ball flying above her. The scene takes place on a tennis court. Numerous people are visible around her, possibly spectators or other tennis players waiting for their turn. The woman seems to be the center of attention. A chair is visible in the background, possibly placed for a coach or other personnel to watch the action.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["bench", "tie", "tennis racket", "sports ball", "person"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 188752, "caption": "According to the picture, a person has hanging a few bananas from a string above a fruit stand. The bananas are arranged neatly and seem to be a popular item for display in the store.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": " The image features a busy dining area with many people sitting around a table and a long counter. Some people are sitting near the counter while others occupy tables, where they are enjoying their meal. Many umbrellas are positioned around the room, providing shade from the sunlight coming through the large windows.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "umbrella", "dining table", "bowl", "tie", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 437720, "caption": "According to the picture, there is a nude woman sitting at a desk in front of a red chair and a red table. The woman is sitting and reading a book, and there are several books and a vase displayed in the room. The image is a painting or an art piece featuring a nude woman reading a book. ", "mscoco_hallucinated_words": [["book", "book"], ["book", "book"], ["book", "book"]], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [32, 38, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.8}}, {"image_id": 378831, "caption": " The image depicts a woman walking on the street, carrying two red shopping bags. She appears to be walking past a fire hydrant and close to a fence. There is a car on the left side of the scene and some potted plants located further down the street. A handbag is also visible, situated near the center of the scene.", "mscoco_hallucinated_words": [["car", "car"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["fire hydrant", "handbag", "person"], "hallucination_idxs": [34, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 375503, "caption": "The image captures an upstairs view of a room featuring a wooden staircase, a small clock, and a desk or table near the top of the stairs. The room is ornate with paneled walls, carpets, and a large chandelier hanging from the ceiling. There is a clock near the bottom of the stairs, and the wooden stairs are the central feature of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 330369, "caption": "According to the picture, there is a large red and white boat on the water. It appears to be heading in the direction of the shore or a dock. The boat's name, \"Edwin W,\" is prominently displayed on the front of the vessel. It seems to be a centerpiece of attention on the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, a man is standing on a tree stump with a frisbee thrown near him. He is also holding a bicycle nearby. There are a few additional people in the scene, including one who appears to be walking towards the man on the tree stump, another standing nearby, and a third one further away. The people are dispersed across the area, some closer to the tree stump and others at the edge of the scene. The overall atmosphere appears to be relaxed and casual.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 32190, "caption": "According to the picture, it shows a small toy horse placed in the middle of a street in front of a parked car. There are several other cars in the scene, with some closer to the side of the road and others further ahead, and a few people visible in the scene, some standing further away from the horse and some closer to it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 535403, "caption": "This image displays a small bathroom with a sink and mirrors. There are two sinks located near the middle of the room, one on the left and another one on the right. A mirror is placed above each sink. In the left corner of the room, there is a long haired brush sitting on the counter top, while a few bottles are scattered around the space. The bathroom also has some decorative artwork hanging on the wall. Overall, the scene features a functional and aesthetically pleasing bathroom, with mirrors reflecting the light and adding to the room's size.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "potted plant", "bottle", "toilet", "sink", "hair drier"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 449798, "caption": "The image displays a desk with multiple items placed on top of it. A keyboard, a mouse, and a computer are all present, suggesting it to be a computer workspace. The keyboard and mouse are situated near the center of the table, while the computer is found on the left side. The table is adorned with a multitude of figurines, including some smaller ones on the right side, creating a fun, decorative touch to the office space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "dining table", "laptop", "mouse", "cell phone", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375769, "caption": "According to the picture, the scene features a small child wearing a helmet eating at a table in a restaurant or cafe setting. There are several chairs surrounding the table, one placed closer to the child and others further away. A few bicycles can be seen in the background, adding to the ambiance of the outdoor setting. In addition to the chairs, there are also bottles located throughout the scene, with one placed near the child's hands. Overall, it appears to be a warm and welcoming atmosphere.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bicycle", "cup", "umbrella", "dining table", "hot dog", "car", "chair", "person"], "hallucination_idxs": [72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5}}, {"image_id": 426172, "caption": " The image features a table display with several items in a store. There are two different dishes on the table, one in the center of the display and the other towards the right side. They appear to be positioned symmetrically. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bottle", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 20650, "caption": "The image features a tasty, nutritious meal with a sandwich and plenty of broccoli on the plate. The sandwich is a delicious meal in itself, but the accompanying broccoli provides a nutritious and flavorful side dish. The meal appears to be served with a side of pasta as there's a fork and spoon placed nearby on the dining table. The fork is closer to the top of the scene, while the spoon is in the middle.", "mscoco_hallucinated_words": [["spoon", "spoon"], ["spoon", "spoon"]], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "broccoli", "orange"], "hallucination_idxs": [57, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 138078, "caption": " An outdoor baseball scene is depicted in the image, with three baseball players present in the field. Two of the players are wearing baseball gloves, and one of them is wearing red-colored clothing. The players seem to be aware of each other and are actively involved in the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 152499, "caption": "The image depicts a family swimming pool scene, with a group of people around a pool, including children, having fun in the water. There are four individuals, one adult and three kids, participating in the event. The pool appears to have several water toys, such as floating boards, on which some of the children are sitting. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "umbrella", "dining table", "potted plant", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 180764, "caption": " According to the picture, a young man is eating a pink frosted donut in a kitchen setting. The man is in the center of the scene, and there is another person in the background, presumably the cook or a companion. There are a few doughnuts visible in the scene, including the one the man is eating, another one nearby, and another one in the right corner.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "donut", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 147762, "caption": " The image features a piece of food on a plate, topped with a yellow substance. It appears to be an artistic dessert with a yellow and brown color scheme, with a mix of vanilla ice cream and other ingredients. The plate sits on a table or counter, inviting someone to enjoy the dessert.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "cake", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 113860, "caption": "The scene features a boat on water, moving down a river. This boat is carrying people who are riding it and enjoying their experience on the water. They are likely taking advantage of the beautiful day to enjoy their exciting ride. Several trees can be observed along the river, adding a natural touch to the environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460866, "caption": "The image features a large number of toothbrushes stuck in a plastic container, forming a congested mass. The majority of the toothbrushes are visible on the left side, while a smaller group can be seen in the middle. The brushes of different sizes are arranged in a variety of orientations, creating a dynamic and busy scene. The total count of toothbrushes in the image is approximately 30.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "dining table", "bowl", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 177015, "caption": "According to the picture, a man is sitting in a room, holding a cat on his lap as he uses a laptop computer. The cat is sitting near the man's chest while the laptop is placed in front of them. The man seems focused on using the laptop and is staring at it. Additionally, there are other objects in the room, such as a bottle placed close to the cat, two chairs situated further away from the main scene, and a cell phone lying on a nearby surface.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["refrigerator", "laptop", "cat", "apple", "chair", "person", "couch"], "hallucination_idxs": [72, 91], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5714285714285714}}, {"image_id": 333998, "caption": "The image shows a well-furnished kitchen with various appliances and equipment. At the center of the room, there is a refrigerator located in front of the main refrigerator, as well as an oven situated to the right. There is a microwave and another oven near the refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "suitcase", "oven", "refrigerator", "bowl", "bottle", "sink", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 47131, "caption": "The image shows a train derailed on the tracks, with several men standing on and around the car. The men appear to be trying to help their fellow train passengers by standing and looking at the derailed train. The picture is black and white, and the train appears to be old. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425447, "caption": "A white pickup truck is driving down a side street, with a sprinkler system on top of the truck spraying the back of the vehicle. A man is standing near the truck, possibly adjusting the sprinkler system or ensuring the operation is working. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "fire hydrant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295105, "caption": "The image portrays a black and white picture of a kitchen with a man standing in the middle of it. He is wearing a black shirt and appears to be enjoying a meal in the kitchen, possibly preparing or eating food. The kitchen is equipped with a refrigerator, which is placed to the right in the scene. There are two ovens in the room, with one positioned to the left and the other on the far right side. A bottle can also be seen in the scene, sitting on the countertop. Multiple knives are spread around the kitchen, likely for use in preparing and serving food.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["oven", "refrigerator", "bowl", "bottle", "apple", "sink", "person"], "hallucination_idxs": [101], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5714285714285714}}, {"image_id": 443259, "caption": "The image shows a very clean kitchen with two sinks and two refrigerators. The kitchen is spacious, featuring a large oven and a microwave placed on opposite ends of the room. The sinks are found near the microwave, as well as close to the oven. There are also two refrigerators, one of which can be seen more towards the left side of the room. ", "mscoco_hallucinated_words": [["microwave", "microwave"], ["microwave", "microwave"]], "mscoco_gt_words": ["oven", "refrigerator", "dining table", "bowl", "potted plant", "bottle", "orange", "sink", "chair"], "hallucination_idxs": [25, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 537861, "caption": "There are two trains on the train tracks in the image, one of which has an odd number of cars. The trains appear to be in motion as they are moving along the railroad track. One train is a combination of yellow and black cars, while the other train has multiple blue and gray cars. The tracks extend towards the edge of the image on both sides.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"], ["car", "car"]], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [20, 47, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "The image features a Disney character mascot standing in front of a large group of people at a street party. There are several people around the mascot, some closer to it and others further away. The people are of various sizes, likely positioned differently or standing in different locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bird", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 249969, "caption": " The image depicts a white plate filled with assorted pieces of food, including a meat, vegetables such as broccoli and carrots, and bread. The meat appears to be sliced, while the vegetables are sliced as well. In addition, there are multiple forks arranged around the plate. A bottle is also present, possibly containing a beverage or sauce. The dining table is set for a meal or gathering, providing a suitable place for enjoying the food on the plate.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot", "bird"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 146504, "caption": "The image features a person's hand with a mouse placed on top of a keyboard, indicating that they are typing with the mouse. The mouse is positioned in the center of the image, while the keyboard is located to the right of the hand. There is another keyboard nearby, which is positioned near the upper left corner of the scene. This scene captures a common way of typing using a mouse and keyboard, which can be seen in offices, homes, or other settings.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 40037, "caption": "The image features a grey building with a clock tower, creating an interesting scene with its unique architecture. The clock tower is located near the top of the building, providing an impressive view for the viewer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 343218, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 301837, "caption": "It is a clean, sleek and modern kitchen featuring stainless steel appliances, a sink and an oven. The kitchen is well-lit with a skylight, making it a bright and airy space. A refrigerator and microwave are located towards the left side of the image, with a bottle placed closer to the back wall. Toward the center, there is an oven and a sink for easy cleaning. A hand dryer is situated near the oven, and a water glass is placed near the sink. A chair can also be seen in the room, likely for someone to sit and relax.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["clock", "oven", "refrigerator", "apple", "sink"], "hallucination_idxs": [39, 52, 95], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.6}}, {"image_id": 309071, "caption": " The image features an elderly man wearing a button-up shirt, reaching to open the door on a parked bicycle. The bicycle is parked on the sidewalk near the curb. There is another person in the scene, further behind the bicycle, but their face is not visible due to the man's position in front of the bike. The overall scene captures a casual moment of an older gentleman tending to his bike.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "potted plant", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 240681, "caption": "A man is standing in the middle of the street with a backpack on, leaning over to use his cellphone. There is a red motorcycle parked nearby, and a handbag is visible on the ground in the scene. In addition to the man, there are other people in the area, such as two individuals standing closer to the motorcycle, and a person further down the street. The scene appears to be set on a quiet street.", "mscoco_hallucinated_words": [["cellphone", "cell phone"], ["handbag", "handbag"]], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [20, 32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.75}}, {"image_id": 178753, "caption": "In the image, a young girl is seated in an orange chair in a room, with a cellphone to her right. She is smiling and laughing while wearing braces, indicating that she is enjoying her phone conversation. The chair she is sitting in appears to be in the center of the room, surrounded by a few other chairs nearby. The background features a set of steps near the right side of the image.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [11], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 278890, "caption": "According to the picture, a person is holding their cell phone in their hand while sitting in a crowded area, likely in a restaurant or public place. They appear to be pointing their cell phone at someone while another person is also present in the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": "There is a bedroom with a large red bed situated on the floor in the image. There are also two desks, one to the left and one to the right, and a few more tables with a bottle on one of them. There is also a chair situated towards the center of the room. The bedroom has a tiled floor and walls, with a ceiling light illuminating the scene.", "mscoco_hallucinated_words": [["desk", "dining table"], ["table", "dining table"], ["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["tv", "vase", "book", "bed", "person"], "hallucination_idxs": [21, 37, 40, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.8, "Recall": 0.2}}, {"image_id": 542792, "caption": "There is a very large wooden baseball bat positioned near the end of a building in a city setting. It appears to be prominently placed and stands out in the urban environment. The street in the scene is bustling with a mix of vehicles such as cars and a truck, creating a typical city scene. There are also several people visible in the area, but they are scattered around, with some closer to the building and others a bit further away. Overall, it is a unique and attention-catching sight in the city.", "mscoco_hallucinated_words": [["car", "car"], ["truck", "truck"]], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [48, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 293703, "caption": "The image features three people, a woman and two men, sitting at a table in a restaurant. There are pizzas placed on the table as well, which is their main meal. Surrounding the people, there are some cups and bottles placed on the table. A handbag and a cell phone are also visible. The restaurant setting and the pizza give a sense of casual dining.", "mscoco_hallucinated_words": [["handbag", "handbag"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "chair", "person"], "hallucination_idxs": [52, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.5555555555555556}}, {"image_id": 318777, "caption": " The image is a live baseball game taking place in a stadium. There are several people sitting in the stands around the playing field, observing the game. There are 10 baseball players on the field, spread across the outfield, infield, and at the catcher position.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 14773, "caption": " In the image, there is a plate full of hot dogs on a table, and a few more on the table's surface. There is also a drink in the center of the table, possibly a soda. The food and drink are arranged in an entertaining, car-themed style. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "fork", "hot dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 521863, "caption": "According to the picture, a person is riding a skateboard while navigating through a crowd of people. There are several other people scattered throughout the scene, with some walking in the background and others engaged in various activities. The majority of the individuals appear to be standing still, possibly observing the skateboarder or enjoying the atmosphere in the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": " In the scene, there are several people riding motorcycles, with two motorcycles in the foreground and two in the background. There is a person sitting in a truck in the right side of the image. There are also four other people visible, two on motorcycles in the middle of the scene and two others on motorcycles in the background. The street appears to be a city street.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["handbag", "motorcycle", "cell phone", "car", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4}}, {"image_id": 211604, "caption": " In the image, a young man is lying on his back on the floor. He is wearing a white top and blue shorts. The tennis racket, located just to the right of the young man, is held close to him. Several other tennis rackets are scattered throughout the image, including one near the young man's feet, and a few more on the opposite side of the image. The overall scene appears to be a relaxed and casual setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "This is a food-related image featuring meat and a drink. The meat, likely chicken, is pictured next to a beer or another beverage, likely an alcoholic drink. There are also vegetables, such as broccoli or cucumber, and a dip or sauce to complement the dish. The image is divided into two sections, a close-up and a broader view, creating a comparison between the different elements.", "mscoco_hallucinated_words": [["broccoli", "broccoli"]], "mscoco_gt_words": ["bowl", "cup", "dining table", "bottle"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 563938, "caption": "The scene depicts a man and a woman sitting on a couch together, with the man wearing sunglasses, a woman on the right and another woman on the left. There is a bottle in the scene, placed on a table located to the left of the couch.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bench", "tie", "bottle", "person", "couch"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 461311, "caption": " The image depicts a large group of people gathered in a park or field on a sunny day. They are standing and walking around, spread throughout the area, and flying various kites at different heights, indicating that the park or field is a popular spot for kite flying.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": " The image features a person lying on a bed, their legs resting on a book, and their feet occupying a significant portion of the scene. The person's legs are positioned on their sides, with one foot on the left side and the other on the right side. The book is on the left side, while the person's feet are at the bottom of the image. There is a computer visible in the background, positioned towards the center of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "laptop", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261318, "caption": " The image features a man sitting on a bench, holding a cell phone to his ear as he talks. He is in a outdoor setting, likely surrounded by buildings, and there are several pieces of luggage near the bench he is sitting on.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "bench", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 134768, "caption": " The image depicts a computer keyboard with a set of speakers in the background. The keyboard appears to be an electronic keyboard, with the buttons located at various positions. The speakers are positioned off to the side, near the bottom of the image. The overall setup suggests a computer setup in a home or office setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "laptop", "mouse", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 526414, "caption": "The image shows a living room with a long table and chairs in the middle, and a black cat is sitting on top of the table. There are various items placed around the room, such as a cupboard towards the back of the scene and a few books, including two on the tabletop.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["remote", "cup", "tv", "dining table", "bowl", "laptop", "cat", "book", "scissors", "couch"], "hallucination_idxs": [11], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3}}, {"image_id": 273132, "caption": " The image depicts an older woman sitting on a park bench, with a hand touching her face. The bench is surrounded by various handbags, which can be found scattered around the scene. There are a total of eight handbags in the image, with one placed close to the woman's left side, two others around the middle of the scene, one on the right side, two more close to the right edge, and another near the bottom right corner.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 505213, "caption": " In the image, there is a man taking a large hot dog out of a bun while posing for the camera. The man appears to be laughing, and the hot dog is clearly visible as he is holding it up to show it to the camera. The hot dog is served in a bun, and it seems like a meal or a snack for the man. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": "According to the picture, it appears to be a baseball field where a man in a baseball uniform is shown standing in the grass. He is wearing a baseball glove and waving his hand, indicating he is ready to throw the ball. Several other baseball balls can be seen in the background, possibly representing the field as a whole or being thrown in various locations. The man is likely trying to make a throw to someone or in preparation for catching a ball during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 272728, "caption": " In the image, there is a beautiful vase filled with a variety of pink, yellow, and purple flowers on a table. The vase is placed near the center of the table, and it occupies a significant portion of the table's surface. The flowers appear to be a mix of different types, creating an attractive arrangement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 509811, "caption": "In the image, a person is standing in front of a large number of open laptops arranged in rows of dining tables in a large lecture room, creating a scene where someone has set up a presentation and is addressing an audience of students and computers. There is a projector, placed above a chair, possibly being used during the presentation. The lecture room is filled with books and numerous chairs, some of which are placed together in groups of four.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "apple", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 412407, "caption": " The image shows a person holding a flip phone in their hand, which has an open screen displaying various lines. In the background, several other people are present, likely members of a crowd or event. Some of these individuals are holding other flip phones or other types of cell phones. One person is standing closer to the foreground, while others are scattered throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "In the image, there is a group of people, including the woman holding the umbrella, shopping in a farmer's market area. A person can be seen wearing a handbag, while two others appear to be standing closer to the back of the scene. Umbrellas are also present in the scene, providing shelter from the rain for some of the shoppers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 441363, "caption": "The image captures a man in a red shirt and white shorts playing tennis on a court. He is mid-air, trying to hit the tennis ball with his tennis racket, which is also shown in the scene. Several other people can be seen around the court, either watching the match or waiting for their turn. Some of the spectators are positioned close to the left side of the court, while others are more towards the right side. The atmosphere in the scene is lively and focused on the exciting game of tennis.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 331084, "caption": "In the image, there are four people at a dining table, where they are making pizzas. The table is filled with various ingredients and kitchen tools, including carrots, tomatoes, a knife, forks, and a bottle. The people appear to be working together and focused on preparing the meal.", "mscoco_hallucinated_words": [["fork", "fork"], ["bottle", "bottle"]], "mscoco_gt_words": ["pizza", "knife", "dining table", "bowl", "spoon", "carrot", "person"], "hallucination_idxs": [38, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.7142857142857143}}, {"image_id": 282553, "caption": "According to the picture, two women are walking together on a dirt path, with one of them holding a cell phone. The women are positioned on either side of the path. One woman is closer to the right side of the image, while the other is closer to the left side. The dirt path appears to be leading to a field or a park, which is in the distance. There are a few cars parked in the vicinity, but they are not a major part of the scene.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 328818, "caption": "According to the picture, a woman in a pink shirt is bending over to tie her shoe to the bench. There are also a few other objects in the scene, including a bicycle situated to the left and a handbag near the center of the image. The bench is prominently positioned in the scene, with the woman bending down towards it while holding her shoe.", "mscoco_hallucinated_words": [["tie", "tie"], ["handbag", "handbag"]], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [15, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.75}}, {"image_id": 3103, "caption": " The image depicts a man skateboarding down a steep ramp, possibly practicing or performing a trick. He is wearing knee pads for protection while in motion. Another person can be seen in the scene, but it's unclear if they are a spectator or a skateboarder as well. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 188136, "caption": "The image depicts a large and open room filled with various items, including apples and an oven. A refrigerator is positioned near the center of the room, while a couple of pitchers are found on different parts of the countertop. ", "mscoco_hallucinated_words": [["apple", "apple"], ["oven", "oven"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["dining table", "bowl", "orange", "sink", "carrot", "person"], "hallucination_idxs": [14, 17, 20], "metrics": {"CHAIRs": 1, "CHAIRi": 0.75, "Recall": 0.16666666666666666}}, {"image_id": 449675, "caption": " The image shows a plate on a table, featuring a sandwich and a salad. There's a fork also on the table, close to the plate. A knife is visible on the table, likely used for cutting the sandwich and salad. It appears to be a sandwich with a few different vegetables and a salad on the same plate. The fork and knife create a scene that suggests it's time to dine.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "spoon", "bottle", "sandwich", "fork", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 191691, "caption": "According to the picture, there is a man standing in a bathroom, shaving his face. He appears to be using a toothbrush to do so. The bathroom is also equipped with a sink and a toilet, both located near the man. In the background, there are several bottles and a tissue box, likely used for personal hygiene purposes. Overall, the scene suggests a routine bathroom experience for the man.", "mscoco_hallucinated_words": [["sink", "sink"], ["toilet", "toilet"]], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [36, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 89378, "caption": "The image depicts a man standing near a red motorcycle parked next to a brick building. There are a couple of garages, one located on the left and the other one on the right side of the image. The motorcycle is positioned diagonally between the garages, with one garage partially visible at the left corner of the image and the other garage partially visible at the right corner of the image. The man is standing next to the motorcycle, almost touching it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 99874, "caption": "In the image, a woman is seated at a table in a restaurant or dining establishment, using her cell phone. She is wearing sunglasses and has long black hair. The scene appears to be quite relaxed, with her casually taking a phone call while eating or dining at the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "bottle", "cell phone", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437895, "caption": "The image is a bedroom scene with a large bed in the center. There's a colorful comforter or blanket draped over the bed, giving it a warm and cozy appearance. The bed has a metal frame, which enhances the vintage vibe of the bedroom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": " The image depicts a man in a suit wearing a yellow tie sitting at a conference table, looking down at a piece of paper in his hands. He appears to be concentrating and examining it. There is another person in the image, possibly in the background or another table in the room. The table features a few books, including a hardcover on the left and two books closer to the center of the table. ", "mscoco_hallucinated_words": [["book", "book"], ["book", "book"]], "mscoco_gt_words": ["knife", "dining table", "tie", "spoon", "fork", "person"], "hallucination_idxs": [62, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.5}}, {"image_id": 159979, "caption": " There is an interesting scene taking place on the water, with multiple people in small boats and under various umbrellas. There are four boats present, two of which have umbrellas over them, while the other two do not have such shade.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 357, "caption": "The image captures a professional baseball game with many players present on the field. The scene shows multiple players holding baseball bats and gloves, both in the dugout and on the field, ready to make plays or contribute to the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 538891, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 561967, "caption": " The image features a large train traveling along the tracks. It appears to be carrying several cars, with several of them being in a line on the train's left side, while others are scattered throughout the train's length. There is a clear line on the train's right side, indicating that it is carrying different types of cars. Some of the cars are smaller, while others are bigger. The train is on the railroad tracks, moving towards the right side of the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565778, "caption": "The image depicts a blue and white passenger train traveling on a railroad track. It has a long body and is in motion, moving from left to right as it passes by a train station. There are people standing around the train, with some closer to the train and others further away. The overall scene highlights a common train-station setting where passengers are waiting or boarding the train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 7125, "caption": " A group of people is enjoying a bike ride on a dirt path, with one man riding his bicycle in the center and others riding their bicycles to the right and left. The dirt path extends along the edge of the scene with water, likely the Potomac River, visible to the left. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "bottle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": "The image depicts a man holding a red and white tie, seemingly preparing to adjust or tighten it. The tie is adorned with various hearts, which can be seen as decorative embellishments. There are also two hand ties to the right and left of the main tie. Another tie is visible in the background, hanging from a tie rack.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "tie", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 280930, "caption": "The scene features a little girl, likely around four or five years old, standing in a kitchen wearing a pink shirt. She is holding a tray with two slices of bread, both plain white. The girl is smiling and wearing a funny expression. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 151081, "caption": "According to the picture, a helicopter is flying over a large field with two other vehicles nearby, such as cars, trucks or construction equipment. The helicopter is positioned behind one of the other vehicles, with both of the others closer to the left side of the field. There is another car visible on the left side of the image, but it is further away from the helicopter and the other vehicles.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"]], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [21, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": " The image features a person in a yellow coat, posing on a tennis court. The person is holding a tennis racket and appears to be playing or practicing. There are several tennis balls scattered throughout the court, with some closer to the person and others at different positions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": " The image shows a plate containing a hot dog and various toppings such as ketchup and mustard. There are also French fries, with several portions placed on the plate. On the side of the plate, there is a bottle with a condiment, likely a dip or sauce, and two cups. ", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["bowl", "hot dog", "broccoli", "dining table"], "hallucination_idxs": [42, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.25}}, {"image_id": 198163, "caption": "In the image, a man is sitting behind a window, likely at work in a studio or a workshop. He is surrounded by countless bowls and jars, which are placed in an assortment throughout the room. The variety of cups and jars can be found in different sizes and colors, covering the surfaces around the man.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["bowl", "person", "dining table"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 3742, "caption": " The image showcases a vase containing a red rose and a bowl filled with apples placed on a window sill. There are a total of six apples visible in the bowl, with their varying sizes spread throughout the bowl. The rose is positioned towards the right side of the vase, while the apples are situated more in the middle area of the scene. The window sill extends from the right side to the middle section of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "apple", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 217593, "caption": "According to the picture, there is a little boy wearing a baseball uniform standing in a field. He seems to be playing a game, and there is a baseball glove next to him. Additionally, a baseball mitt can be seen in the scene. There are also several other people in the field, some of them possibly playing the game too. A bench is placed on the side of the field, providing a place for players to rest during breaks in play. Overall, it appears to be a casual, friendly baseball game.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["handbag", "chair", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 279078, "caption": " The scene depicts a young boy standing next to a refrigerator, opening its door and pointing to the array of bottles inside. The boy's hand is extended outward towards the refrigerator, indicating his interest in the contents. There are various bottles placed throughout the refrigerator, with some located closer to the door and others in the back. The majority of the bottles are placed on the left side of the refrigerator, creating an array of different beverages.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": "The image captures a moment on a tennis court where two men are playing a game of tennis. They are both holding tennis rackets, with one player holding a tennis racket in his right hand and the other player holding it in his left hand. Another person is visible in the image, wearing a red suit and standing off to the side. This person is likely a court attendant or an umpire, ensuring a fair game. The scene is an active tennis match with people moving around and interacting with each other.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "sports ball", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 477016, "caption": " The image features several people standing in a room, likely a living room. In the center of the room, a little girl is standing in front of an appliance, likely an oven or a refrigerator, with small plastic fish or aquarium inside. ", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["handbag", "oven", "backpack", "potted plant", "sink", "person"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 301266, "caption": " According to the picture, there is a group of people gathered in a kitchen, with several baskets and cups on the table in front of them. They seem to be working on various projects, as some of them are handling kitchen utensils and food items, possibly organizing or preparing them for a meal or event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "book", "fork", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 475813, "caption": "A little girl is standing in a room, possibly playing or exploring her surroundings. She is wearing pyjamas and holding a small remote control in one hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 558784, "caption": "The image features a man standing on the side of a baseball field near a pond. The man is holding a baseball bat, and there is a baseball on the ground next to him. There are a few cars in the parking area nearby, adding to the atmosphere of the outdoor setting.", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"], ["car", "car"]], "mscoco_gt_words": ["backpack", "frisbee", "person"], "hallucination_idxs": [22, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": "The image depicts a person holding two spoons in their hand while standing in front of a sink. The person appears to be in a kitchen or bathroom setting, with the sink being close by. The spoons are placed on either side of the person's body, suggesting they are being held in their hands. The image conveys a sense of cleanliness and hygiene as the person performs their daily tasks involving utensils.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "spoon", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 359164, "caption": "The image shows a pair of dogs, one white and one brown, on a grassy lawn. The white dog is running in the foreground, while the brown one is standing further back. A plastic bag is also visible in the background. There is a bench near the left edge of the scene, and a few chairs and a table can be seen in the distance. The scene captures a playful moment between the two dogs as they run and explore in the garden.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["car", "chair", "dog", "dining table"], "hallucination_idxs": [50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 539951, "caption": "This image features a large kitchen scene with various kitchen appliances. There is a large oven in the middle of the kitchen and multiple sinks and refrigerators on the left side of the kitchen. There is a countertop in the middle of the kitchen, which is black and white. The image depicts a very organized kitchen with a mix of vintage and modern appliances, all placed neatly within the space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "clock", "oven", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 143479, "caption": "According to the picture, there are two men in the scene: one older gentleman in a white shirt and khaki pants, and another, closer to the right side of the image. The older gentleman is holding a tray with a group of hot dogs placed on it. There are several hot dogs visible on the tray, with some located closer to the right side of the scene, while others are more towards the left side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": " In the image, a vase with various colored flowers and leaves is prominently displayed in a room under a mirror. The vase is placed on a table against a wall, and it has a significant number of flowers and leaves in it, creating a visually appealing and festive display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 238065, "caption": "In the image, a person is on a skateboard riding on the side of a cemented block, enjoying their time and performing tricks. They appear to be practicing or honing their skills while cruising on the skateboard. The skateboard is located on the left side of the image, while the person is located in the middle area, both near the top of the skateboard ramp. The setting appears to be an outdoor area with a few trees visible in the background, providing a natural backdrop for the skateboarding experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, a small bathroom has a sink and a toilet in the corner. The toilet is occupied by a book and a bottle of Jack Daniels, possibly promoting Magic Eye, a 3D effect used to create an optical illusion. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "book", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 322163, "caption": "The image captures a scene in a restaurant kitchen where two men are actively preparing food together. One of the men appears to be the main cook, standing in front of the kitchen counter, and the other is situated closer to the oven area. Both individuals are wearing white aprons, suggesting that they are professionally working together to create a meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "pizza", "oven", "dining table", "bowl", "potted plant", "bottle", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 230983, "caption": "In the image, a man is shown riding a skateboard while doing a trick, as he is jumping in the air on the sidewalk. The skateboard he is riding is partially covered by a container that seems to be placed in front of it, possibly creating a dramatic scene.", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "The image depicts a very clean and organized kitchen with a sink, stove, and oven. The sink is located in the center of the kitchen, with a black countertop surrounding it. The stove is on the left side of the kitchen, and the oven can be seen behind it. There are also two bottles on the countertop, one located near the center and the other towards the right side. A black microwave can be seen in the upper left corner of the room, and there is a black refrigerator against the far wall. The overall room is very neat and well organized, with a good distribution of appliances and kitchen tools.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["sink", "microwave", "oven", "refrigerator"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 53431, "caption": "In a scene featuring a man sitting outdoors near a brick building, a man is using a laptop and wearing a pair of black shoes with white ties on them. The laptop is on a small stool or table in front of the man, and there is another small stool located to the left of the main subject. The man is wearing a white bow tie, making him stand out in the image. Apart from the man, there are two bottles nearby, one placed close to the left side of the scene and another located further right in the picture.", "mscoco_hallucinated_words": [["table", "dining table"], ["tie", "tie"], ["bottle", "bottle"]], "mscoco_gt_words": ["laptop", "chair", "person"], "hallucination_idxs": [40, 68, 87], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 441840, "caption": "The image features a field with several people present. In the center of the scene, a little boy holding a baseball bat is swinging at a ball that is mid-air. Surrounding him are various people, with one person in the distance to the left, and another person closer to the left side of the scene. Two more people are near the right edge, with one closer to the background and the other closer to the edge.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["truck", "car", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4}}, {"image_id": 53465, "caption": " The image depicts a cake, or more specifically, a piece of cake that has been decorated with \u201cwelcome to punch\u201d written on it. It's sitting on a table, surrounded by several food and drink items. The cake appears to be covered with white frosting, as the words are written on it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 381031, "caption": "There are several items on a table, including a plate filled with a variety of foods, a couple of bottles, and a cup. The food on the plate includes apples, a handful of oranges, and a couple of donuts. There are at least two bottles present, one closer to the center and another on the left side of the table. A cup is placed near the right side of the table.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "apple", "microwave", "donut"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.7142857142857143}}, {"image_id": 530520, "caption": "In the image, three men are sitting on the grass in front of a building, engaged in an outdoor activity. One of them is using a laptop, probably communicating with someone or doing some work. There is a bottle placed near the group, and another laptop can be seen in the background on the left side. ", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["clock", "motorcycle", "laptop", "backpack", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 539555, "caption": "According to the picture, a man in a white shirt is playing with a frisbee, attempting to throw it in the air and catch it. Several other people can be seen nearby, likely observing the frisbee game or participating in their own activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": " In the image, there is a large silver train parked by a small train. Two people can be seen standing close to the first train, one of them walking away and the other standing in the middle. There is another person closer to the second train, standing next to it. A handbag is visible, located on the right-hand side of the first train, likely belonging to the individual walking away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 560439, "caption": "The image depicts a group of people wearing snowboards engaged in an exciting winter sporting activity. There are three main people in the scene, one at the front, and two others following behind, all wearing snowboards and participating in the fun.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "snowboard", "backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 248353, "caption": "The image captures a group of people standing on skis in the snow, making it a winter snow sport scene. There are four people in total, with two groups of skiers separated by a large distance. The first group includes a skier on the right side, another individual a bit further away, and another person with skis on the left side. The second group has three people standing on their skis, with one person further back and another pair standing together. All the skiers seem to be enjoying their time and the various combinations of people are engaging in conversations and activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 9628, "caption": "The image captures a thrilling scene where a man is enjoying a waterskiing experience in the lake. He is riding a pair of skis, one after the other, and appears to be in awe, capturing attention. In addition, there are several water skis spread out on the water near the man. The man is likely to have had an exciting experience as he soars through the air, while the skis perform their stunts below him.", "mscoco_hallucinated_words": [["ski", "skis"], ["ski", "skis"], ["ski", "skis"]], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [24, 48, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 330050, "caption": "According to the image, the man and woman are standing on a tennis court. Both of them are holding tennis rackets, and there are no other people in the scene. The image depicts a relaxed and casual atmosphere, with the man and woman smiling and standing close to each other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " In the image, a man is flying a kite on a brick plaza in the city, surrounded by tall buildings. There are several people standing nearby, including another man standing off to the side, as well as a few other people further back in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": " In the image, a group of people are gathered in a room, with a man and a woman standing closely together. The woman seems to be looking towards the man, creating a sense of closeness between them. There are four bottles placed on a countertop, and a wine bottle is standing on the lower left edge of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "mouse", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 163518, "caption": "In the image, there is a woman standing in a kitchen-like environment with a tray filled with various salads. She is wearing a black apron and blue gloves, as she appears to be prepping the food for serving. The salads are arranged neatly in a row on the tray, with a couple of them located towards the middle, and the remaining ones spread out towards the left and right sides. A knife can be seen in the bottom right corner of the tray, possibly used to cut the salads.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["oven", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 156696, "caption": "This is a black and white image of a train station, with two people standing on the platform. The image is slightly blurred, as if it was taken in the past. There are two people standing next to the first train car, which appears to be the back of the train. Another person is standing on the platform in front of the second train car, which is also the back of the train. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "train", "handbag", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 525202, "caption": " In the image, a pizza is placed on a counter set out on a table. The pizza has cheese on it, indicating a delicious treat for those around it. There are a few people in the background, likely enjoying the atmosphere and gathering for a meal or a social event. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 423984, "caption": " The image features a plate filled with a variety of food items, including carrots, broccoli, and bread. Among the vegetables, there are multiple carrots placed around the plate, along with a few broccoli pieces. The broccoli is spread out, with some pieces placed towards the top of the plate, and others placed closer to the bottom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499486, "caption": " According to the picture, two cats are sleeping on a messy bed in an undisclosed location. One of the cats is on the left side of the bed, while the other is on the right side. The bed appears to be unmade and disheveled, with a pile of clothes on top of it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cat", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": " The image features a smiling man in a shirt wearing a tie while sitting in the back seat of a car. The car is placed in the center of the picture and the man is occupying most of the space in the background. A purse can be seen on the seat next to the man, which suggests that he might be sitting inside a vehicle that is still in motion. Some additional details in the scene include a bottle on the right side and a glass on the left side, both located near the man.", "mscoco_hallucinated_words": [["seat", "chair"], ["purse", "handbag"], ["seat", "chair"], ["bottle", "bottle"]], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [17, 46, 52, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.36363636363636365, "Recall": 1.0}}, {"image_id": 288729, "caption": "According to the picture, there is a small cake that has been made into a circus tent. Inside the tent, there are two dolls positioned in the middle. The cake is placed in a prominent position on the table, likely for a party or a decorative display.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["teddy bear", "bear", "elephant", "cake", "person"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 177721, "caption": "According to the picture, a person in a green shirt is seen performing a skateboard trick on a railing. The skateboard is in mid-air as the person is skillfully managing their balance. The background includes several park benches, suggesting it might be a park or a public space where people gather. There are also a couple of other people in the area, likely watching or waiting for their turn to perform their own tricks. Overall, the image captures the energy and excitement of skateboarding in a public space.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["skateboard", "person", "dining table"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": " The scene features a man with a white beard wearing a hat, standing in a crowded area. He appears to be reading a book while holding his cell phone with a black background. The area is filled with people, with some walking in different directions and others standing nearby. There is also a bench nearby, where people can rest or sit while they interact with each other.", "mscoco_hallucinated_words": [["book", "book"], ["bench", "bench"]], "mscoco_gt_words": ["cell phone", "horse", "handbag", "person"], "hallucination_idxs": [25, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 252625, "caption": "The scene is set along a city street with various stores and shops visible from the road, as well as a few signs indicating the presence of parking spaces and parking restrictions. There are several people on the street, two of them on bicycles, and one person walking on the sidewalk next to the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 493610, "caption": " In the image, a blue truck is driving down a dirt road with people standing around and walking in the area. There is a large group of people, with some seated and others standing, on the back of the truck, enjoying the ride. A number of them can be seen seated on different parts of the truck, while others are standing or walking around the vehicle. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 302789, "caption": " The image features a small bedroom that has a single bed against one wall and a suitcase sitting on the floor next to it. The suitcase is the only piece of luggage in the room, suggesting that it belongs to the person sleeping on the bed. The bedroom has a refrigerator located near the other wall, while the sink is placed near the edge of the bed. ", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["suitcase", "handbag", "refrigerator", "backpack", "bottle", "sink", "bed"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5714285714285714}}, {"image_id": 30667, "caption": "The scene features a large brown dog sitting on a boat with two people's feet visible around it. There is a seat or bench positioned behind the dog, possibly where the two people could be sitting. In the background, there are three other chairs in different parts of the boat. It appears to be a fun outing with both the dog and the people enjoying the boat ride.", "mscoco_hallucinated_words": [["seat", "chair"], ["bench", "bench"], ["chair", "chair"]], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [23, 25, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 114108, "caption": " The image displays a dining table with several people seated around it. At the center of the table, there are two pizzas, one with cheese and the other topped with vegetables. A knife is positioned near one of the pizzas, possibly for cutting. A cup is also visible on the table, possibly for drinks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 328110, "caption": " In the picture, a woman is sitting on a bench with her baby in a stroller. The baby is holding the bottle, which is likely being fed. The woman has her hand on the baby, keeping an eye on the child as she sits on the bench.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["backpack", "handbag", "bench", "person"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.5}}, {"image_id": 366111, "caption": "The image features several people riding bicycles in an urban setting, creating a lively atmosphere. In total, there are five bicycles visible, with a man riding a bike near the front of the scene, another person with two bikes in front of him, and three more bikes further back. A traffic light is positioned at the side of the scene, indicating a controlled intersection.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 532481, "caption": " The image depicts an exciting moment in the water with several people actively engaged in parasailing, kite boarding, or other water sports.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "surfboard", "kite", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 291634, "caption": " The scene features a sidewalk with several parked motorcycles lined up next to each other. One of the motorcycles is a bit closer to the edge of the sidewalk while the rest are more towards the middle. There are also a couple of bicycles parked in the background, one slightly in front of the motorcycles and the other further away from them. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463678, "caption": " The image features a young man playing tennis on a court. He is in the middle of the court, wearing white and holding a tennis racket, ready to hit the tennis ball. The ball is also in the middle of the court, approaching the tennis player. There are several chairs visible around the court, possibly for spectators or other players.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 70815, "caption": "In a black and white image, a dog is walking down a sidewalk, perhaps on a leash, on the left side of the picture. The dog is being led by a bicycle, likely parked on the sidewalk. The dog is near a street sign and a parked car on the right side of the scene. There are at least three other people in the image, standing on the sidewalk in various positions or moving. A truck is also present, likely parked in the background.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["bicycle", "person", "truck", "dog"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 367872, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 353898, "caption": " In the image, a man is sitting on a bench surrounded by a large group of pigeons. The birds are spread across the scene in various positions, covering the ground and nearby surfaces. There are several of these birds present in the scene, both small and larger in size.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "cell phone", "backpack", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": "According to the picture, there is a bowl of soup and two grilled cheese sandwiches on a dining table. The soup is placed near the left edge of the table, while the sandwiches are situated closer to the right edge. A spoon can be seen, indicating that the sandwiches are being served with the soup.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["sandwich", "spoon", "bowl"], "hallucination_idxs": [19, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 463211, "caption": "The image depicts a little girl wearing a pink backpack walking near a school bus. There are two other children in the scene, with one of them being led by the little girl. The girl is wearing a pink shirt and appears to be a student getting ready for school. The children are walking in the same direction as the school bus, which is in the background. A couple of school buses can be seen in the scene, one closer to the little girl and another one in the distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 396269, "caption": "The image depicts a grand and elegant room with an ornate and colorful ceiling and walls, featuring a large vase of colorful flowers on the dining table. There is a chair placed at the right side of the room, while another chair is closer to the center, with a flower arrangement sitting in front of the chairs. The overall atmosphere of the room appears to be elegant and sophisticated.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 98979, "caption": "The image features a microwave sitting on a counter top next to a wine bottle. The microwave has a silver color, with the word \"Kenmore\" prominently displayed on the front. It seems like a new and clean microwave is being used in this kitchen setting.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["microwave", "knife", "oven"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 244455, "caption": " The scene features two motorcycles riding down the road, each wearing the full gear, including helmets, that typical for bikers. One of the motorcycles is more prominent, riding slightly in front of the other. There are multiple cars parked along the road, likely at various distances from the motorcycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 379443, "caption": "According to the picture, there is an orange pair of scissors on a table, with a pair of scissors open and a spoon sitting next to them. The spoon and scissors are positioned in the foreground of the image, creating an interesting contrast with the black fabric. The scissors appear to be held together by the fabric, with the pair occupying the center of the image and drawing attention to the tools.", "mscoco_hallucinated_words": [["spoon", "spoon"], ["spoon", "spoon"]], "mscoco_gt_words": ["orange", "scissors", "dining table"], "hallucination_idxs": [24, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 536154, "caption": " A young boy with arms outstretched and a smile poses among many stuffed animals. The image features a blue background with a bunch of teddy bears and stuffed animals of various shapes and sizes surrounding the boy, who seems to be enjoying his time in the midst of them. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 374061, "caption": " In the image, a black and white cat is sleeping on a bed and cuddling a stuffed animal, likely a teddy bear. The cat is stretched out and holding the stuffed animal with its arms and legs. There are a couple of people sitting near the cat in the image, likely watching over or caring for it.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["teddy bear", "bear", "cat", "bed", "couch"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 560018, "caption": "In the image, there is a large group of people on water scooters and surfboards enjoying a day of surfing on the ocean. There are multiple surfboards visible in the scene, with people riding them and spread out over the water. Several people are positioned closer to the beach, while others can be seen further out to sea. Some surfers appear to be in groups or clusters, while others are on their own.", "mscoco_hallucinated_words": [["scooter", "motorcycle"]], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 353320, "caption": " In the image, there is a guitar sitting on top of a large brown electric guitar. The bench is located next to a pair of suitcases, and there are two backpacks sitting on the ground nearby. These bags are positioned close to each other on the left side of the scene, while the guitar is closer to the right side. There is another backpack in the background, visible behind the first two bags.", "mscoco_hallucinated_words": [["backpack", "backpack"], ["backpack", "backpack"]], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [33, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 256628, "caption": "The image features a golden retriever dog sitting on a wooden deck. The dog is wearing a collar and lying down on the floor, with its tongue slightly hanging out. There are chairs placed around the dog, including two chairs on either side, and one chair located farther away in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 158806, "caption": "The image depicts a large black and tan dog sitting outside, with a person in the background on the right side of the scene. The person has a plate with two slices of toasted bread on it, which they are about to offer to the dog. The dog is also in close proximity to the person, and it seems that the two have a close bond. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 424270, "caption": "In the image, a little girl is sitting on a bed in the middle of the scene, surrounded by sheets and blankets. The bed appears to be a large size, taking up a significant portion of the scene. The girl appears to be playing or relaxing on the bed. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": "According to the picture, there is a horse walking across a parking lot, with several cars blocking its path. The horse appears to be walking from one side of the parking lot to the other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 483227, "caption": "The image depicts a group of young people standing on a tennis court. There are six people in the scene, four of them holding tennis racquets. The individuals appear to be participating in an outdoor activity or event, perhaps a tennis tournament. Some of the players are wearing purple shirts, and one person is holding a drink bottle.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 228306, "caption": " The image features a red double-decker bus driving down a street in front of a tall building with many apartments. The bus occupies a large portion of the scene, covering most of the left side and a considerable part of the right side. Several other vehicles are visible in the background, including a few cars parked on the street and another bus moving on the road to the right. There is a person standing near the bus, likely waiting for it or taking a ride.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416384, "caption": "According to the picture, a woman in a pink bathing suit is sitting on a beach chair, looking at her cell phone. She is holding her cell phone with her right hand, possibly checking messages or communicating with someone. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 364783, "caption": "The image shows a brightly colored bathroom with a small toilet and sink. The toilet is positioned near the center of the bathroom, and the sink is located nearby. The bathroom features a mixture of different colors, with red dominating the overall decor, including the walls, tub, and sink. Several cords and ropes are visible in the picture, hanging from various spots in the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "potted plant", "bottle", "toilet", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 335041, "caption": "The image features a person, likely a woman, wearing a red jacket and black pants, skiing down a snow-covered street. This street is surrounded by parking lots and a car can be seen parked in two different spots, close to the sidewalk. The woman appears to be enjoying the winter weather while walking through the snow-covered street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 531327, "caption": "The image features a white bowl filled with a variety of colorful foods, including different types of carrots and broccoli. This assortment of vegetables can be seen at a table top. There are a number of utensils on the table as well, including a fork, a knife, and a spoon.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot"], "hallucination_idxs": [5], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.8571428571428571}}, {"image_id": 451228, "caption": "There is a busy street scene with several buses, including a red double-decker bus, travelling in the same direction. One bus is travelling on the right side of the street while the other is on the left. There is a bicycle in the middle of the road, with another bicycle near the right bus. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256365, "caption": " The image features a plate filled with several cupcakes decorated with blue frosting. There are nine cupcakes of various shapes and sizes on the plate, which seems to have been made of white and blue frosting. Some cupcakes are placed in the middle of the plate, while others are positioned towards the left and right edges. The plate is likely made of white or another light-colored material, making the blue frosting stand out.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "cup", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": " A group of people are standing around a tennis court, and a man is playing a game of tennis on the court. He is holding a tennis racket in his hand, getting ready to hit a tennis ball. The ball appears to be high up in the air and is likely to be coming towards the man in the near future.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "In the image, a black and white dog is sleeping in a dog bed with various books on top of it. The bed is situated in the lower part of the picture, with books arranged on the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 51054, "caption": " The scene in the image features a woman sitting in a bedroom, holding a cell phone up to her ear. The bed appears to be placed off to the right side of the room, while a pillow can be seen on the bed. The woman is wearing a pink shirt and looks at the cell phone while it rings. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bottle", "cell phone", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 1268, "caption": "In the image, a group of people is gathered near a body of water, likely a river or a canal, possibly under a bridge. The scene features a woman, who is the main subject, taking a photo of a white bird on the water. Another woman can be seen nearby, standing closer to the water. They are both holding cameras, making them interested in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "backpack", "bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 62692, "caption": " The image captures a woman in black and white clothing hitting a tennis ball with a racket on a tennis court. The woman appears to be in the middle of the court, swinging the racket and attempting to return the ball. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": "The image depicts a room with a chair sitting in front of a mirror and a table with chairs and vases of flowers scattered around. The chair appears to be vintage, and there are four chairs in the room, one slightly below the other chairs. Two vases are positioned on the table, one towards the top and another near the center of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 575406, "caption": " | Scene | ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bus", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 244834, "caption": "According to the picture, there are two people standing and walking down a rainy street. Both are wearing umbrellas and holding them over their heads for protection from the rain. They are holding hands, making it seem like a romantic or close friendship between the couple. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "clock", "handbag", "umbrella", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 151159, "caption": "The image depicts a large outdoor park, with multiple people scattered around the space. Two men can be seen skateboarding in the park, one of them on the right side, and the other on the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 277073, "caption": "The scene features a man riding a motor scooter, and a woman is sitting behind him on the back. There is another man walking next to the motor scooter, and a person can be seen in the background walking near a bus. A bottle is also present in the scene, possibly as a beverage for the riders.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bicycle", "motorcycle", "car", "backpack", "person"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4}}, {"image_id": 377879, "caption": "In the image, two military tanks are parked on train tracks. The first tank is located further towards the left, while the second one is towards the right. Both tanks appear to be a similar color, possibly a sandy or brown shade. The tanks appear to be quite large, taking up most of the train track they are parked on. They are stationary, possibly waiting to be transported by the train or being exhibited as military memorabilia.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": " The scene shows a typical office with a computer sitting at a desk along with a laptop on top. The office is set up with a door in the background, a chair placed closer to the desk, and a phone resting on the desk. Additionally, there is a camera positioned on the left side of the room. The computer and laptop are equipped with monitors and keyboards. The environment suggests that this is a typical work setting, and the presence of various technology and devices highlights the nature of the office and its use.", "mscoco_hallucinated_words": [["phone", "cell phone"], ["monitor", "tv"], ["keyboard", "keyboard"]], "mscoco_gt_words": ["laptop", "mouse", "chair", "dining table"], "hallucination_idxs": [42, 70, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.75}}, {"image_id": 63934, "caption": " The image shows a person on a skateboard, wearing a costume, riding in the street with various cars parked on either side of the road. The cars are parked at different angles, some of them nearer to the curb than others. The person riding the skateboard appears to be navigating down the street, perhaps enjoying a day outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 275611, "caption": " The image showcases a delicious-looking homemade Italian cheese pizza, placed on a silver pizza pan, sitting on top of a table. The pizza is neatly displayed with several slices, including one slice in the upper-left, two slices in the center, one slice on the right side, and another slice in the lower-right area. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "In the image, there is a person sitting at a table with a computer setup. The person is using a keyboard and mouse on the table to interact with the computer. The keyboard and mouse can be seen, highlighting the person's activity on the computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": "The scene features a large herd of sheep walking along a road. A person is driving down the road in a vehicle looking out the window at the sheep. They are spread out across the roadway with some near the car, others farther away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sheep", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 422326, "caption": " There is a parking lot filled with several motorcycles of different sizes and designs. One of the motorcycles has an unconventional shape, with a side car attachment. The motorcycles are lined up in the parking lot, displaying various styles and colors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 110449, "caption": "The image features three people in a room, sitting or standing around a table. Two of the people are holding cell phones, which might suggest that they are using their mobile devices. There is also a TV behind them, likely on and playing some sort of content. ", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["remote", "tv", "dining table", "laptop", "person"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 45550, "caption": "The image depicts a man posing for a photo, smiling and holding a plate containing some food, likely fried food, such as fried dough, accompanied by some pasta. There are several bottles and cups in the background, likely placed there by the restaurant's staff for patrons.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["sandwich", "clock", "person", "bowl"], "hallucination_idxs": [36, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.25}}, {"image_id": 444913, "caption": "In the image, a man is walking along a sidewalk with a large bush in the background. The man appears to be holding a baseball hat, possibly preparing to wear it as he walks. There is another bush on the other side of the sidewalk, adding to the greenery in the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": "The image captures a dog in action, leaping up in the air while catching a pink frisbee. The dog is in the center of the scene, surrounded by numerous people seated on the grass. Some people are closer to the dog, while others are further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle", "frisbee", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 132796, "caption": "In the image, there are two people riding on the back of a large elephant. One person is seated in the middle of the elephant, while the other is sitting towards the right edge of the animal. Both individuals appear to be enjoying their ride on the elephant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "elephant", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "Two cats are sitting on a toilet together in a bathroom, resting and possibly sharing the space. There is a toilet on the right side of the image, with the cats sitting on top of it, one closer to the left side and the other towards the center. A sink is also present, situated to the right side of the toilet. The picture showcases a typical bathroom setting, with a toilet and a sink, creating an everyday scene.", "mscoco_hallucinated_words": [["sink", "sink"], ["sink", "sink"]], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [54, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": "According to the image, there are several motorcycles parked down the street in a line. They are placed in various positions on the street, some closer to each other than others. Some of the motorcycles are in the middle of the road while others are near the edges.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 205193, "caption": "According to the image, it is a kitchen scene featuring a few items in the foreground. To the right, there is a brown granite countertop with various food items on it. These items include an orange cup near the top of the image, a bag of green tea below the cup, and a tea kettle near the bottom of the image. On the countertop, there is also a black pitcher to the left of the orange cup.", "mscoco_hallucinated_words": [["orange", "orange"], ["cup", "cup"], ["cup", "cup"], ["orange", "orange"], ["cup", "cup"]], "mscoco_gt_words": ["toaster", "microwave", "oven", "person"], "hallucination_idxs": [39, 40, 55, 83, 84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.8333333333333334, "Recall": 0.25}}, {"image_id": 272566, "caption": "The image captures a man sitting on the ground outside, surrounded by various types of bananas arranged in different baskets. The bananas are spread out in front of him, creating a colorful fruit display. There is also a woman nearby, sitting in the same area. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 170099, "caption": "In the image, there is a man standing in front of a couch, wearing a blue shirt and a red tie. He is also looking down, possibly at the couch or another surface nearby. The couch is positioned slightly behind him, occupying a significant portion of the scene. A chair is located nearby, with a smaller chair in the background.", "mscoco_hallucinated_words": [["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["tie", "orange", "person", "couch"], "hallucination_idxs": [56, 64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.75}}, {"image_id": 14940, "caption": "In the image, there are two stuffed animals \u2013 a cat and a dog \u2013 laying on a bed, with the cat closer to the viewer's right side and the dog on the left side. The cats and the dogs seem to be enjoying being near each other.", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["dog", "cat", "orange", "chair", "couch"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.4}}, {"image_id": 408449, "caption": "According to the picture, a shirtless man is skateboarding down a street next to another man, who appears to be standing near the street, observing the activity. The street is quiet with no other people or vehicles around, giving the impression that it's an unusual scene. The shirtless man is in the center of the scene, enjoying his skateboard ride. There is a slight breeze, as indicated by the wind blowing the hair of a person on the left side of the scene. The shirtless man appears to be the focal point of the picture as he skateboards by, with the other person observing the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "bus", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 140702, "caption": " The image showcases two elephants walking through a field, with one of them being much larger than the other. The bigger elephant is walking towards the right side of the scene, while the smaller one is closer to the left. There are some birds in the field as well, one in the background near the right edge and another closer to the center. Additionally, there are multiple people scattered around the field, with some of them near the edge of the grassy area, and others standing in different parts of the scene. The main elephants take center stage and draw attention to their impressive size difference.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 354540, "caption": "In a kitchen filled with an oven, there is a man sitting on a bench with a laptop placed on the benchtop. The man is holding a remote, likely for a presentation or video conferencing. There are also several devices, such as a cell phone, an iPad, and a laptop on the table, giving the impression of a high-tech setup.", "mscoco_hallucinated_words": [["bench", "bench"], ["remote", "remote"], ["table", "dining table"]], "mscoco_gt_words": ["toaster", "oven", "laptop", "cell phone", "microwave", "chair", "person"], "hallucination_idxs": [15, 29, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5714285714285714}}, {"image_id": 19763, "caption": "The image displays a close-up view of a piece of deep-dish pizza on a plate, placed on a table. The pizza appears to be partially consumed, with some slices remaining. The scene is set in a restaurant or bar, where people are likely to enjoy their meals and drinks. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 191301, "caption": "The image features a delicious-looking piece of cake or other dessert on a white platter, along with a couple of spoons placed on a table. There is also a cup and a bottle of an alcoholic beverage, likely a beer or wine, in the scene.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.8}}, {"image_id": 124236, "caption": "The image showcases a kitchen that is dimly lit by the sun. Various pots and pans are positioned on a table, giving a cluttered appearance to the scene. In addition to the table, there are several other items present in the kitchen, such as chairs and pots. The pots are evenly distributed around the kitchen, creating a sense of order and organization. A few bowls are placed on a lower part of the table, while other items such as a clock and a bottle are scattered around the kitchen as well. The overall vibe of the image captures a small, cozy kitchen with a few dishes neatly arranged.", "mscoco_hallucinated_words": [["chair", "chair"], ["clock", "clock"], ["bottle", "bottle"]], "mscoco_gt_words": ["toaster", "oven", "dining table", "bowl", "spoon"], "hallucination_idxs": [49, 89, 92], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.4}}, {"image_id": 344930, "caption": "A red fire hydrant sits next to the curb on a street, surrounded by grass and shrubs. A car, likely a truck or SUV, is parked nearby on the road. The fire hydrant and car create an interesting contrast, with the fire hydrant's colorful appearance drawing attention and the car in the background adding to the scene. In total, there are 10 cars in the vicinity, with three cars closer to the left side of the image, two cars more towards the center, and five cars further to the right.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["fire hydrant", "orange", "car"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " In the image, a cat is lying on a bed in a bedroom with books on a table. The cat appears to be orange, and it's occupying the majority of the scene. ", "mscoco_hallucinated_words": [["table", "dining table"], ["orange", "orange"]], "mscoco_gt_words": ["remote", "tv", "cat", "chair", "book", "bed"], "hallucination_idxs": [18, 25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 82488, "caption": "The image shows two beds in a room, with one bed being larger than the other. Both beds have orange and brown blankets, which are likely throw pillows, and there is a lampshade on the wall. The larger bed is on the right, and the smaller bed is on the left. The room appears to be a hotel or motel room, as it has a clean, well-maintained look. The beds are situated close to the right and left side of the image, leaving a small amount of space in the middle. Overall, the scene is peaceful and neat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "clock", "dining table", "orange", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 348251, "caption": " The scene depicts a man wearing skis, skiing down a snowy hill. There are several other people skiing nearby, some closer to the foreground and others in the background. The skiers are either enjoying the slope or taking breaks while skiing. The skiing enthusiasts are wearing backpacks to carry their gear and belongings. The presence of multiple snow sports equipment in the area, such as skis, creates an active and lively skiing environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 150703, "caption": "The image features a sandwich-like food item, with the toppings including various sliced vegetables, including tomatoes. The sandwich is positioned in the center of the image, taking up a majority of the space. It is surrounded by cups filled with beverages, one of which is positioned closer to the left edge of the picture, with a glass cup containing beer, and another cup to the right edge with a clear liquid. The combination of food and beverage creates a lively atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "cup", "hot dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 381615, "caption": "In a windowed room, there is a wooden chair sitting underneath a window. The chair is adorned with various plants, including several potted plants hanging from a large hanging plant holder, as well as a few others placed in other areas around the room. There are several pictures on the wall, mostly towards the top, including one centered around the room and another on the side. A bottle of wine and a wine glass can also be seen on a small table near the chair.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["wine glass", "wine glass"], ["table", "dining table"]], "mscoco_gt_words": ["cup", "potted plant", "vase", "book", "chair"], "hallucination_idxs": [75, 80, 89], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.4}}, {"image_id": 352703, "caption": "In the image, there is a woman standing at a cooking area on the sidewalk, surrounded by chairs. She appears to be cooking food in a wok, preparing a dish for an event or occasion. The scene includes various dishes and utensils, such as a knife, a bowl, and a small bottle. A handbag can also be spotted next to the table.", "mscoco_hallucinated_words": [["chair", "chair"], ["knife", "knife"]], "mscoco_gt_words": ["cup", "handbag", "oven", "dining table", "bowl", "spoon", "bottle", "person"], "hallucination_idxs": [19, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.625}}, {"image_id": 111109, "caption": "The image captures a young boy wearing shorts while sitting on a chair. There is another chair situated in the room, likely indicating the presence of at least two chairs. The boy appears to be wearing a sweater or a jacket, and he is positioned near a couch.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["tv", "tie", "motorcycle", "chair", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 226802, "caption": "This scene takes place on a red street with a lot of people walking around, providing a sense of activity. There is an abundance of cars on the street, including several buses, giving the area a busy feel. Some of the cars are moving down the street, while others are parked near the buildings or at the edge of the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 49005, "caption": "The image captures a baseball game taking place on a field, surrounded by fans and players. A total of 14 baseball players are visible on the field, including both baseball pitchers and baseball players, such as the catcher, the batter, and several infielders. The baseball game appears to be underway with the baseball ball and gloves in action. ", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": "In the image, a large herd of horses is walking down a long, open road, along with a group of riders on their backs. The herd of horses takes up much of the road, with some scattered closer to the left side and others near the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 100274, "caption": " The image depicts a train track, with multiple trains on it. The trains are traveling in various directions, with some on the left side and others on the right side. Several of these trains are parked, while a few are moving. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 317254, "caption": "The image captures a man wearing a ski mask riding a motorcycle on a road in a city. The motorcycle is the main focus of the scene, while a car is seen in the background, possibly driving on the same street.", "mscoco_hallucinated_words": [["ski", "skis"]], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [7], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 425569, "caption": "The image features a large, empty kitchen with a long counter spanning most of the room. The counter is stocked with various kitchen appliances and cookware, including a mix of stainless steel pots and pans of different sizes. There are several bowls positioned at different areas of the counter, while several utensils, such as spoons and knives, are distributed throughout the scene.", "mscoco_hallucinated_words": [["spoon", "spoon"], ["knife", "knife"]], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [60, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 28842, "caption": " The image captures a skateboarding scene with two people skateboarding down a hill in a park. The focus of the image is on the first skateboarder, who is in the middle of the ramp, while the second skateboarder is further down the hill. The presence of multiple skateboards on the hill adds to the atmosphere of the park. The people's shadows, along with the skateboards, suggest that they are enjoying themselves and enjoying the activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 319607, "caption": " The image captures a busy street scene with a person riding a bicycle at the corner of a street with many cars. The road has a few traffic lights, one of which is at the intersection where the cyclist is located. This particular traffic light is positioned in a middle area of the scene, close to the person riding the bike.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 210372, "caption": " The image depicts a small red kitchen with a sink and a few appliances. The kitchen features red cabinets with a few drawers, as well as some white counters and a sink. A few bottles can be found in the kitchen, one is situated near the left side, another one is located on the center's top portion, and a third bottle can be seen in the right area. The image showcases the sink's location and the various bottles within the kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 156915, "caption": "The image features various flowers and greens arranged in a vase on a table, creating a beautiful arrangement. The vase is filled with a large red rose and a red and white rose, along with a variety of green leaves. The arrangement is then surrounded by additional white and pink flowers, creating a beautiful, colorful display. The table can be seen in the background, showcasing the bouquet's stunning details and beauty.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 349642, "caption": "The image features two people sitting on a pair of adjacent benches, with a dog sitting between them. One person is closer to the left side of the image, while the other is closer to the right side. The dog is positioned in the middle of the benches. Additionally, there is a hand visible in the image, likely belonging to one of the people. The benches are in an outdoor area, possibly a park or a promenade.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": "The image captures a large group of people gathered on a canal in front of numerous boats with colorful umbrellas. The people are spread out along the edge of the canal, occupying different sections of the water. Many umbrellas are placed on the boats, providing shade and coverage for the attendees.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "umbrella", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45435, "caption": " The image showcases a person's hand holding a cell phone, specifically a silver or black phone. The phone is in focus and prominently displayed in the image. There is another cell phone in the scene, sitting on a table in the background, behind the person's hand and the phone being held.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": " The image features a person sitting in front of a computer, working at his desk. He is surrounded by multiple keyboards, including a keyboard in front of him, another one on his right, and a third keyboard located further away from the first two. There are also a laptop, placed above a keyboard on the right, and two more keyboards placed lower in the scene.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["keyboard", "keyboard"], ["keyboard", "keyboard"], ["keyboard", "keyboard"], ["keyboard", "keyboard"]], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [22, 26, 41, 59, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.42857142857142855}}, {"image_id": 72656, "caption": "In the picture, there are three children sitting at a small table eating together. The children are sitting on a bench located in front of the table. They are eating food, likely pizza, from plates placed on the table in front of them.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["pizza", "suitcase", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 372087, "caption": " The image shows a large green and white train moving on a train track alongside the platform, where people are waiting. Some individuals are standing on the platform, near the train, while others are sitting further away from it. The train is the main attraction, as it occupies a significant portion of the scene. There are several bicycles on the platform, both closer to the train and farther away. A traffic signal is also visible on the left side of the scene, indicating a busy intersection or traffic regulation. The people on the platform appear to be engaged in various activities, such as waiting, chatting, or watching the train. Overall, it's a lively atmosphere with the train and people interacting on the platform.", "mscoco_hallucinated_words": [["bicycle", "bicycle"], ["traffic signal", "traffic light"]], "mscoco_gt_words": ["train", "suitcase", "bench", "person"], "hallucination_idxs": [64, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5}}, {"image_id": 336209, "caption": "A young man is captured in the act of skateboarding on a ramp, enjoying a thrilling moment of skateboarding. The ramp appears to be at a skatepark or other outdoor facility. The man is on the right side of the image, with the skateboard on the ramp. The surrounding area features a variety of buildings, including one located closer to the ramp, and a few others further away, possibly at a distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 46031, "caption": "There is an image of a laptop computer on top of a white desk in the image. The computer appears to be turned off, with a mouse positioned near the left side of the keyboard. A cell phone is also present, placed near the bottom left of the desk. The cell phone is slightly elevated compared to the laptop. A screen is visible behind the desk, with several plants visible on it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 283118, "caption": "According to the picture, the image depicts a family with a man, a woman, and a young girl sitting together at a dining table in a restaurant. They are smiling and enjoying each other's company. The table is adorned with various food items, including an apple, an orange, a plate of pasta, and several slices of pizza, placed around the table for the family to enjoy. ", "mscoco_hallucinated_words": [["apple", "apple"], ["orange", "orange"]], "mscoco_gt_words": ["pizza", "handbag", "dining table", "bottle", "cell phone", "car", "backpack", "chair", "person"], "hallucination_idxs": [52, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.3333333333333333}}, {"image_id": 429598, "caption": "The image features an old kitchen with wooden cabinets and flooring. It has a stove and oven, as well as a refrigerator and a sink. The sink is located in the far left side of the kitchen, while the refrigerator is near the middle. The kitchen also has a small oven placed near the sink and the stove. The refrigerator has a wooden countertop in front of it, adding a rustic touch to the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 333365, "caption": " The scene is set in an airport with several airplanes visible. Three planes are positioned on a runway with one plane closer to the front and two planes farther away. The runway is located near the edge of a parking lot, where some cars are parked. There are also other planes, including a couple more, in various positions throughout the scene, some closer to the runway and others further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 424174, "caption": " The image shows a large plate filled with various kinds of apples and bananas, all placed together in the bowl. The bowl is positioned on top of a wooden table, making the colorful display of fruit easily accessible. Some of the apples are visible in the foreground, while others are located towards the back of the plate. The bananas are spread out across the plate, with some of them occupying more prominent positions near the front of the bowl. This arrangement of fruits is likely intended to be a visually appealing display, showcasing a variety of colorful fruit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "apple", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 3035, "caption": "The scene features a large airport runway with multiple airplanes parked at various locations. One airplane is situated near the end of the runway and another airplane is visible in the distance, on the left side of the runway.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 177213, "caption": "The image displays a table with a white plate on it, containing a cheese pizza placed on the left side. To the right of the pizza, there is a knife, indicating that someone might have just finished eating or preparing food. In the scene, there are also two cups near the plate and pizza, one closer to the plate, and the other closer to the knife. In addition to the pizza, another person can be seen at the back of the picture. The overall scene appears to be a dining or kitchen area, with multiple people and food items around the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 438535, "caption": "The image is a black and white shot of a city street with a large clock, likely a subway station, prominently displayed in the scene. The street is narrow and busy, with many cars and people scattered in different parts of the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 490711, "caption": "The image depicts a spacious and empty kitchen, featuring several white appliances and fixtures. A refrigerator is positioned towards the left side of the room, surrounded by a couple of cabinets and a sink. A microwave oven is situated closer to the middle of the room, while a toaster oven is placed further to the right. Several bottles and cups can be found throughout the kitchen, likely for drinks or storage purposes.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["oven", "oven"], ["toaster", "toaster"], ["oven", "oven"], ["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["sink", "refrigerator", "banana"], "hallucination_idxs": [39, 40, 53, 54, 63, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.75, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "According to the image, it features a sleeping kitten lying on a dining table next to a keyboard and mouse. The kitten is relaxed and content, taking a nap amidst the dining essentials. The dining table has a surface that's covered with a few books, and there are several mice scattered throughout the scene, including one on the table beside the kitten. The sleeping cat is occupying the left side of the dining table, while the computer peripherals are on the right.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["book", "book"]], "mscoco_gt_words": ["cup", "dining table", "laptop", "mouse", "cat"], "hallucination_idxs": [18, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.8}}, {"image_id": 456756, "caption": " The image features two giraffes and a zebra, all together in a pen area. The giraffes are the primary focus of the scene, as one is standing tall and prominently positioned in the middle of the image, while the other is a little further back and to the right. The zebra can be seen on the far left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": "The image features a train or a bus with two rows of seats going down the middle. One row is situated closer to the window, while the other row is behind it. There are several empty seats, including some arranged in the front of the vehicle and others spread across the middle. The occupants are spread throughout the cabin, some sitting together and others sitting alone. A TV can be seen mounted above the seats.", "mscoco_hallucinated_words": [["tv", "tv"]], "mscoco_gt_words": ["train", "person", "chair", "car"], "hallucination_idxs": [74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 343937, "caption": "The scene captures a woman snowboarding down a hill in the snow. She is wearing a snowboard and skis, while her hands and legs are visible. There is another snowboard in the background, possibly belonging to another snowboarder.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 277051, "caption": " The image depicts two birds, one smaller and the other a bit larger, sitting on a table next to some pizza slices on a plate. The table appears to be dining-related, possibly a dining table or a table used for meals. The birds seem to be focused on the food on the plate. A bottle of liquid, likely a sauce or juice, is also present in the scene. ", "mscoco_hallucinated_words": [["pizza", "pizza"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "bird", "chair"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 145666, "caption": " The image showcases a pot filled with various vegetables, including broccoli, carrots, and onions. The broccoli is spread across the pot, covering a large portion of the cooking area. The carrots are also present in the pot, positioned closer to the left side. Additionally, there are several onions located throughout the pot, some closer to the bottom and others near the top. The vegetables are being cooked and appear to be well-cooked, with some appearing more or less cooked than others.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 304984, "caption": " The image displays a delicious meal consisting of a sandwich and a side dish on a plate. The sandwich appears to be a French toast sandwich, with a piece of cake on the side. There are also two bottles, one wine and the other a juice, surrounding the meal on the table. The sandwich is served with two forks on the plate, allowing the person to enjoy their meal comfortably. Additionally, there are two spoons on the table, one closer to the wine bottle and the other closer to the cake.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["table", "dining table"], ["person", "person"], ["spoon", "spoon"], ["table", "dining table"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "sandwich", "cake", "fork", "orange", "wine glass"], "hallucination_idxs": [41, 56, 71, 83, 86, 93], "metrics": {"CHAIRs": 1, "CHAIRi": 0.46153846153846156, "Recall": 0.5}}, {"image_id": 122467, "caption": "A person is shown holding a video game remote controller, likely for the Nintendo Wii, in their left hand. The remote controller is positioned in a way that highlights their injured hand. The person appears to be preparing to play a video game, using the remote as they would in a typical gaming session.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 518287, "caption": "The image features a toothbrush with a smile in the middle. It's likely the brush has a unique design or is a custom piece. The smile is either carved or sculpted into the toothbrush's surface. The toothbrush is on an orange background, providing a playful and colorful atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 145448, "caption": "According to the picture, there is a cat sitting on top of a flat sheet that is covering a car seat. The cat is likely enjoying the warmth provided by the blanket-like material. In the background, a man is walking, and a person sitting on a bench can be seen as well. There are a couple of other chairs in the scene, and a dining table can be spotted on the left side of the scene. A potted plant is placed in the left corner, and a bottle is situated on the right side. A car tire can be seen in the middle of the scene, adding to the scene's atmosphere.", "mscoco_hallucinated_words": [["bench", "bench"], ["potted plant", "potted plant"], ["bottle", "bottle"]], "mscoco_gt_words": ["dining table", "motorcycle", "cat", "car", "chair", "person"], "hallucination_idxs": [51, 85, 96], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.8333333333333334}}, {"image_id": 352900, "caption": " |  The image features a plate of pasta on a table with several pieces of broccoli distributed throughout. There is a bowl of broccoli, some broccoli lying on top of the pasta, and smaller pieces of broccoli scattered around the dish. The pasta itself appears to be a mix of penne and spaghetti.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "fork", "knife", "broccoli"], "hallucination_idxs": [10], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 153607, "caption": "According to the picture, there is a collection of three vases filled with water, each featuring a flower in a slightly different arrangement. The flowers appear to be orange, with one flower being placed in the middle, another on the left, and a third on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 157581, "caption": " The image shows a dining table with a variety of wine bottles lined up on top of it. The table is placed in the middle of a room, surrounded by various chairs and a mirror. There are several people visible in the scene, with at least seven individuals sitting or standing close to the dining table. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 227438, "caption": "According to the picture, a spacious bedroom is shown with a large bed sitting in the middle of the room. There is a nightstand on the right side of the bed, and a television sitting on the wall to the left of the bed. Two lamps can be seen, one placed on the right side of the room and the other on the left side, both pointing towards the bed.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["clock", "couch", "bed", "chair"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.25}}, {"image_id": 116037, "caption": "This image showcases a baseball game in progress, with a batter standing in front of home plate ready to swing at a ball coming towards them. The batter is holding a baseball bat, while a catcher and an umpire are nearby, anticipating the swing. In the scene, multiple people are present, some of them close to the players, likely spectators, and others further away, possibly teammates. There are various chairs and benches scattered around the field, providing seating for the players and audience.", "mscoco_hallucinated_words": [["chair", "chair"], ["bench", "bench"]], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [80, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.75}}, {"image_id": 202658, "caption": "The image features a white toilet sitting on the ground outside. There are two chairs near the toilet, with one located to the left and the other to the right. The toilet is also adorned with a large pink hello kitty sticker on the seat cover, creating an interesting design.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 18150, "caption": " The image shows a man and a young child sitting on a couch, playing together. The man appears to be eating a slice of pizza, with a child next to him eating a similar pizza. There is also a bottle of water visible near the man. The scene is set in a living room, possibly within a house.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "bottle", "backpack", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 418711, "caption": "The image features a man on a bike with a large amount of bananas tied to it. The bananas are arranged in bunches and hang from various parts of the bike, making it appear as if the man is carrying a large bag of bananas with him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "car", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image features a green bowl filled with food. The food inside the bowl appears to be a mixture of corn, broccoli, and some other ingredients. There is a scoop of chow in the bowl, as well as some bread slices or bread pieces. The food in the bowl is on a green tablecloth or placemat, and there is a fork in the picture. The image seems to be placed against a green backdrop.", "mscoco_hallucinated_words": [["fork", "fork"]], "mscoco_gt_words": ["bowl", "spoon", "broccoli", "dining table"], "hallucination_idxs": [67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 491000, "caption": "According to the picture, there is a group of people walking down a narrow city street. They are located near a building in the middle of the scene, and several other people are standing and talking. There are also several cars parked along the street, adding to the busy atmosphere of the scene. The street appears to be a pedestrian area, with people walking along it and enjoying the urban setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 401163, "caption": "A man sits on a park bench surrounded by trees while a woman and another man stand in the park. The park bench is positioned in the center of the scene, with the woman on the right and the other man on the left. The man is sitting closer to the left side of the image while the woman and the other man are farther away, near the right edge of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 39663, "caption": "The image shows a variety of items arranged on the floor. The most prominent among these items is a backpack, which takes up the largest part of the area. There is also a laptop sitting next to it. Numerous stationary items such as scissors, tape, and erasers are scattered around the backpack. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["laptop", "book", "cell phone", "backpack", "scissors"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 268293, "caption": "The image features a man, who is wearing a gray shirt, playing tennis and holding a tennis racket. He appears to be in the middle of a match, ready to return the ball with his racket. The ball in the image appears to be flying through the air during the action, and the man is focused on his game. There are several people in the scene, some positioned closer to the man and others slightly further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": "The image depicts a small, clean kitchen with white cabinets and light wooden flooring. The kitchen has a combination oven/microwave (oven) located near the right side of the room and a sink located at the center, by the refrigerator. A washing machine is also present in the room, situated near the left side of the kitchen.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["clock", "oven", "spoon", "bottle", "vase", "apple", "sink"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2857142857142857}}, {"image_id": 358086, "caption": "According to the picture, a group of people, consisting of men and women, are engaged in various actions on a grass field. Several individuals are running towards a white frisbee, which one of them is trying to catch in mid-air. The others are watching and getting ready to play their part. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "frisbee", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 89541, "caption": "In the picture, there is a tray filled with a variety of food items, primarily consisting of fruits. Among the fruits, there are several strawberries and blueberries, as well as other toppings and pastries. The food is placed in a yellow container, which is laid out on a colorful carpet. The food in the tray is well-arranged and nicely organized, making it appealing for various events and meals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "cake", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 239445, "caption": " The image shows a man standing in an open area outdoors, possibly at a park or a garden. He is holding two white frisbees, one in each hand, as he smiles while the frisbees are upside down.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": "According to the picture, a kangaroo is standing in a field with a colorful kite on top of it. The kite appears to be flying and the kangaroo is enjoying itself, likely taking advantage of the windy conditions. The field is filled with grass, and there are a few traffic lights and cars visible nearby. The setting seems to be a park or a green space.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 487990, "caption": "A person is standing over a large table filled with various different foods. There are also several other people around the table, some closer to the person holding the guitar and others more distant. The table is adorned with many dishes and bowls, some with food, others empty, and a few cups. The arrangement of food items on the table gives the impression that it is a gathering or event with multiple people contributing to it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cake", "fork", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 138834, "caption": "According to the picture, there is a man in the snow wearing a black jacket, carrying his skis, and walking with them in tow. The snow is quite heavy, making it a bit challenging for the person to carry his equipment on his back. The man is likely getting ready to go skiing and is walking in the snow, possibly to get to his car or another location. In the background, there is another person visible, possibly a companion or someone assisting the man.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 492057, "caption": "In a bathroom, a man is standing in front of a mirror while taking a selfie with a cell phone. There are a few people visible in the background, some closer and others more in the distance. The bathroom has a handful of drinks nearby, including bottles placed at different spots. The mirror reflection shows a couple of additional people, one of whom is wearing a tie.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 388215, "caption": "The image captures a tennis court with a person in the center of the court, preparing to hit a tennis ball. The man wears a blue and white shirt, and he is holding a tennis racquet in his hand. The court has multiple sports nets attached on various areas, making it a suitable venue for playing various sports. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 111409, "caption": "The image features a man riding a motorcycle down a street, likely on a city block. He is sitting on the motorcycle, wearing a helmet and looking ahead as he navigates the street. The motorcycle has the letters \"JAPAN\" inscribed on the rear.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": "This image displays a beautiful bouquet of greens on a table. The greens consist of a large bunch of flowers with a vibrant, bright green color. The flowers are prominently placed in the center of the image, surrounded by various fruits, vegetables, and a few people. There are three people in the image, each standing at different positions and looking towards the bouquet of flowers. One person is on the right, another is in the middle, and the third is on the left side of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92462, "caption": "The image depicts a street scene with several cows walking down the road, likely as part of a herding process. Some of the cows are closer to the front of the scene, while others are further back. There are four cows on the right side, while only three cows are on the left side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cow", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 397268, "caption": "The image shows a small bike store inside a building with various bicycles on display in the window, including a few behind chairs outside. There is a large bicycle placed in the foreground, with another one in the middle and a third one further towards the back. The chairs are positioned along the edges of the front window, attractively displaying the bicycles for customers to view. There is a bench placed outside the store, close to the left edge of the image. Overall, the setting is a small bicycle store with bikes both inside and outside the building.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "chair", "potted plant", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 127659, "caption": " The image shows a long train traveling along a train track, with people standing by the platform as they wait for the train to come to a stop. There are four people standing near the train, three on the right side and one on the left. They appear to be waiting at the station or getting off the train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 251857, "caption": "The image features two men riding skateboards in a city park, with one person performing tricks on his skateboard and the other person watching. They are positioned in different parts of the park, with one person closer to the right side of the image and the other closer to the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 243212, "caption": "According to the picture, a young child is standing in a park with a blue frisbee in their hand. There are several vehicles parked nearby, including cars, one of which is closer to the child. The vehicles appear to be behind the child, with some further away and others closer to him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "frisbee", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 444304, "caption": "The image depicts a young man kneeling on the floor while holding a remote control. In front of him, there is a television set and multiple game controllers. There are two other controllers on the left side of the image and a bottle nearby. The scene appears to be an intimate setting for entertainment or relaxation.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["remote", "person", "bed", "tv"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 305000, "caption": "According to the picture, there are various vegetables and fruits displayed at a woman's stand in a marketplace. The stand has numerous fruits such as apples, oranges, and bananas. Two women can be seen in the image: one is in the foreground, standing near the fruits, and the other is in the background, standing further away from the display. Both of the women are wearing plaid shirts.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 44136, "caption": "The image features two men sitting at a table, possibly in a dining or banquet setting, enjoying a meal or event. Both men are dressed in tuxedos, and one is wearing a bow tie. One of the men is holding a wine bottle, while the other has a wine glass in hand. They appear to be in a festive mood, socializing, and enjoying the event together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 154846, "caption": "The image features a large pepperoni pizza with mushrooms on it, placed on a table. There are several people in the scene, some of whom are dining or chatting, while others are simply enjoying the food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 567254, "caption": "A small child is the main subject in the image, standing in front of a TV. In the background, there is a room with a couch, and two more people can be seen in the scene, although they are not in focus. The child is brushing her hair, likely with a pink brush, while looking at the TV.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["person", "vase", "tv", "bottle"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 79261, "caption": "There is a man in a swimming pool in the middle of a street filled with water, cleaning and paddling the pool with a broom. The street is filled with people from all different places, sitting and standing around the area. One person can be seen standing right next to the man mopping the floor, while a few more are dispersed around the scene, watching the action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": "The image captures a man sitting at a dining room table, using his laptop. He is wearing a hat and a black jacket, and has his hands on the table, likely working on his laptop. Around him, there are various items on the table, including a bottle of water, two glasses, a cup, a bowl, and a remote. The man appears to be relaxed, possibly engaging in an activity that he enjoys.", "mscoco_hallucinated_words": [["cup", "cup"], ["bowl", "bowl"], ["remote", "remote"]], "mscoco_gt_words": ["dining table", "laptop", "potted plant", "bottle", "chair", "person"], "hallucination_idxs": [61, 64, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.6666666666666666}}, {"image_id": 415828, "caption": " The image features a group of people walking down a long sidewalk alongside some tall buildings in an urban setting. A man with an umbrella is in the center of the group, standing underneath it for shade. The group consists of several individuals spread out across the scene, each carrying their own belongings or wearing backpacks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "traffic light", "stop sign", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 543231, "caption": " In the image, a woman is playing Wii with her friend. She is holding the game controller while standing in front of a television. The scene features several people, including two other women, one in the background on the left side and the other on the right side, and a man sitting closer to the background.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["remote", "bowl", "book", "bed", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.2}}, {"image_id": 495357, "caption": "The scene is an outdoor street event where a little boy is sitting on the back of a motorcycle. He appears to be enjoying his ride, as multiple people are standing around him watching and taking pictures. The motorcycle seems to be positioned on the left side of the scene, and other vehicles are also present in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "person", "handbag", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 523175, "caption": "According to the picture, a bowl filled with soup is present on a table. The soup is topped with several pieces of carrots, as well as some green beans and a spoon. The spoon is situated in the soup and likely used to eat from the bowl. The bowl is on the right side of the image and the spoon is situated on the left side, giving the scene a balanced composition.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 184945, "caption": "The image depicts a parking lot or garage area with several motorcycles parked on the sidewalk. There are a few bicycles positioned in various locations, one close to a motorcycle, and another one near the front row of bikes. A handbag is also visible in the scene, placed near the sidewalk.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 49942, "caption": " The image features a young man holding an umbrella over his head. He is staring up and slightly to the right, with an expression that suggests a slight confusion. The scene is captured from a bird's eye view, giving a sense of the entire person being under the umbrella. The man is wearing a tie and a multi-colored shirt.", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 223019, "caption": "There is a blue train parked at a train station in the image. It appears to be on the lower portion of the platform, where the train would typically pull in. There are three people standing near the train: two closer to the front and one closer to the middle. They are all dressed in different outfits, with one person wearing a coat and another one wearing a hat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 42889, "caption": "The image displays a teddy bear sitting next to a laptop on a table. The teddy bear is placed right next to the keyboard, almost as if they're partners in some sort of conversation. On the table, there's also a remote control, slightly to the right of the laptop. It appears that the teddy bear is positioned in a spotlight, drawing attention to the setting.", "mscoco_hallucinated_words": [["laptop", "laptop"], ["table", "dining table"], ["table", "dining table"], ["remote", "remote"], ["laptop", "laptop"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear"], "hallucination_idxs": [10, 13, 40, 46, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": "The image features a shelf or table filled with various objects, including a pair of vases, one filled with flowers and the other with greenery, as well as some other plants and sculptures.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 247587, "caption": " The image features a man standing on a white surfboard at a beach location. He is likely enjoying the waves and the warm sandy environment. The background includes several trees, some closer to the foreground and others farther away, adding a lush atmosphere to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "In the image, a large city bus is parked on the street with its bicycle rack and the back of the bus towards the viewer. There are several bicycles parked next to the bus on the street, and a person appears to be taking the time to observe them. The street also has a bridge or overpass nearby, creating a transportation hub for the bus and bicycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 545056, "caption": " This image depicts a large field with many cows grazing on it. The cows are spread out across the grassy landscape, enjoying the pasture. There are black and white cows of various sizes, with some closer to the foreground and others further back in the scene. A few of these animals are laying down, likely resting after their grazing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "sheep", "cow", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": " The image features a man standing in the middle of a kitchen, likely at a restaurant or other food-related setting. There are several people in the scene, some of them further away from the man in the center. Various food items can be seen around the kitchen, including several bottles and bowls, as well as some apples and a few bananas. The man appears to be focused on his task, possibly preparing food or giving instructions to others in the kitchen.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["apple", "apple"], ["banana", "banana"]], "mscoco_gt_words": ["cup", "oven", "dining table", "bowl", "book", "carrot", "person"], "hallucination_idxs": [54, 62, 66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.2857142857142857}}, {"image_id": 2867, "caption": " The scene is set amid snowy ski slopes, with several people skiing together. The people are skiing in different directions and using different backpacks. There are three pairs of skis clearly visible in the scene, with two skis on the left side and one pair of skis on the right. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 246963, "caption": " The scene features a group of motorcycle riders driving down the road together, with several of them positioned at the back of the pack. The lead rider is positioned near the front of the group, creating an orderly formation as they make their way down the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "stop sign", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498225, "caption": " The scene features a group of people standing outside a bakery or cake shop, most likely in a bustling city or town. There are four distinct people in the image, two of them are women and the others are men. One of the women is wearing a white top, while the other has a blouse that's light in color. The men have a white and a light-colored shirt. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cake", "cell phone", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 124601, "caption": " There is a small kitchen with two ovens in it, placed one above the other. One of the ovens is located at the top of the image, while the other is below. There is a microwave positioned between them. The kitchen has a laptop and a computer monitor located on the countertop near the right side. A mug is situated closer to the left side of the image. Various beverages and a bowl are also visible, adding a touch of variety to the kitchen.", "mscoco_hallucinated_words": [["oven", "oven"], ["oven", "oven"], ["monitor", "tv"], ["bowl", "bowl"]], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [7, 20, 52, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.25}}, {"image_id": 581749, "caption": " The image features a variety of fruits and vegetables placed on a table, including oranges and broccoli. The oranges are displayed in different positions around the table, with some on the left side and some on the right. The broccoli is spread out in different locations, including on the left side, in the center, and on the right side of the table. The arrangement showcases a diverse selection of fruits and vegetables, making it a colorful and appealing display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 227960, "caption": "The image features a white square dish filled with food on a dining table. The food appears to consist of a mix of cooked broccoli and carrots. In the dish, there are multiple carrots and at least two spoons. One spoon is located near the center of the dish, while the other spoon is positioned closer to the right edge of the dish. There are also multiple forks placed around the dish.", "mscoco_hallucinated_words": [["table", "dining table"], ["broccoli", "broccoli"], ["fork", "fork"]], "mscoco_gt_words": ["cup", "bowl", "spoon", "wine glass", "carrot"], "hallucination_idxs": [13, 25, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.4}}, {"image_id": 286503, "caption": "The image features an elephant standing outdoors near a fence, likely in a zoo or park. The elephant is occupying the center of the scene, with a person standing nearby, likely observing the animal or taking part in some activity involving it. A few other people are also visible around the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": "The image features a large and clean kitchen with hardwood flooring, metal appliances, and modern touches. A sink is located near the center, surrounded by a variety of cabinets and drawers for storage.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "tv", "bowl", "potted plant", "sink", "microwave", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 142108, "caption": "The image depicts a group of women and young girls sitting and working around a dining table. There are several bananas placed on the table, covering large parts of it, while the group members wrap the bananas with different colors. The group of women is busy making banana bread, while some young girls are participating in the banana wrapping activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 475043, "caption": "In the image, there is a group of five people gathered around a couch in a living room. Four of them are sitting on the couch while one person is standing. They are all using laptops, working or browsing the internet. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "laptop", "bottle", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 456784, "caption": "The image depicts a person holding a knife and cutting a cake. The person appears to be right in front of the cake as they carefully cut the white frosted cake. The knife is positioned between the cake and the person's hand. Additionally, there is a chicken bone placed close to the knife, adding an interesting and humorous touch to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": "The image features a wooden park bench sitting outside, and a remote control is placed on top of the bench. The remote control appears to be positioned towards the left side of the bench. The park bench is situated in a grassy area, with no other objects or people visible nearby.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["remote", "chair", "bench", "tv"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 448694, "caption": " In the image, two young men are playing a game of frisbee as they battle it out in a field with their friends watching and cheering them on. One of the players is actively trying to catch the frisbee, while the other is focused on preventing it from being caught.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": " The image displays a delicious bowl of Chinese noodles with vegetables and meat. The bowl is filled with the noodles and vegetables, including broccoli and carrots, along with a fork and a spoon for serving and eating the dish. There is also a cup placed on the table nearby. The noodle dish is likely a popular and satisfying meal option.", "mscoco_hallucinated_words": [["spoon", "spoon"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [35, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 503210, "caption": "The image is of a living room filled with a couch, a TV, and a table. The couch is positioned against the far wall and extends across a large portion of the room, while the TV is placed near the center of the space. The table is positioned closer to the front of the room, possibly serving as a place to eat or a decorative piece. The room also features two windows, one along the far wall and another one near the front of the space. In the background of the scene, a piano can be seen, adding to the overall ambiance of the living room. The overall design of the room is spacious and welcoming, offering comfortable seating and a pleasant atmosphere for entertainment and relaxation.", "mscoco_hallucinated_words": [["tv", "tv"], ["table", "dining table"], ["tv", "tv"], ["table", "dining table"]], "mscoco_gt_words": ["clock", "potted plant", "vase", "orange", "book", "chair", "couch"], "hallucination_idxs": [13, 17, 39, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.14285714285714285}}, {"image_id": 142257, "caption": " The image shows a busy city street, with a red car traveling down the road. Several traffic lights are visible, positioned above the street at various points, controlling the flow of traffic. A few cars are also visible along the street, including one in front of the red car and another on the other side of the road. A person can be seen walking near the front of the red car. There are a few other traffic lights positioned at different spots around the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}], "overall_metrics": {"CHAIRs": 0.3616161616161616, "CHAIRi": 0.11395912712157949, "Recall": 0.5455381784728611, "num_hallucinated_caps": 179.0, "num_caps": 495.0, "hallucinated_word_count": 329.0, "coco_word_count": 2887.0, "length_response": 71.13535353535353, "hallucinated_caps_ls": [144305, 308003, 54870, 324008, 577912, 234433, 39053, 493862, 369547, 401027, 253630, 326021, 25812, 195645, 155671, 32625, 343534, 398753, 254368, 209347, 523292, 12375, 146837, 209925, 2154, 274931, 104326, 233888, 183766, 169169, 344942, 275929, 490491, 175878, 53744, 185327, 482432, 129707, 556478, 451943, 37907, 506401, 406611, 278365, 432213, 92131, 21779, 235595, 467887, 295656, 401010, 313928, 490284, 388237, 122602, 59710, 411564, 285302, 265628, 5934, 131138, 258209, 347453, 285820, 245664, 259345, 422328, 354559, 468005, 545594, 155131, 437720, 378831, 375769, 20650, 177015, 295105, 443259, 537861, 249969, 301837, 240681, 178753, 47781, 542792, 293703, 250540, 298067, 563938, 526414, 331084, 282553, 328818, 188136, 191691, 557447, 151081, 134223, 198163, 217593, 477016, 558784, 359164, 230983, 564629, 53431, 441840, 381031, 530520, 9628, 163518, 118124, 288729, 177721, 372220, 302789, 30667, 328110, 463678, 70815, 88269, 98979, 379443, 374061, 560018, 353320, 483227, 531327, 277073, 436814, 110449, 45550, 303099, 205193, 170099, 14940, 354540, 191301, 124236, 344930, 14285, 381615, 352703, 111109, 49005, 317254, 425569, 45435, 35827, 72656, 372087, 283118, 490711, 163728, 281028, 277051, 304984, 145448, 352900, 227438, 116037, 92107, 474499, 105782, 492057, 444304, 305000, 567254, 301402, 543231, 184945, 49942, 42889, 401411, 124601, 227960, 33753, 99053, 503210]}}}