{"overall_metrics": {"CHAIRs": 0.244, "CHAIRi": 0.09349593495934959, "Recall": 0.5015945330296128, "num_hallucinated_caps": 122.0, "num_caps": 500.0, "hallucinated_word_count": 207.0, "coco_word_count": 2214.0, "length_response": 52.864, "hallucinated_caps_ls": [144305, 164091, 54870, 324008, 52700, 39053, 493862, 128119, 401027, 507171, 231534, 209347, 12375, 2154, 104326, 300784, 409346, 344942, 133146, 185327, 348654, 454161, 529256, 368367, 295656, 401010, 313928, 346940, 278627, 122602, 24104, 59710, 527497, 411564, 61959, 5934, 131138, 258209, 285820, 245664, 453731, 87052, 418692, 259345, 449798, 375769, 426172, 333998, 443259, 301837, 309071, 240681, 47781, 14773, 261318, 134768, 526414, 188136, 357, 561967, 151081, 134223, 301266, 475813, 558784, 314016, 564629, 53431, 381031, 9628, 163518, 423984, 118124, 252625, 328110, 88269, 98979, 536154, 416384, 531327, 1268, 575406, 377879, 436814, 45550, 444913, 303099, 271565, 205193, 272566, 170099, 14940, 354540, 124236, 348251, 381615, 352703, 49005, 425569, 35827, 283118, 490711, 163728, 277051, 304984, 518287, 145448, 18150, 474499, 89541, 138834, 111409, 397268, 444304, 523175, 42889, 2867, 581749, 227960, 475043, 99053, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "The image features a computer with a laptop open on top of a mouse. The mouse is connected to the laptop, and there are multiple computer peripherals, including a keyboard, a monitor, and two mice placed at different heights. The laptop is using a cable and a second mouse, suggesting that it is connected to the computer through a network or a wireless connection", "mscoco_hallucinated_words": [["mouse", "mouse"], ["mouse", "mouse"], ["monitor", "tv"], ["mouse", "mouse"], ["mouse", "mouse"]], "mscoco_gt_words": ["laptop", "keyboard", "dining table"], "hallucination_idxs": [13, 16, 35, 39, 54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4166666666666667, "Recall": 0.6666666666666666}}, {"image_id": 164091, "caption": "According to the picture, a doll is sitting in a baby carriage, surrounded by beds. There is a chair on the left side of the scene and two other dolls, one of which appears to be a baby, are located around the baby carriage. The baby carriage, surrounded by the beds, is designed to resemble a traditional, old-fashioned type, with the baby sitting on the carriage", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "person", "bed"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 308003, "caption": "According to the picture, a wooden bench is sitting in a grassy area, surrounded by trees. The bench has a green and white striped pattern on it, and it appears to be in a shaded area, providing a comfortable place to sit and enjoy the outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": "The image shows a sandwich, lettuce, and tomato sitting on a white table. The sandwich consists of a meat and a vegetable, while the sides feature a plate of salad. The salad has vegetables such as pickles and tomatoes arranged in a pleasing fashion", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["knife", "sandwich", "fork"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 95908, "caption": "The image showcases a silver cup or mug placed on a keyboard, likely a computer keyboard. This makes it appear as if the cup has been placed in a place where someone might usually sit down with a computer or other electronic device. There are multiple keyboards around the cup, including a mouse pad close by", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 329614, "caption": "According to the image, there is a church situated near a street with two lanes and a sidewalk. A car is driving down the street in front of the church, and a red bus is further down the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 324008, "caption": "The image depicts a tray of food, with various items placed on it. The tray includes a banana, a couple of oranges, yogurt, a sandwich, and a muffin", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["dining table", "banana", "train", "orange", "airplane", "person", "cup"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2857142857142857}}, {"image_id": 52700, "caption": "According to the picture, there is a group of people standing around a table filled with a variety of cakes and desserts, all on different plates and decorated differently. The cakes range from chocolate cake to vanilla, carrot, and poppyseed ones. There are also several forks and knives on the table, ready to be used for serving the food", "mscoco_hallucinated_words": [["carrot", "carrot"], ["knife", "knife"]], "mscoco_gt_words": ["fork", "dining table", "spoon", "cake", "handbag", "person"], "hallucination_idxs": [41, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 501311, "caption": "According to the picture, there is a baseball player pitching a baseball on a grassy field. The pitcher is in the middle of the field, while a crowd of people stands around the pitcher watching the game. There are a total of 15 people visible in the scene, some of whom are likely fans attending the game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": "According to the image, it appears to be a street scene during the day, possibly during a cultural event or a parade. There are three elephants, two of which are decorated, in the scene, accompanied by several people", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "elephant", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": "According to the picture, the bathroom features a large white tub sitting underneath a window, surrounded by light-colored walls. The bathroom also includes a sink and a mirror, providing an attractive and functional space for bathing and grooming. The sink is situated on the right side of the image, while the mirror is found on the left side of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toothbrush", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 196341, "caption": "The image features a group of people standing in front of a large clock located in the interior of a historical building. The clock, which can be seen at the very bottom of the image, is positioned in a prominent location within the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "A couple is sitting on a cement path near the street, enjoying a sunny day. In the foreground, there is a pole on which a traffic sign is mounted, indicating a bike ride", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 455610, "caption": " In the image, there is a large red and green motorcycle parked on the street in front of a building, catching the attention of people around it. There are several individuals visible in the scene, some of them standing closer to the motorcycle while others appear further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 39053, "caption": "The image displays a table filled with various food items. There is a pita bread and a bowl of creamy sauce, along with an assortment of cups, bowls, and a vase arranged on the dining table. Among the cups, there is one placed near the bowl of sauce, while two more are positioned closer to the left side of the table.", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "spoon", "cake", "apple", "cup"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.42857142857142855}}, {"image_id": 493862, "caption": " The image depicts a person standing on a grassy hill, holding a tennis racket and looking up at the sky. The person appears to be wearing a red shirt and has a backpack resting at their back", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["tennis racket", "dog", "person"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 269873, "caption": " The image depicts a black and white photo of a group of people walking down the sidewalk in front of a bus stop. There are many people scattered across the image, some standing closer to the bus stop, while others are walking further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 524656, "caption": "In a small kitchen, a microwave oven is placed at the top, along with a toaster oven. There is a refrigerator located near the oven, which is also situated at the top", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "oven", "spoon", "bottle", "sink", "wine glass", "knife", "microwave", "toaster", "refrigerator", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.36363636363636365}}, {"image_id": 369547, "caption": "The scene features a black and white dog sitting on a beach, surrounded by pebbles. The dog is looking at its owner who is standing behind it, as if waiting for attention. The background reveals a vast ocean, and other birds can be seen scattered throughout the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "bird", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": "According to the picture, a man is standing in front of a refrigerator, holding a bottle. He is wearing a black shirt and glasses, and he has a big smile on his face. He is looking at the refrigerator with an open bottle, possibly because he forgot to close the bottle or left it open for later use", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "bottle", "sink", "person", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 128119, "caption": "According to the picture, a white city bus is parked on a city street, with people standing nearby. The bus has a few passengers on board, and it is stopped at a corner. The street is quiet, with no cars in sight", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["cell phone", "backpack", "bus", "bicycle", "handbag", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.16666666666666666}}, {"image_id": 401027, "caption": "According to the picture, there is a sandwich made with cheese, cucumber, and lettuce on a plate. The sandwich appears well-prepared and looks appetizing. A variety of vegetables, including carrots, are scattered throughout the plate. The table is adorned with a vase, and there are candles located next to the vase, as well as a wine glass near the bottom of the table", "mscoco_hallucinated_words": [["carrot", "carrot"], ["vase", "vase"], ["vase", "vase"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["fork", "dining table", "sandwich", "knife", "cup"], "hallucination_idxs": [35, 49, 59, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.4}}, {"image_id": 253630, "caption": "According to the picture, a living room is situated next to a large window. In the center of the room, there is a dark brown dining table sitting on a hardwood floor. The table has a glass top, and it is surrounded by various items like a television on the right side and another piece of furniture nearby. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "potted plant", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 326021, "caption": "According to the image, a woman is sitting at a table with a teddy bear placed on it. There is another teddy bear nearby, suggesting it might be part of the event", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317320, "caption": "The image portrays a busy street scene with a man and a bicycle passing by. Several people are walking on the street, and a couple of motorcycles are also present in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 507171, "caption": "According to the image, a man is seen riding a surfboard in the ocean, attempting to catch a wave. He is partially submerged in the water as he rides the wave, demonstrating his surfing skills. The waves are caused by a boat in the background, creating some excitement in the scene", "mscoco_hallucinated_words": [["boat", "boat"]], "mscoco_gt_words": ["surfboard", "cat", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 161227, "caption": "A chair is sitting in the middle of the image, with a remote control placed on it. A second remote control can also be seen on the side of the couch, partially covered by a pillow", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "remote", "chair", "couch", "keyboard", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 25812, "caption": "According to the picture, there is a person skiing down a snowy slope wearing a suit. Another person, possibly on a snowboard, is also visible in the background, heading up the same slope. The first person appears to be enjoying the snowy terrain, and both skiers are wearing appropriate clothing for the conditions", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267537, "caption": "According to the picture, several people are sitting on steps in the image. There are three women sitting together and holding umbrellas. They are dressed in long coats and appear to be enjoying the event or outing together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 195645, "caption": "According to the picture, a group of people, including adults, are gathered together on a grassy hillside. They are sitting at various spots, including several benches, with some people seated closer to the stroller and others more dispersed. The scene appears to be a relaxed, social occasion where people are chatting and enjoying each other's company.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 155671, "caption": "A small town is depicted in the image with numerous cars and trucks on the road at the intersection. There are a total of four vehicles on the scene, including a police car, a few cars, a taxi, and a truck", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 412894, "caption": "According to the picture, there is a crowd of people walking on the sidewalk in front of a building. Some of them are standing in front of a traffic light at the corner. They are spread out across the scene, covering a large area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "backpack", "traffic light", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 32625, "caption": " There is a table in the image with a variety of food on it. To the right of the table, there is a bowl containing a delicious looking meal, and a plate with some food on it is placed closer to the left side of the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "book", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 18688, "caption": " The image captures a baseball game in progress, with several players involved. A batter is at home plate, holding a bat and getting ready to swing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 343534, "caption": "According to the picture, there is a small doll or toy next to two stuffed animals. The small doll is dressed in a pink outfit and boots, and there is a teddy bear and another stuffed animal nearby. The teddy bear appears to be the same size as the other stuffed animal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "potted plant", "chair", "book", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 370657, "caption": "According to the picture, a group of people is riding on motorcycles. There is a man and a woman in the front, with a young girl behind them. Several baskets can be seen, suggesting that the riders are carrying things or groceries", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": "According to the picture, there is a man sitting in a baseball field, surrounded by a crowd of people. He is holding a hot dog in his hand and eating it while watching the game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460347, "caption": "According to the picture, a city street or highway features multiple lanes, with a combination of buses and cars traveling through them. There are several cars dispersed across the scene, driving in different directions, while buses can be spotted traveling both in front of the buses and in the background. The presence of various vehicles on the street suggests that it is a bustling transportation route", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 398753, "caption": "The image shows a large crowd of people gathered around a baseball field, watching a baseball game in progress. The field is filled with baseball players, with a baseball bat held in one player's hand as they stand on the field. The crowd, consisting of many spectators, watches the game with excitement and engagement", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 67833, "caption": "According to the picture, a large pizza with various toppings, such as fish, olives, and onions, is being served on a table. The pizza is placed on a white plate, and several bowls and cups are also placed on the table. A bottle of red wine is positioned in the middle of the table, along with several glasses", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "bottle", "knife", "wine glass", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 272880, "caption": " In the image, there is a pizza with the label \u201cMystic Pizza\u201d, which appears to be the only item on the plate. The pizza is cut into various pieces, with one piece close to the right side of the plate and another one close to the center of the plate. There is also a fork on the table, possibly used for serving the pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 254368, "caption": "The image features a man in bed eating food. He is sitting on the bed and holding a plate with food on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": "According to the picture, there are multiple baseball players involved in a game, including a young man in the center, who is swinging at a baseball while holding a bat. Two other players are positioned behind the batter, one closer to the left and another closer to the right, watching the batter's movements.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 459141, "caption": "According to the image, a woman is riding a bicycle while wearing a conical hat, carrying a large bag on the bicycle's back rack. The woman appears to be in the middle of a street or sidewalk, possibly on her way to or from a store. There is also a small pile of items near her bicycle, which could be additional grocery items or items she is carrying with her", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image depicts a tarmac filled with multiple vehicles, including an airplane parked near a ramp. A variety of vehicles are involved in loading or unloading the airplane, including trucks, vans, and cars", "mscoco_hallucinated_words": [["van", "car"], ["car", "car"]], "mscoco_gt_words": ["handbag", "airplane", "person", "truck", "suitcase"], "hallucination_idxs": [34, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 227031, "caption": " The image portrays a man and woman, dressed in formal attire, standing next to a multi-tiered cake, likely during a special occasion like a wedding or anniversary. They appear to be joining the cake together, likely cutting it for the first time as a couple", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "tie", "cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 209347, "caption": "According to the picture, a woman is sitting on a subway train, looking at her cell phone. She is wearing a black jacket and has a purse, and she is surrounded by other passengers. Some people are nearby, while others are sitting further back in the train car. A couple of bottles are also visible, likely placed on the train's surfaces or held by some of the passengers", "mscoco_hallucinated_words": [["car", "car"], ["bottle", "bottle"]], "mscoco_gt_words": ["cell phone", "train", "tie", "handbag", "person"], "hallucination_idxs": [53, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.8}}, {"image_id": 523292, "caption": "According to the picture, a fluffy cat is sitting on a table near a laptop, looking at something on the screen of the laptop. The cat is spread out on the table and the laptop is placed in front of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cat", "chair", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 12375, "caption": "According to the picture, the image features a white plate filled with a variety of foods. The plate has carrots, potatoes, and some other vegetables, along with some meat and rolls. The food is placed on a table, with multiple meat balls and rolls on the dining surface", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 13550, "caption": "According to the picture, it shows a woman sitting on a rock next to an elephant, petting it while it is standing on the rock. The woman seems to be enjoying the company of the elephant and has her arms around it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "elephant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": "According to the image, a young man is performing a trick while riding a skateboard on a city street. The man is actively in motion, catching some air and likely trying to show off his skills to the bystanders", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": "A man is standing in a long, narrow corridor, looking at his cell phone as he walks. He appears to be concentrated and focused on the device, possibly reading messages or checking for updates", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 209925, "caption": "According to the picture, an image depicts a baseball player standing on a field, getting ready to pitch a baseball. The player is wearing a baseball uniform, and has a baseball in his hand. He is shown on the mound, getting set to throw the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 2154, "caption": "According to the image, a shopping cart is parked near a bench and a dining table. There is a trash can and a few other items scattered around the area", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bench", "car", "person", "umbrella"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 274931, "caption": "The image is a black and white photograph of a street scene featuring several parked cars, all of which are old cars. They are all lined up along the curb of the street, creating a line of parked cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "traffic light", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "According to the picture, there is a long dining room table situated between two rooms: a kitchen and a living room. There are chairs of various heights on each side of the table, and they can seat more than six people. Several potted plants are situated around the dining area, adding a touch of natural beauty. The kitchen contains a dishwasher, oven, and a microwave, along with multiple bowls placed on the counter", "mscoco_hallucinated_words": [["person", "person"], ["bowl", "bowl"]], "mscoco_gt_words": ["dining table", "potted plant", "oven", "chair", "clock", "sink", "knife", "microwave"], "hallucination_idxs": [44, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.625}}, {"image_id": 233888, "caption": "According to the picture, there is a young boy playing baseball in front of a television. The boy is holding a baseball bat and has an outfielder's glove on his left hand", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "baseball bat", "person", "baseball glove", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 213905, "caption": "According to the picture, there are two women standing on a grassy field. They appear to be friends and are smiling at each other, possibly in the middle of a conversation. Both women are wearing different outfits \u2013 one is wearing a white and blue top, while the other is wearing a black and white outfit", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "backpack", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 277479, "caption": " The image depicts a young girl, possibly a toddler, walking towards a refrigerator in a kitchen. She is holding a bottle in her hand and has a toy in her other hand, possibly about to drink or eat. The refrigerator has several bottles in it, with one located near the middle and others further down towards the left side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 300784, "caption": "According to the picture, it features a group of children sitting on a bench in front of a large dog. The dog seems to be a focus of attention for the children, as they appear to be looking at the animal. One of the children is holding a drink, while the other two are holding something that appears to be a book. The dog is seated next to them", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["couch", "dog", "person", "bench"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image displays a dessert, likely a cheesecake, decorated with fresh berries and on a plate. There are two pieces of cake on the plate \u2013 one larger one and another smaller one. The dessert is likely to be perfect for a special occasion or as a treat to indulge in", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "cake", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 169169, "caption": " According to the image, there is a street scene showing a sign advertising an event, likely the Springs in the City event, with a green banner and a poster. The banner is visible above the sign and the poster appears to be promoting the event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 409346, "caption": "According to the picture, a group of people, including a woman and a man, are standing around a table, enjoying a meal. The woman is holding a knife while the man is holding a fork. There is a cake in front of them, and various snacks and appetizers, such as cupcakes, are spread out on the table", "mscoco_hallucinated_words": [["knife", "knife"], ["fork", "fork"]], "mscoco_gt_words": ["bowl", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [32, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.6}}, {"image_id": 522262, "caption": " The image captures a sidewalk where many bicycles are parked, lined up in a row parallel to the sidewalk. There are different types of bicycles, such as two small bikes, two medium-sized bikes, and two large bikes", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "According to the image, a man is holding a yellow ball in his hand and preparing to throw it against a wall. He is standing in front of glass, and the ball is floating in front of the glass, giving the illusion of being held up by the glass. The man is in the foreground, visible through the glass", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "According to the picture, there is a small wooden toy, or \"cube,\" positioned on the edge of a laptop. The toy is perched above the keyboard and is looking down at the laptop screen. The image appears to be taken in front of a window, adding a natural and relaxing background.", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": "According to the picture, there are two porcelain toilets sitting on the stairs. They are placed side-by-side on the steps, creating an unusual and interesting sight. The toilets appear to be old and in need of maintenance or repair, which is likely why they are sitting on the stairs", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 465601, "caption": " According to the picture, a group of people, including women, is standing on a sidewalk, looking at a building in the background. The people are wearing hats and holding umbrellas, suggesting that they might be protecting themselves from the rain", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 300124, "caption": "The image depicts a small green chair sitting on the side of a road, near a curb. The chair appears to have a piece of plywood resting on top of it, likely serving as a makeshift table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "dining table", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 46919, "caption": "According to the image, a man is walking uphill on a snow-covered hill. He is wearing a backpack and using a pair of snowshoes to traverse the snow-covered surface. The hill has a relatively steep incline", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 434022, "caption": " The image portrays a white train engine traveling down a train track, with smoke billowing from the stack. The long train consists mainly of black and gray railroad tracks", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 490491, "caption": "The image captures a city street scene with a cement wall and a building situated near the sidewalk. Two people can be seen standing in front of the cement wall, possibly passing by the area or having a conversation", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 175878, "caption": " In the image, there is a group of four people, including a young woman and two men, who are standing and talking near a balcony. They appear to be having a good time together, engaged in a casual conversation", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 12669, "caption": "In this image, a young woman is standing in a room surrounded by people, holding a wine glass. She appears to be smiling and engaged in conversation with the other attendees. The room looks like a social gathering or an event", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "vase", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 192496, "caption": "The image shows a blue train traveling on train tracks, sitting between two passenger cars in front of it. The train appears to be parked and waiting for another train to continue on its journey. The passenger cars, one positioned closer to the left side and the other near the right side, are not as large as the train itself, but they still contribute to the overall scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 133146, "caption": "According to the picture, there is a spacious living room with a fireplace and chairs in it. The living room is filled with comfortable furniture such as a couch situated near the fireplace and two chairs placed around it. The room appears to be tidy and well-maintained, with books and a clock visible on a table", "mscoco_hallucinated_words": [["clock", "clock"], ["table", "dining table"]], "mscoco_gt_words": ["vase", "chair", "couch", "book", "tv"], "hallucination_idxs": [55, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6}}, {"image_id": 53744, "caption": "In the image, there is a plate with sliced tomatoes and lettuce. Two sliced tomatoes are placed on the plate, and a knife is next to them, possibly being used to prepare or eat the vegetables", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "sandwich"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 375902, "caption": "The scene depicts a man, dressed in a white ski suit, crossing the finish line after participating in a skiing competition. He is wearing blue ski boots", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": "According to the picture, the scene is set in a crowded room where multiple people are present. The focus of the image is a young man, standing at the right side of the room, holding a cell phone to his ear", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bottle", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 52435, "caption": "A group of people is spending time at a beach, with some of them paddleboarding in the ocean. The scene features a few surfers, including one paddleboarding person, and many other people watching them from the shore or walking along the beach", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "bird", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": " In this image, a cow is standing in front of a store, possibly waiting to be sold. The store is filled with a variety of sodas, with some bottles closer to the foreground and others positioned further back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 261097, "caption": " The image features a tennis court with a lot of chairs arranged around the court. Several people are present in the scene, including a tennis player with a blue shirt and white shorts who is walking across the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 227204, "caption": " In the image, four women are sitting around a table, enjoying a meal together. They are laughing and smiling as they chat and enjoy the food they are sharing. The table is filled with various cups, bowls, and utensils, which suggests that they are indulging in a variety of breakfast items like bananas, milk, and cereal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "banana", "spoon", "bottle", "knife", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 353981, "caption": "The image is of an old airplane parked on the runway. There are three people standing near the plane", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 185327, "caption": "The image depicts a group of elephants walking down a street. There are five elephants walking in a line, with a vehicle in the background and another elephant in the far right of the scene. The elephants appear to be walking toward a bus or an adjacent building", "mscoco_hallucinated_words": [["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"]], "mscoco_gt_words": ["bus", "horse", "car", "person", "cow"], "hallucination_idxs": [6, 15, 29, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 348654, "caption": "A white cell phone is sitting on a desk in front of a computer. The phone has various buttons and a speaker, and it appears to be connected to a keyboard. The keyboard has its own screen and has a mouse nearby, placed to the right of the phone", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["keyboard", "keyboard"], ["mouse", "mouse"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [32, 35, 43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.75}}, {"image_id": 482432, "caption": "According to the picture, there is a black and white image showcasing a toothbrush and toothpaste sitting next to a sink. The toothbrush is positioned closer to the center, while the toothpaste is to the left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "toothbrush", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": "The image features a delicious pizza that is being enjoyed by a couple at a table. The pizza is cut into slices, placed on a white plate in the center of the table. Near the pizza is a fork, which may be used for serving or eating. Additionally, the table also has a knife positioned close to the plate of pizza", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["pizza", "fork", "dining table", "wine glass"], "hallucination_idxs": [59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 144723, "caption": "According to the picture, a small dog is flying through the air while catching a frisbee. The dog appears to be leaping from the ground into the air, excitedly catching the flying disc", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "dog", "frisbee", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "The image shows a person holding a cell phone in their hand, displaying a large screen. The phone appears to be black, with buttons on the front. The cell phone is being used, and the person may be speaking or receiving a call", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "The image depicts a couple of video games laid out on a table. There is a small white Nintendo Wii game controller, along with a book, which could be a guide or manual for the gaming system", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 203110, "caption": "The image captures a large, orange-colored trolley traveling down a city street, passing by buildings on both sides. There are several people visible in the scene, some walking on the street and others standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "train", "orange", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 556478, "caption": " According to the image, two pizzas are being cooked on a large oven surface, with one being more finished than the other. The finished pizza is placed in a pan, and the other pizza is still on the tray, indicating that the second pizza is being cooked further.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "oven", "bottle", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 103815, "caption": "The image captures a moment of action in a baseball game, with a baseball player on the mound, pitching. There is a baseball player wielding a baseball bat, likely trying to hit the pitched ball during the game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": " The image features a bed or beds in a room. A white and grey pillow sits atop the mattress", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "dining table", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 268854, "caption": "According to the picture, the scene is of a group of people standing on the beach near water, enjoying a sunny day. There are multiple people, possibly a group or a family, spread out in various positions along the beach. They appear to be moving around the area, possibly interacting with each other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 34464, "caption": "In the image, a group of people, including children and adults, are standing on the beach enjoying themselves. One person is holding onto a red inflatable kayak, while a surfboard can be seen floating near the water. The group appears to be having a good time, interacting with each other and playing in the ocean", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "surfboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 495513, "caption": " The image shows a group of people gathered outside at night, enjoying themselves at a party or event. There are two main individuals, a man and a woman, standing close together and holding hot dogs over an open fire", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": "The image depicts a dog sleeping on one side of the bed, while a cat is sleeping on the other side, creating a cozy atmosphere in the room. The bed is adorned with a red blanket, adding a warm and inviting ambiance", "mscoco_hallucinated_words": [["cat", "cat"]], "mscoco_gt_words": ["dog", "book", "bed"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": " The scene features a cat sitting on a flat-screen television in a room. The cat is looking towards the television, which is on", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The scene takes place on a street, with an intersection at the end of the road. There is a large, white statue of a wolf near the intersection", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 406611, "caption": "According to the picture, there is a large group of people gathered around a ski slope. Many of them are watching skiers and snowboarders as they navigate down the mountain. The people are standing near the snowy slope, chatting and enjoying themselves", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": " According to the picture, a white truck, or possibly an RV, is parked on a street next to other vehicles. There is an air conditioning unit on the truck, and three satellite dishes are located on top of the truck and behind it. Several people are standing around the truck, likely waiting for work to be done on the RV, or watching the workers as they go about their tasks", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "car", "person", "truck", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "According to the picture, a man and a woman are sitting on a patio, enjoying a meal together. They are seated around a table filled with various food items, including slices of pizza and a couple of beverages, likely drinks. The woman appears to be eating a slice of pizza, while the man is holding a piece of food. The setting seems relaxed, with a comfortable atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "chair", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 69757, "caption": "According to the picture, the scene features a group of people gathered around a table celebrating a special occasion. Three individuals can be seen, including a woman and a man, as well as a child sitting at a chair nearby. A delicious cake has been placed on the table, decorated with colorful candies", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "cake", "knife", "wine glass", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 432213, "caption": "The image features a plate of food, which appears to consist of a sandwich and French fries. The sandwich is likely a cheeseburger, as it appears with a burger-like shape and features a couple of meat items", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "sandwich", "wine glass", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 368367, "caption": "According to the picture, there is a young woman sitting at a table in a restaurant, smiling as she talks on a cell phone. The phone is positioned on the right side of her face, and she appears to be talking to someone while enjoying her meal", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "The scene features a stretch of sand, possibly dirt or mud, piled along the side of a street. The sand appears to be covering a significant portion of the roadway, making it difficult to see the actual road surface", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 92131, "caption": "In the image, there are several people standing near a large green bus parked on a street. A woman is among the group, and a man can also be seen in the scene. The group of people appears to be waiting for the green bus to start or simply observing it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": " The image shows a young woman, laying on the floor, and holding a red backpack. She is smiling and appears to be happy, perhaps after unpacking or taking a break from traveling.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "A baseball game is being played in the image. In the foreground, a baseball player is standing at home plate, holding a baseball bat in his hands and ready to hit the ball. A baseball glove can be seen nearby, placed on the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 160183, "caption": " According to the picture, a fire truck is parked in front of a building with a large fire hose extended from the back of the truck. The truck is surrounded by a group of people, and there are a few individuals standing near the fire truck.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 555456, "caption": " According to the image, it is snowing at a ski resort with a group of people enjoying the slopes. There are at least six people on the slope, some of them are skiing and some are just standing. Two snowboards can be seen on the slopes, which may indicate that some of the skiers have decided to use snowboards instead. The scene appears to be set against a forest, with some trees visible in the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 235595, "caption": "According to the picture, there is a group of motorcycles parked along a busy street. The motorcycles are lined up in a row, with some of them having sidecars attached. The scene captures the busy street and the parked motorcycles", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "potted plant", "car", "parking meter", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 489909, "caption": "According to the picture, there is a variety of different fruits and vegetables placed together in a basket. The basket includes oranges, bananas, and a couple of medium-sized potatoes, all laid out in a visually appealing and inviting arrangement. The oranges are placed towards the left side of the image, while the bananas and potatoes occupy different parts of the basket", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "orange", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": " The image portrays a large vase or a flower pot filled with a variety of pink and purple flowers, creating a beautiful arrangement. The flowers are spread out evenly across the vase, with some larger flowers standing out more prominently", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": "According to the picture, the scene depicts a young girl standing in front of a group of people, holding a remote, possibly a Wii remote. The group of people, which consists of both men and women, is gathered around her, watching her. It seems that she is about to demonstrate something, such as playing a video game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "potted plant", "chair", "couch", "book", "handbag", "person", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 409138, "caption": "In the scene, there is a white and red bus parked on the street, surrounded by variously sized trees growing along both sides of the road. A number of cars are parked nearby, with some closer to the sidewalk and others further back along the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 411840, "caption": "According to the picture, there are two women walking on a grass tennis court. One of the women is wearing a white tennis outfit, while the other is wearing a blue tennis skirt. Both women are holding tennis rackets in their hands, one of them is close to the right edge of the court and the other closer to the middle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": " The image depicts a close-up view of a pizza slice on a plate. The slice is covered with cheese, greens, and bacon", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": "In the image, there is a teddy bear wearing headphones, placed on top of a table or surface. The teddy bear is surrounded by various items, including books, a few located near the teddy bear itself and the rest scattered across the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "dining table", "bottle", "bear", "keyboard", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 399058, "caption": "The scene takes place on a road with an elephant riding on the street, with a person on its back. The elephant is quite large, and the person is sitting on its back, facing towards the right side of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "orange", "person", "truck", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 183571, "caption": "According to the picture, a man is sitting on a boat and playing with a dog, which is sitting on the floor of the boat. The man and dog appear to be interacting, possibly playing games or having fun together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "The scene features a large, beautifully-designed motorcycle parked on a red carpet. The motorcycle is prominently displayed, taking up a significant portion of the display area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 437910, "caption": "According to the picture, the scene depicts a bathroom situated within a larger room. The bathroom is small and has a toilet and sink, along with a door on both sides. The sink is located closer to the toilet, and both the toilet and sink are within the larger room. The room appears cluttered, making it appear messy and disorganized", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": " The image features a picturesque scene of a village situated on the water, with several boats docked along the shore. The village is surrounded by houses and a few birds can be seen flying above, creating a lively atmosphere. There are a couple of seagulls perched on a roof, providing a further sense of life in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 467887, "caption": " In the image, a man wearing a red shirt sits at a desk using a laptop, which is displayed in front of him on the table. There is another laptop in the background, situated on a lower part of the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "couch", "mouse", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 174402, "caption": "The image features a man wearing a blue and white tennis shirt and holding a tennis racket, standing on a tennis court. He is posing for the camera, looking focused on his upcoming game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 352734, "caption": " The image depicts a young boy sitting at a dining table, eating ice cream or other sweet treats. The child is enjoying his meal with a fork in hand, biting into a piece of chocolate or another dessert", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 346716, "caption": "According to the image, there are two red double-decker buses driving down a city street in the city. The buses seem to be running parallel to each other as they travel along the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": "According to the picture, there are two men standing on a snowy slope, looking at the view. One of the men is wearing a backpack and the other is wearing a pair of red pants. They are both holding ski poles, possibly while skiing or preparing to ski down the slope", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 295656, "caption": "According to the picture, there is a group of people standing together, dressed in formal attire. They appear to be enjoying a meal, as one man is holding a plate of food. One woman can be seen holding a slice of pizza, while another is sharing a piece of cake with someone else. Another slice of cake is present, and there are two other men in the group, adding to the social atmosphere", "mscoco_hallucinated_words": [["cake", "cake"], ["cake", "cake"]], "mscoco_gt_words": ["chair", "tie", "clock", "car", "person", "pizza", "tv"], "hallucination_idxs": [55, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.2857142857142857}}, {"image_id": 129886, "caption": "The image shows a group of people skiing down a snowy slope, surrounded by snow-covered mountains. There are at least nine people in the group, with some standing at the base of the hill, while others are skiing down the slope. Some of the people are wearing skis and are skiing down the hill, while others are dressed in warm clothing and seem to be enjoying the snowy scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, a young woman is sitting at a wooden table, enjoying a bottle of wine and a bottle of juice. She appears to be drinking and possibly writing a letter or journal entry. The table is located in a green, lush environment, such as a park or a garden, which adds to the natural atmosphere of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 401010, "caption": " The image features a room with a large bed positioned in front of a window. The bed is neatly placed and appears to have white sheets. There is a desk or counter close to the bed, possibly for a student or office setting. The room also has a television set, and there is a chair on the other side of the bed", "mscoco_hallucinated_words": [["desk", "dining table"], ["student", "person"]], "mscoco_gt_words": ["chair", "sink", "tv", "bed"], "hallucination_idxs": [31, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 567562, "caption": "According to the picture, there is a family of three, a woman and two young girls, sitting at a table and enjoying a meal together. They are surrounded by various pizzas, salad, and drinks. There are two bottles of soda on the table. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "spoon", "bottle", "book", "broccoli", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.36363636363636365}}, {"image_id": 313928, "caption": "The image depicts a balcony overlooking the ocean with a beach umbrella and surfboard, creating an enjoyable atmosphere on the balcony. There is a small dining table and a chair set out, offering a comfortable seating arrangement to view the scenery. In the foreground, there are two chairs positioned next to each other and facing towards the ocean view, providing a pleasant seating area for people to enjoy their time on the balcony", "mscoco_hallucinated_words": [["umbrella", "umbrella"], ["person", "person"]], "mscoco_gt_words": ["surfboard", "dining table", "chair", "bottle", "wine glass"], "hallucination_idxs": [11, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6}}, {"image_id": 490284, "caption": " A black dog is sitting on a couch, leaning on a person's lap and resting its head on the person's arm. The dog is positioned towards the left side of the image, while the person is on the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346940, "caption": "In this image, there is a hot water bottle steaming on the counter of a kitchen. The bottle is located in the middle of the counter, surrounded by various kitchenware. There are two kettles in the scene, one near the hot water bottle and the other further to the right, possibly containing water for boiling", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [9, 19, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 222782, "caption": "The image is of a black and white scene featuring a sign that says \"AMOCO\" in the middle of a road. There is a person in the background walking towards the left side of the image, and there are several people standing near the sign. The street appears to be empty, and there are no cars or other vehicles in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 255934, "caption": " According to the picture, there are two elephants in the scene, one larger and one smaller. The larger elephant is walking alongside the smaller ones, possibly leading the way. The elephants are in a dirt-covered area, possibly a zoo or a preserve. There is a handbag in the scene, which may be the handbag of the person in the picture or just an object belonging to someone else", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "elephant", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 278627, "caption": "According to the image, there are multiple photos showing different dining experiences. In one photo, a man and a woman are sitting at a table, surrounded by various food items. There are bowls of soup, various fruit dishes, a plate of seafood, and a large salad with vegetables", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["fork", "dining table", "potted plant", "spoon", "chair", "bottle", "person", "cup"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 400851, "caption": "In the image, there is a large group of people enjoying a sunny day at the beach. Some are lounging on the sand, while others are relaxing on a bench", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "boat", "chair", "sports ball", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 472610, "caption": "A woman and her dog are standing near a field surrounded by cows. The woman appears to be smiling, and the dog is wearing a leash", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": " According to the image, a group of people is gathered around the beach, enjoying a day at the water. There are multiple boats in the water, including a small boat with four men on it, and a larger boat with a sail, likely a windsurfing sailboat, with a dog on it. The scene captures a moment at the beach with the people spending time there and exploring the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388237, "caption": "The image depicts a group of young people participating in skateboarding activities in the street, with several skateboards present. There is a man in the center of the scene, riding his skateboard while others are skating around him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 289640, "caption": "According to the picture, a group of people is gathered at a dining table with a frisbee on top of it. They appear to be enjoying a drink at a party, laughing and creating a festive atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "wine glass", "orange", "person", "cup", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 122602, "caption": "According to the picture, there is a person working on a craft project on a table. He is using a variety of tools such as scissors, a bottle, and a cutting mat. The table is covered in a collection of various items, including scissors and a cutting mat", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["scissors", "bottle", "dining table"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 24104, "caption": "According to the picture, a person is sitting at a desk with a computer set up in front of them. The desk is equipped with three monitors displaying content: two computer monitors and one TV monitor. There are two cups on the desk, likely for coffee or other beverages, and a book in front of them", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "bed", "mouse", "keyboard", "book", "laptop", "tv", "cup"], "hallucination_idxs": [6], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 0.625}}, {"image_id": 183709, "caption": "According to the picture, there is a group of people gathered at an intersection. The people, estimated to be around 15, are standing in the middle of a crosswalk, looking at a man who is standing near a traffic light. There are cars parked and driving on the streets around the intersection", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 59710, "caption": "The image features a black refrigerator with its doors open, showcasing the interior. The refrigerator has several bottles and cups inside, providing a view of a kitchen setup. Additionally, there are a few more bottles visible in the scene, further enhancing the refrigerator's contents. The refrigerator and bottles are set in a bedroom, which can be seen in the background of the scene", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["chair", "dining table", "refrigerator"], "hallucination_idxs": [19, 21, 39, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "According to the picture, a young boy, possibly a baseball player, is in the middle of a field wearing a green and yellow baseball uniform. He is holding a baseball in his hand and appears to be smiling or laughing. There are multiple other people in the scene, but their exact roles or connections to the main subject are not evident", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "sports ball", "bench", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 314729, "caption": "According to the picture, it is a vintage baseball image featuring a man in a baseball uniform. The man is positioned in a batting stance, holding a baseball bat in both hands. The image appears on a white background, and the man is prominently displayed", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": "According to the picture, there is a desktop sitting on a desk with a keyboard and mouse sitting next to it. The computer setup is complete with two monitors, one on the left side and another on the right side of the desk. A cell phone is also present, placed near the top of the desk, to the right of the left monitor", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "laptop", "tv"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 411564, "caption": "According to the picture, a young man is sitting on a chair near a window, looking down at his phone. The window is open, allowing natural light to enter the room. The chair sits near a dining table, which contains various items such as a cup, a bottle, and a bowl. The man is positioned in front of a refrigerator, with a few additional bottles placed around the table", "mscoco_hallucinated_words": [["cup", "cup"], ["bowl", "bowl"]], "mscoco_gt_words": ["cell phone", "dining table", "chair", "bottle", "person", "refrigerator"], "hallucination_idxs": [51, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 320350, "caption": "According to the picture, there are two people standing on snowy ground with two dogs, one closer to the left and another on the right. The man is wearing skis, and the woman is wearing skis as well. The dogs are wearing skis, too", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": "The image captures a baseball game with several players on the field, all wearing baseball uniforms. The batter, wearing a number 10 jersey, is up to bat, holding a baseball bat", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "sports ball", "bench", "person", "baseball glove", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 285302, "caption": " According to the picture, the image features a young boy sitting at a dining table, surrounded by many pieces of paper and various types of scissors. He appears to be engaged in making a paper craft, possibly a card or a sculpture. The dining table is a significant aspect of the scene, along with various chairs positioned around it, some closer to the boy and others further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "scissors", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 61959, "caption": "The image captures a large city bus driving down the street past various businesses at an intersection. The bus is surrounded by other vehicles on the road, and there is a stop sign on the corner. The intersection appears to be a busy part of the city, as there are several cars and a person crossing the street nearby. The image depicts a typical day in the city with the bus, cars, and pedestrians all moving about", "mscoco_hallucinated_words": [["stop sign", "stop sign"], ["person", "person"]], "mscoco_gt_words": ["bus", "traffic light", "car", "truck"], "hallucination_idxs": [33, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 229659, "caption": " Two people are sitting in a cafe, one at a table with a laptop in front of him, looking at the screen, while the other people gather around him in a restaurant. The laptop screen shows an image, likely a website or a social media platform, as the individual focuses on his device", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "clock", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 473118, "caption": "According to the picture, there's a man riding a skateboard on a ramp, performing an aerial trick by jumping off the ramp and flying through the air. The skateboarder appears to be in his mid-twenties and is wearing a blue shirt. There are other people around, either enjoying the ramp or waiting their turn", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 391222, "caption": "According to the image, two people are standing outdoors, wearing skis and standing in a snow-covered area. They appear to be discussing something together, possibly at a signpost, possibly navigating their path through the snowy terrain. The signpost is positioned near the center of the image, attracting their attention", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416104, "caption": "According to the picture, there is a group of people gathered around various blue dining tables outside, creating a relaxed atmosphere. The people are sitting, standing, and walking around the area, enjoying their time together under large outdoor umbrellas", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 511852, "caption": "According to the image, there are two people skiing down a snowy slope, enjoying their time together. One person is wearing a blue jacket and the other is wearing a red jacket. The two skiers appear to be a little distant from each other", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 59386, "caption": "In the image, a group of giraffes is gathered together in a grassy area, likely a park or zoo exhibit. They seem to be eating leaves off a tree or from the surrounding vegetation, providing them with nourishment", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "According to the picture, a street is empty with only a few cars, and a green traffic light is positioned in the middle of the road, near a city streetlight. The light is mounted on a post or a streetlight pole, signaling a green traffic light is present", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": "According to the picture, a man and his dog are crossing a street at an intersection, which is heavily populated with cars. The man is riding a bicycle while the dog is walking alongside him, crossing the street at the same time", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bicycle", "traffic light", "car", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 265628, "caption": "According to the picture, a man is sitting on a toilet located in a small room with a round window. There is another toilet in the room, close to the man, suggesting it might be shared. The man is wearing two pairs of shoes, one closer to the camera and the other slightly lower in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "toilet", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499594, "caption": "According to the picture, a plate of food has been prepared, containing a piece of grilled steak with broccoli. The steak is nicely cooked and accompanied by the green vegetables, which add a nice contrast to the dish. This meat meal is being served on a white plate, likely with a salad or side dishes", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "bottle", "knife", "broccoli", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 5934, "caption": "According to the picture, there is a baseball glove, baseball, and baseball bat sitting on a bench. The baseball glove is holding the baseball, creating an ambiance of a baseball game. The baseball and glove are placed close together, emphasizing their connection and essence of a baseball game", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "According to the image, it is a dark night with blue lights shining on a body of water. A group of people are walking along the river's edge, some of them on the shore side while others are out on the water, likely on a boat. The river or canal appears to be quite long and may be a popular attraction in the city", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": " In the image, several dogs are running together in a grassy field. There are two large dogs, one closer to the left and the other closer to the center, with a small dog, one on the left and another on the right, playing together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "car", "truck", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 255483, "caption": "The image is a black and white photo of a man sitting at a desk, possibly working on a piece of writing. There are several pieces of paper spread out in front of him. One of them has an image of a person on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["scissors", "dining table", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 131138, "caption": "According to the picture, a busy office setting is shown with a computer placed on a desk. The computer has a screen, a keyboard, and mouse pad, all situated on the left side of the desk. On the right side, there is a laptop open with a keyboard and mouse, as well as a few additional items placed throughout the scene. There is a cell phone placed at the top of the desk, and another cell phone placed near the bottom of the desk", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["dining table", "potted plant", "spoon", "mouse", "keyboard", "person", "laptop", "tv", "cup"], "hallucination_idxs": [73, 85], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.4444444444444444}}, {"image_id": 258209, "caption": "According to the picture, a man is sitting on a bench, leaning against a building, and looking down at his phone. He is in a city setting, possibly a street in the background. The man appears to be focused on his phone, enjoying a moment of leisure", "mscoco_hallucinated_words": [["phone", "cell phone"], ["phone", "cell phone"]], "mscoco_gt_words": ["bicycle", "bench", "book", "person"], "hallucination_idxs": [23, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 347453, "caption": "According to the picture, the city scene is bustling with multiple buses and cars. The street is lined with buses going in opposite directions, creating a busy cityscape. There are also several cars driving along the street, adding to the traffic on the city road", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 285820, "caption": "According to the picture, there is a large living room filled with furniture, including a couch located on one side of the room. There are three cats present in the image, sitting in different parts of the space. A large TV is situated in the center of the room, dominating the view", "mscoco_hallucinated_words": [["cat", "cat"]], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "couch", "book", "bench", "tv"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2857142857142857}}, {"image_id": 245664, "caption": "According to the picture, an individual is seated in front of a wooden table that has a laptop, a computer monitor, and other computer-related items placed on it. There is a keyboard and a mouse on the table, suggesting that the individual is using the laptop and the computer for work or other purposes. Additionally, there is a cup and a wine bottle visible on the table. The presence of these items and the man sitting in front of them suggest a working environment or an area for personal or professional tasks", "mscoco_hallucinated_words": [["bottle", "bottle"], ["man", "person"]], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "wine glass", "book", "laptop", "tv", "cup"], "hallucination_idxs": [69, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.75}}, {"image_id": 453731, "caption": "According to the picture, a dog is laying on a wooden floor, surrounded by various items like a barrel, a bottle, and a wine glass. The dog is likely enjoying the floor the dog is laying on, with the wine glass and possibly even sleeping. There is also another bottle and an umbrella nearby", "mscoco_hallucinated_words": [["wine glass", "wine glass"], ["wine glass", "wine glass"], ["umbrella", "umbrella"]], "mscoco_gt_words": ["bottle", "dog", "cup"], "hallucination_idxs": [27, 45, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": " According to the picture, there is a group of four people in a room playing a video game together. Three of the people are actively participating in the game, while one person is watching and laughing. The participants consist of two women and a man, while the observer appears to be the third woman.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "couch", "book", "person", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 253262, "caption": "The scene shows a group of people standing next to an airplane on the tarmac. There are six people in the image, with a man and a woman being the most prominent figures. They are spread across the scene and seem to be watching the airplane or performing some activity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "airplane", "person", "truck", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 87052, "caption": "A young couple is standing in a kitchen, looking at each other while they work together at the counter. There is a microwave on the counter, which is near the refrigerator", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["bowl", "vase", "oven", "bottle", "person", "refrigerator", "cup"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.14285714285714285}}, {"image_id": 357578, "caption": "According to the picture, the scene shows a fire truck parked on a city street, likely waiting to respond to a call. The fire truck is red and has a ladder on its side, indicating its role as a fire engine. There are multiple cars driving on the street around the fire truck, with one white car in the foreground, a few cars further back, and another white car visible on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " A couple is sitting on a couch in a living room. The man is reading a book, while the woman is sleeping on the couch. The living room is filled with comfortable furniture, such as a couch, chairs, and potted plants", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["cell phone", "remote", "chair", "couch", "book", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 259345, "caption": "According to the picture, a man is wearing a colorful, rainbow-patterned shirt and standing under a pink umbrella. He is smiling at the camera while he holds the umbrella with one hand. The image is set outdoors, with a few cars parked in the background, adding context to the scene", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["chair", "person", "umbrella"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 119709, "caption": "According to the picture, the scene features a woman wearing an apron and working at a table dotted with numerous donuts. She seems to be preparing the donuts, possibly by spreading a mixture of flour and sugar onto them. The table is placed near the left edge of the image, and several donuts are spread around it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "person", "donut"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, a man and a woman are sitting on a bench in a public place. They are both wearing jackets, and they look calm and focused as they use their devices. They are holding their smartphones and appear to be engaged in activities using their devices", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": "According to the picture, a group of people is gathered on a large outdoor patio, likely sitting outside a building. They are situated under multiple umbrellas that line the patio, with some of the umbrellas in front of a fountain. The people appear to be engaged in conversation and spending time together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": "The image features a young man playing tennis on a tennis court. He is in the middle of the court, holding a tennis racket and swinging it to hit the ball during his serve. Several other individuals are present in the scene, scattered around the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "A group of people is walking down a city street at night. They are in a hurry and seem to be heading to their destinations, which are located along the street. The street is lined with various shops on either side, making the area a bustling commercial district", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bicycle", "handbag", "traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 276719, "caption": "According to the picture, there is a dog walking down a street with several classic car, creating a unique scene. There are people around the street as well, likely observing or interacting with the dog and the cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "truck", "chair", "handbag", "dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": " The image depicts a horse standing in the snow wearing a harness. The horse is next to a snow-covered pile of hay and nearby, there are two vehicles parked, one on the left and the other on the right. The horse seems to be the center of attention, as it is pulling a sled along with the plowing of the snow", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "horse", "skis", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 354559, "caption": "According to the picture, a young woman is standing in front of the background, holding a teddy bear. She is wearing a black and red costume, which adds a playful touch to her overall appearance. The woman is smiling, and hugging the teddy bear, creating a whimsical and joyful scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": "According to the picture, a group of people is gathered around a table with cake on display. The cake, cut into a couple of pieces, is on a plate placed on the table. The people in the image appear to be enjoying the cake, indicating that they are socializing and having a good time", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 468005, "caption": " The image shows a large elephant walking down a pathway surrounded by a crowd of people. There are many people walking with the elephant, some closer to the front and others near the back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "elephant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 545594, "caption": "In the image, there are several people on a large, public bus. The bus has multiple rows of seats, and some of the seats are occupied", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "orange", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 219546, "caption": " The image showcases a kitchen table filled with a variety of different foods, including meat and other assorted dishes. There are several bowls placed across the table, some with salad dressing, a couple with vegetables, and a few others containing meats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "spoon", "bottle", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 386645, "caption": "According to the picture, there is a large, old-fashioned building with an asphalt parking area in a city street. A blue truck and a white vehicle are parked outside the building. The building has a sign above the entrance that reads \"King's Chinese Restaurant\" in Chinese characters", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 155131, "caption": "According to the picture, a woman is playing tennis on a grass tennis court. She is wearing a tennis outfit and has leaned her hand back to hit a tennis ball. She is in the act of hitting the ball, with her racquet and her hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "sports ball", "bench", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 188752, "caption": "According to the picture, a man is standing in a grocery store surrounded by numerous bananas hanging from the ceiling. The man is looking at the bananas, which are hanging from various points, possibly considering purchasing one or several of them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "apple", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": "According to the picture, a group of people is seated at a restaurant, enjoying their meal. The people are sitting at various tables, with several chairs placed around the tables. There are multiple umbrellas set up around the restaurant, offering shade from the sun", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "chair", "tie", "wine glass", "umbrella", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437720, "caption": "According to the image, a naked woman sits at a desk with a computer in front of her. There are several red chairs in the scene. In one of the chairs, there is a vase with a bouquet of flowers", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "dining table", "chair", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 378831, "caption": "According to the picture, a woman is walking down the street front of a yellow fire hydrant, carrying grocery bags. There are several other pedestrians also visible in the scene, with one person walking towards the woman, another individual farther away, and a couple approaching from the left side of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "fire hydrant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375503, "caption": "The image depicts a grand staircase with ornate handrails leading up to a room with a large clock on the wall. The clock has pink and yellow decorative walls and carved details", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "clock", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 330369, "caption": "The scene depicts a red fishing boat, docked on a large body of water. The boat is situated in the foreground, with a second boat visible in the background, docked to the right of the primary vessel", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, there is a man in a green field and a child nearby. They are standing next to a tree, and the man is tossing the frisbee over the child. The scene captures an interactive moment between the two individuals, showcasing their playful activity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 32190, "caption": "According to the picture, there is a brown and white colored toy horse on the sidewalk in front of a business. The toy horse is placed on the sidewalk, giving it a unique position in the street scene. There are several other small toy horses, possibly arranged by the business. A car is parked close by, while several other vehicles, including a police car, are visible in the distance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 535403, "caption": "According to the picture, there is a small bathroom with a sink and a bathtub. In the room, there is also a mirror, giving the illusion of a bigger space. The bathroom has multiple grooming tools, including a hair brush, a hair dryer, and several hair brushes, displayed on the counter next to the sink", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "clock", "bottle", "sink", "hair drier", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 449798, "caption": "The image features a workspace with a desk containing various items, including a keyboard and a mouse, sitting on top of the table. A laptop is also placed on the desk, indicating the presence of a computer setup. A TV set is visible nearby, possibly serving as a screen for the computer.", "mscoco_hallucinated_words": [["tv", "tv"]], "mscoco_gt_words": ["cell phone", "dining table", "mouse", "keyboard", "book", "laptop"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 375769, "caption": "According to the picture, there is a young child wearing a helmet sitting at a table, eating a sandwich, while looking at the bikes outside. The bikes are placed around the dining area, with some visible behind the table and others in the background", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["dining table", "bicycle", "chair", "hot dog", "umbrella", "car", "person", "cup"], "hallucination_idxs": [20], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.375}}, {"image_id": 426172, "caption": " The image showcases a table with a display stand full of cakes, cheese, and other food items. A variety of dishes, including some cake slices and different types of cheese, are placed on the table and are available for customers to browse and purchase. ", "mscoco_hallucinated_words": [["cake", "cake"], ["cake", "cake"]], "mscoco_gt_words": ["dining table", "chair", "bottle", "knife", "book"], "hallucination_idxs": [11, 27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 20650, "caption": "According to the picture, there is a sandwich on a plate accompanied by broccoli. The sandwich appears to be filled with a variety of ingredients, such as lettuce, tomato, and cucumber, making it a tasty and healthy meal. The broccoli is also visible, likely contributing to the nutritious value of the meal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "sandwich", "orange", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 138078, "caption": "The image portrays a young baseball player standing on the field, holding a baseball glove. He is wearing a red baseball uniform and glasses, and is looking down at a ball held by another player.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 152499, "caption": "According to the image, there is a group of people gathered around a swimming pool. They appear to be participating in a fun activity, possibly involving pool noodles. There are adults and children in the group, with adults holding various pool noodles", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "dining table", "potted plant", "chair", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 180764, "caption": "According to the picture, a man is eating a doughnut while sitting in a room. There is another person in the room, but it's not clear what their relation is to the man eating the doughnut", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "microwave", "person", "donut"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 147762, "caption": "According to the picture, there is a white plate on a table, topped with a piece of cake. The cake has an orangish color, covering part of the plate. Additionally, a small, spherical item is placed atop the cake, possibly a scoop of ice cream or a sweet treat", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 113860, "caption": "According to the picture, a black and white boat is floating on the water. The boat is quite large and is positioned near the middle of the scene, traveling down a canal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 460866, "caption": "The image features various oral hygiene items, including toothbrushes and toothpaste, arranged in a sink or a cup. There are eight toothbrushes inside the sink, with some placed together and others scattered throughout the cup", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "toothbrush", "sink", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 177015, "caption": " According to the picture, a man is sitting on a couch with his laptop open, and a cat is sitting on his lap, both engaged in their respective activities. The man is working on his computer, and the cat seems to be enjoying the warmth and attention from the man. In the scene, the couch covers a significant part of the background, and other furniture can be seen in the room, such as a chair positioned further back on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "chair", "couch", "person", "apple", "laptop", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "According to the picture, the scene features a small kitchen with oak cabinets and chairs. The kitchen is clean and organized, with several appliances visible. A refrigerator and a microwave can be found in the room, with a sink located in the middle.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["bowl", "oven", "bottle", "sink", "microwave", "person", "suitcase", "refrigerator", "cup"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 47131, "caption": "The image shows a group of people gathered around an old train wreck. There are at least 13 men in the picture, with some of them on the ground and others standing around the wreck. The train appears to be a large, metal structure, which has been destroyed in some places", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425447, "caption": "According to the picture, there is a white pickup truck parked on the street, with water used to clean the area. The truck is parked next to a tree, and there are a few other people in the area, including two individuals walking towards the left of the scene, a man near the right side, and another person standing towards the center of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295105, "caption": "According to the picture, a man is standing in a kitchen, looking at the refrigerator with a curious expression on his face. He appears to be checking the contents of the refrigerator, deciding what to eat. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "oven", "bottle", "sink", "person", "apple", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 443259, "caption": "According to the picture, there is a spacious modern kitchen with white countertops, a range, a sink, a white refrigerator, and a freezer. The room is well-lit and clean, showcasing a well-organized space. Two apples can be seen, one on the left side and one on the right side of the kitchen, possibly arranged as a decoration or as part of a snack", "mscoco_hallucinated_words": [["apple", "apple"]], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "oven", "chair", "bottle", "sink", "orange", "refrigerator"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2222222222222222}}, {"image_id": 537861, "caption": "According to the picture, a train is parked at a train station, possibly due to a delay or a stop. The train has a yellow and blue color scheme, and it is being pulled by an engine. The train is traveling on the tracks, and it appears to be on a bridge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "According to the image, there is a life-sized donald duck statue placed in front of a large Christmas tree. The donald duck character is surrounded by a large group of people in the surrounding area. The image appears to be from a Disneyland park, possibly at Christmas time, with the festive decorations on the street and in the park", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "mouse", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 249969, "caption": "The image features a delicious and elaborate meal served on a white plate. The meal features a roasted chicken along with a variety of vegetables, including broccoli and carrots. The vegetables are accompanied by numerous pieces of meat, likely beef", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "knife", "wine glass", "broccoli", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 146504, "caption": " In the image, a man is sitting at a desk and typing on a computer keyboard, using his mouse. The keyboard and mouse are placed on a dining table, and the man's hand can be seen positioned over the keyboard, possibly about to click a button or mouse click. In the background, there are two cups on the table, one closer to the keyboard and mouse and the other on the left side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "person", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 40037, "caption": "According to the picture, there is a large, white clock tower situated in front of a building with a green roof. The clock tower has a tall and slender design, while the building has a similar, grey appearance. The clock tower is situated in front of the building with a green roof, which could be a restaurant, a hotel, or another public building", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 343218, "caption": "The image shows a young man playing tennis on a court, holding a tennis racket and attempting to hit a ball with it. There are two other individuals in the scene, standing on the left and right sides of the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "car", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 301837, "caption": "The image shows a spacious and modern kitchen, featuring white cabinets and countertops. The kitchen is equipped with several appliances such as a stainless steel refrigerator, an oven, a microwave, and a dishwasher. There are two large windows in the kitchen, allowing plenty of natural light to enter and making the space feel brighter", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["oven", "clock", "sink", "apple", "refrigerator"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 309071, "caption": "According to the picture, a man is standing next to a parked bicycle, adjusting it. He is wearing a blue shirt and pants, and has a tie. The man is also wearing a hat, which might suggest that he is working on the bicycle", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["backpack", "potted plant", "bicycle", "bench", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 240681, "caption": "The image depicts a person standing next to a red motorcycle on the side of the road. This individual is holding a cell phone and appears to be taking a picture of the motorcycle or possibly taking a call. There are no other significant elements in the scene, just the person, motorcycle, and cell phone", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["backpack", "motorcycle", "car", "person"], "hallucination_idxs": [23, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 178753, "caption": " According to the picture, the scene shows a young girl sitting in a red chair, wearing braces and smiling while holding a cell phone. She appears to be in a seated position, possibly at an event or stadium. The image captures the girl in an active and cheerful moment while using her cell phone", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278890, "caption": " According to the picture, the scene features two people, one man and one woman, standing near each other. Both individuals are looking down, likely at their phones, as they are holding their devices in their hands. The man appears to be wearing a blue shirt, while the woman wears a shirt that is not visible in the picture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": "According to the image, there is a modern bedroom with a red-colored bed sitting in the middle of the room. The bed is made of dark wood, adding a touch of elegance and sophistication to the room. The furniture consists of a desk with various books and a chair, both of which can be found on the sides of the room", "mscoco_hallucinated_words": [["desk", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["vase", "bed", "book", "person", "tv"], "hallucination_idxs": [46, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 542792, "caption": " A large baseball bat is positioned high up on a building, creating an attention-grabbing and iconic image. The baseball bat sits atop a tall building, likely located near a city", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 293703, "caption": "According to the picture, there are three men sitting around a dining table. Two of the men are located on the right side of the table, and the third person is seated closer to the center. Each man has a plate filled with food, suggesting they are eating a meal together. The table features several bowls and cups placed around the table, with cups on the right side and bowls on the left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "bottle", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4444444444444444}}, {"image_id": 318777, "caption": "According to the picture, a group of baseball players is on the field, playing a game. They are all wearing their respective team's uniforms and are positioned around the baseball field, ready to make their next move. The field is covered with grass and dirt, and spectators are scattered across the stadium, watching the game and preparing for their viewing experience", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "chair", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 14773, "caption": " The image shows a dining table filled with a variety of food. A hot dog, hamburgers, and french fries are visible, arranged in a car-shaped container. There are additional food items on the table, including a bottle, two cups and a spoon", "mscoco_hallucinated_words": [["hamburger", "sandwich"], ["container", "bowl"]], "mscoco_gt_words": ["fork", "dining table", "spoon", "hot dog", "bottle", "car", "person", "cup"], "hallucination_idxs": [17, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.625}}, {"image_id": 521863, "caption": "According to the picture, there is a person on a skateboard riding down a sidewalk through a large, glass window. He appears to be skating down the sidewalk, enjoying the skateboarding experience. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": "According to the picture, there is a group of men on motorcycles in a city setting, with several people on them. One of the motorcycles is on the right side of the scene, while another motorcycle is in the middle, and a third one is on the left side. There are five people visible, including four seated on the motorcycles and one standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "motorcycle", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 211604, "caption": " The image features a young woman laying on a wooden floor, wearing a white tennis outfit. She has her head towards the right side of the picture, and appears to be relaxed while resting on the ground. She is holding a tennis racket beside her, possibly taking a break from the game or resting after a match", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "This image features a close-up view of a plate with some chicken pieces on it, and there are several beers placed in the background. A bottle of beer is located close to the edge of the table, creating an interesting and visually appealing setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "dining table", "bowl", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 563938, "caption": " The image shows three people sitting on a couch, with one man and two women. The man and one woman are in the foreground, while the other woman sits further back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "couch", "bottle", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 461311, "caption": "The scene depicts a large group of people, including children, gathered in a park during a kite flying event. The sky is filled with numerous kites of various shapes and sizes, with some of them being flown by the people in the park. There are kites of different sizes, with one slightly larger kite flying over a smaller kite. The people in the park appear to be enjoying the event, possibly flying their own kites or cheering on others", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "kite", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": " The image depicts a person lying on their bed with their legs dangling off the side. A variety of books can be seen spread out on the bed around them, providing reading material for this relaxed moment. There is another stuffed animal positioned near the person's lower body, and a laptop is placed close to the person's head", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "laptop", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261318, "caption": "In the image, a man is sitting on a chair, using a cell phone to make a call. He is surrounded by a number of suitcases and backpacks, suggesting that he is traveling or possibly returning from a trip", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["cell phone", "chair", "handbag", "bench", "person", "suitcase"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 134768, "caption": "According to the picture, there is a keyboard sitting on a desk in front of a computer monitor, with a pair of headphones and another set of speakers nearby. The person seems to be using the computer and enjoying the music that's being played.", "mscoco_hallucinated_words": [["monitor", "tv"], ["person", "person"]], "mscoco_gt_words": ["mouse", "laptop", "keyboard", "dining table"], "hallucination_idxs": [18, 33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 526414, "caption": "According to the picture, the scene is a home office with a table located in the center of the room. On the table is a sewing machine, a pair of scissors, and a bottle. The sewing area contains a desk with some books and a laptop", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["scissors", "bowl", "dining table", "cat", "remote", "couch", "book", "laptop", "tv", "cup"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.4}}, {"image_id": 273132, "caption": " According to the image, there is a woman sitting on a park bench in an older black-and-white photo. She seems to be wearing a cap and looking at something outside the frame, possibly towards a court nearby. The woman appears to be elderly", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": "According to the picture, there is a man in a crowded area eating hot dogs, likely having fun with the scene. One of the hot dogs appears to be in his mouth, with others arranged around him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "sandwich", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": "According to the picture, there is a young man standing on a baseball field, wearing a black and white baseball uniform. He is holding a baseball and is likely waving or signaling to someone off the field. The man is in the middle of the image, showcasing his baseball-related activity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 272728, "caption": " The image features a beautiful bouquet of colorful flowers placed in a wooden vase, creating a captivating visual display. The bouquet contains various flowers with different colors and shapes, such as pink, purple, and red. There are a total of 10 flowers in the bouquet", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 509811, "caption": "According to the picture, there is a group of people sitting in a classroom or study area surrounded by desks and chairs. Multiple laptops are placed on some of the desks, suggesting that they are being used for studying or working. The people in the scene appear to be students or employees working on their laptops", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "book", "apple", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 412407, "caption": "According to the picture, there is a woman holding a cell phone up to show it to someone in the room. The phone appears to be open, and it has an activated camera for taking pictures", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "According to the picture, a woman is walking down a street in a city, carrying a pink umbrella to protect herself from the rain. There are several other people present in the scene, one of them being a man. Some of them appear to be shopping, browsing, and carrying handbags", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 441363, "caption": "According to the picture, there is a young man wearing a red shirt standing on a tennis court, playing tennis with a racket. He is swinging the racket to hit a ball in the air, likely during a game. Another person is also in the scene, standing closer to the right side of the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 331084, "caption": " A group of people, including two adults and two kids, are gathered around a table in the image, preparing food for a pizza party. On the table, there are several ingredients, such as vegetables like tomatoes, onions, and mushrooms, which can be seen placed on a kitchen board.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "spoon", "knife", "person", "pizza", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 282553, "caption": "According to the image, a man is walking down a path through a grassy area. The path is setting in the background, casting a shadow on the grass. The sun is setting in the background, creating a beautiful scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, a young woman is sitting on a bench while putting on her shoes. She is in a park setting with a bicycle nearby, indicating that she is likely preparing to go for a walk some other day. The image captures a casual moment, with the woman possibly getting ready to enjoy a bike ride or a walk in the park", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 3103, "caption": "According to the picture, a man in a yellow helmet and black pants is performing a trick while wearing a helmet, showing off his skills on a skateboard. The man is actively involved in the activity, with a crowd of people in the background observing or possibly waiting for their turn to ride the ramps", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "According to the picture, there is a kitchen area with various items arranged on the countertop. There are a few items such as a pitcher, a bowl, a large pot, and a bottle, giving the scene a clean, white atmosphere. A few apples and oranges are visible on the counter, creating a simple yet appealing setting. Two refrigerators, a stove, a sink, and multiple vases can also be seen in the room, giving it a more complete look", "mscoco_hallucinated_words": [["bottle", "bottle"], ["apple", "apple"], ["refrigerator", "refrigerator"], ["stove", "oven"], ["vase", "vase"]], "mscoco_gt_words": ["bowl", "dining table", "sink", "orange", "person", "carrot"], "hallucination_idxs": [37, 50, 67, 70, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.6666666666666666}}, {"image_id": 449675, "caption": "According to the image, there is a white plate sitting on a table filled with food. The plate contains a sandwich, french fries, and a fork. The sandwich appears to be a grilled cheese sandwich, which is a popular item at many fast food restaurants", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "bottle", "sandwich", "knife", "carrot", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 191691, "caption": "According to the picture, the man is shaving in a bathroom. He is standing in front of a large mirror, using a toothbrush to clean his teeth. In the background, there is another person in the same room, possibly a friend or family member watching him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toothbrush", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 89378, "caption": "According to the picture, there is a man standing next to a motorcycle in front of a red building. The motorcycle appears to be a sporty bike with a white and black color scheme, and the man is wearing a black helmet. Another person is visible in the background, possibly taking a picture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "backpack", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 99874, "caption": " According to the picture, a young woman is sitting at a table in a restaurant, wearing sunglasses and talking on a cell phone. She looks relaxed, possibly chatting with a friend or family member", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "cat", "bottle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437895, "caption": "The image is of a bed with a colorful, ornate quilt covering it. The bed has a frame or metal canopy on top, which gives the image an artistic and unique look", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": "The image features a man, possibly an executive, sitting at a table wearing a suit and tie. He appears to be in a meeting room and is focused on a document in front of him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "tie", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 159979, "caption": " The image captures a group of people sitting on a wooden boat located in the water, surrounded by various umbrellas. There are several umbrellas, each with a different size and color, protecting the individuals from the sun", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bowl", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 357, "caption": "According to the picture, there is a baseball game taking place on a green field. Several players are actively engaged in the game. A batter is standing near home plate, ready to swing at the ball. The catcher is also in place, preparing to catch the ball when it is pitched", "mscoco_hallucinated_words": [["ball", "sports ball"], ["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [39, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 538891, "caption": "According to the picture, there are multiple people in different positions around the young woman, but they appear to be scattered and not particularly close to each other. A young woman is standing on a tennis court, holding a tennis racket and looking up. She is wearing a white tennis shirt and has her hair tied back. She seems to be ready to hit a ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "In the image, a train has come to a stop on the railroad track. The train is carrying shipping containers, with several visible along the length of the train", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["train", "traffic light", "car"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 565778, "caption": "The image shows a blue train traveling along a track next to a tunnel. There are several people in various positions on the train, including one at the front, a few near the middle, and others at the back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 7125, "caption": "The image portrays a sunny day with several people enjoying the scenic view while riding bicycles down a path next to the water. There are four bicycles visible in the picture, one located towards the left, one in the center, another one near the right side, and a final bicycle closer to the right edge of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bicycle", "bottle", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": "According to the picture, an older man is wearing a red and white tie, and he is holding the end of the tie on his shirt collar. He appears to be in a formal setting, possibly preparing for a meeting or event. The scene captures the person's attire and actions, showcasing a stylish yet professional appearance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "tie", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 280930, "caption": " The image shows a young girl standing in a kitchen, looking surprised while holding a pan with baked goods. She is smiling and laughing, making the scene amusing. The girl is wearing a blue shirt, and the kitchen has a stove nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "person", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 151081, "caption": " The scene is a field located near a road with a small airplane flying above. The plane can be seen passing over the field, capturing a bird's eye view of the rural area", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["airplane", "person", "truck"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": " In the image, a woman is standing on a tennis court holding a tennis racket. She is wearing a yellow raincoat and is surrounded by several tennis balls placed at various positions on the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": " The image displays a sandwich served with french fries on a plate. The sandwich is made with a hamburger, with various toppings, including lettuce and ketchup", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["sandwich", "sandwich"], ["hamburger", "sandwich"]], "mscoco_gt_words": ["bowl", "broccoli", "hot dog", "dining table"], "hallucination_idxs": [4, 14, 19], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 198163, "caption": "According to the picture, a man is sitting in front of a table covered with small bowls filled with various colors of paint. The man is painting or glazing the items on the table. The scene showcases a man working in an art studio or workshop, where he is creating a collection of hand-painted bowls", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 3742, "caption": "According to the picture, it displays a window filled with a variety of apples, arranged in a bowl and placed on a windowsill. There are numerous apples visible, with some in the front and others scattered across the scene. Additionally, a vase can be seen in the image, set in a window, where a flower is also visible", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "apple", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 217593, "caption": "The scene features a young boy dressed in a baseball uniform and standing in a grassy field, possibly practicing for a game or waiting to play. There are several people in the area, but none of them seem to be involved with the child", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "chair", "handbag", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 279078, "caption": "According to the picture, it shows a young girl holding a bottle of beer and looking at an open refrigerator full of many bottles of beer. The bottles are arranged in multiple positions inside the refrigerator, creating an interesting display. The girl seems excited and curious about the selection of beers available", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": "According to the picture, a man is playing tennis on a court with a net in the background. Another man is watching him. The two tennis players are participating in a tennis match, and the court is surrounded by a crowd of people watching and possibly cheering them on", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 477016, "caption": "According to the picture, the image features a small girl, likely around four or five years old, with her mother walking towards a decorative kitchen. The kitchen has various features, including a sink, an oven, and a stove. The oven is colored in a pink and white design, and the stove appears to be in a light blue color", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "potted plant", "oven", "sink", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301266, "caption": "According to the picture, a group of young people, including a man, are gathered around a table in a kitchen, engaged in a conversation. Some are holding baskets, while others are working on whatever they are doing in the kitchen. There are also a few bowls, such as knives and spoons, visible on the table.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["knife", "knife"]], "mscoco_gt_words": ["fork", "dining table", "spoon", "sink", "book", "person", "cup"], "hallucination_idxs": [52, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.42857142857142855}}, {"image_id": 475813, "caption": "The image features a small child standing in a room, holding onto a cell phone with both hands. She appears to be walking around while holding the phone", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["phone", "cell phone"]], "mscoco_gt_words": ["handbag", "remote", "person"], "hallucination_idxs": [14, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 558784, "caption": "According to the picture, a man is playing golf on a field, standing near the water holding his golf club. He is dressed in clothes, and a baseball bat placed on the ground next to him. He is captured in a serene landscape, with water in the background", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["backpack", "person", "frisbee"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": " The image depicts a person's hand holding two silver spoons, which appear to be twins. The individual is standing over a sink, possibly washing the utensils", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 359164, "caption": "According to the picture, there are two dogs - a golden retriever and a small dog - running around on a grassy area, playing and having fun together. The golden retriever is seen picking up something, possibly a toy, while the small dog follows close behind.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "dog", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 539951, "caption": "There is a small, Lego-built kitchen on display, featuring a stove, sink, and refrigerator. The kitchen has a black and white checkerboard pattern on the countertop and walls. There are two different types of ovens in the kitchen, one on the left side and the other on the right side, and both are part of the Lego build", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "clock", "sink", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 143479, "caption": "According to the picture, there is a man, possibly an older man, holding a tray full of hot dogs in front of a woman. They are standing in a grassy field, and the woman appears to be looking at the food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": "The image features a vase with flowers and a mirror behind it. The vase has a combination of different flowers and leaves, with a mix of red and orange colors, arranged throughout it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 238065, "caption": "The image depicts a young man skateboarding on a wooden ramp, possibly performing tricks in the air. He is wearing a blue shirt and appears to be enjoying the activity. There are a few other skateboards scattered around the scene, including one close to the main subject, another towards the left, and another one near the top left corner", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, there is a small bathroom containing various items. A toilet can be seen in the bathroom, along with a sink filled with alcohol, a glass vase containing various items, and a book. The sink has been filled with items such as a bottle, books, and other items", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["bottle", "toilet", "sink", "book"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 322163, "caption": " A group of people, including a chef, are gathered around a table in a commercial kitchen, preparing food. The chef is cutting food, while the other people watch and assist. The scene suggests that the chef is about to serve a meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "remote", "oven", "bottle", "person", "pizza", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 230983, "caption": "According to the picture, a young man is seen performing a skateboard trick on the sidewalk. He is jumping off a green skateboard, and extending his arms out to fly through the air. A building can be seen in the background, and multiple other people are also present in the scene, possibly observing the skateboarder's trick", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "According to the image, the scene features a large kitchen with wood cabinets, multiple ovens, and a sink. There is also a dining table with chairs in the kitchen. The kitchen is decorated with wood cabinets and a range hood. The kitchen appears to be well-organized and functional, providing ample space for cooking and dining", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["oven", "refrigerator", "sink", "microwave"], "hallucination_idxs": [27, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 53431, "caption": "According to the picture, a man is sitting on a bench in front of a brick wall. He is wearing a blue shirt and a tie and is using a laptop while resting his feet on a chair. He is sitting in a chair with a pillow placed on it", "mscoco_hallucinated_words": [["bench", "bench"], ["tie", "tie"]], "mscoco_gt_words": ["chair", "laptop", "person"], "hallucination_idxs": [11, 27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 441840, "caption": " The image features a young boy playing baseball in a field, holding a baseball bat and swinging it at a baseball. He is in the center of the image, surrounded by other individuals who are likely watching or observing the play. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "baseball glove", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53465, "caption": "According to the picture, there is a large, decorated cake placed on a table in front of a group of people, likely a gathering of friends or family celebrating a special occasion. The cake itself is adorned with the words \"Welcome to [IDK],\" with an [IDK] design added to it. This unique cake style adds a touch of fun to the celebration. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "spoon", "cake", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 381031, "caption": " A table is filled with different types of foods, including a dozen donuts, a knife, an apple, and a glass of orange juice. The dozen donuts are placed on a white plate, while the apple and glass of orange juice are also present on the table. Various bottles and cups, containing additional beverages, can be seen nearby", "mscoco_hallucinated_words": [["knife", "knife"], ["orange", "orange"], ["orange", "orange"]], "mscoco_gt_words": ["dining table", "donut", "spoon", "bottle", "microwave", "apple", "cup"], "hallucination_idxs": [16, 25, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.7142857142857143}}, {"image_id": 530520, "caption": "The image captures a group of people sitting outside on the grass, using their laptops. There are four men and one woman, all seated together in a casual setting. They seem to be enjoying each other's company while working on their laptops", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "motorcycle", "clock", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 539555, "caption": "The image captures an exciting moment where a young man in a white t-shirt is performing a back flip while holding a green frisbee. There are several people in the background, possibly observing the man's impressive acrobatics", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": " According to the picture, two young men are standing next to a subway train, looking at it. One of them is wearing a jacket, while the other has a backpack. They are standing near a sidewalk, discussing something related to their plans, possibly waiting for a train or planning their next move.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "train", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 560439, "caption": "According to the picture, there are four people in the scene who appear to be walking along a path in the snow, enjoying themselves and spending time together. The main focus of the scene is the group of people walking the snowy path, with two of them wearing snowshoes", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "backpack", "skis", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": "According to the picture, there is a group of people skiing down a snowy slope. There are 13 people visible in the scene, most of them wearing skis. There is a mixture of male and female skiers, creating a diverse skiing environment", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 9628, "caption": " The image depicts a man riding on water skis, performing a trick in the middle of the water while a friend observes from shore. The main man is in the air, leaping over the water and enjoying the thrill of the ride", "mscoco_hallucinated_words": [["ski", "skis"]], "mscoco_gt_words": ["boat", "surfboard", "person"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 330050, "caption": "According to the image, there is a man and a woman standing next to each other on a tennis court. They appear to be smiling and posing for the camera, suggesting they might have been taking a group photo. Both individuals are holding tennis rackets. The man has a racket in his right hand, while the woman is holding a racket with both hands", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " According to the picture, it is a warm summer day as several people enjoy an outdoor brick plaza. One man is standing near a tall building, flying a large kite. He appears to be in the middle of the plaza, surrounded by a couple of other individuals", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "According to the picture, a group of people are standing in a small room, possibly a living room or a dorm room. The people are chatting, laughing, and interacting with each other, creating a casual and social atmosphere. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "mouse", "person", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 163518, "caption": "According to the picture, a woman in a chef's uniform is standing in a kitchen, placing prepared vegetables into plastic containers. The woman is positioned at the middle of the scene, in front of a table filled with different types of food, including vegetables, as well as a bottle and a cup. The image also includes a sink in the back and a refrigerator further back in the room, near the woman", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["sink", "sink"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["bowl", "dining table", "oven", "chair", "person"], "hallucination_idxs": [55, 58, 65, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6}}, {"image_id": 156696, "caption": " There are two trains at a train station, with one train being longer than the other. Both trains are painted in a blue color. The longer train is parked on the track, while the shorter one is next to it on the same track", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 525202, "caption": " The image displays a group of people sitting at a table in a restaurant, enjoying a delicious pizza. The pizza is placed on a pizza pan and is likely being shared by the group. There are multiple people at the table, with a man and woman being the main figures in the foreground, while others are in the background. It appears that they are having a meal, likely a casual evening out with friends or family", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "spoon", "bottle", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 423984, "caption": "The image features a white plate with a bowl of pasta, a slice of bread, and carrots on it. The pasta is covered in a creamy sauce, and there are two carrots visible on the plate. The carrots are positioned closer to the center of the dish, with the bread placed next to them", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 499486, "caption": "According to the picture, two cats and a kitten are sitting on a bed, laying down comfortably. The bed is covered with a messy sheet, and the cats are surrounded by various items on the bed. The cat at the left is larger, occupying most of the left side of the bed, while the cat at the right is smaller, taking up only a portion of the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": "According to the picture, it appears to be an image of a young man sitting inside a car, presumably in the driver's seat. The man is dressed in a black shirt with a tie and is wearing glasses. He is the main focus of the image", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["tie", "car", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 288729, "caption": "According to the picture, a cake is in the shape of a circus tent, and it is placed in front of a sign. The cake is decorated with clowns, as well as a balloon and a bear. Additionally, there is a figure standing next to the cake, and another sitting nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "cake", "bear", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 177721, "caption": " The image captures a young man riding a skateboard on a ramp. He is in the middle of performing a trick, with his feet off the ground as he showcases his skateboard skills", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "The image shows a man in a black suit, standing near a large stone wall and looking at his cell phone. He is wearing a beard and appears to be using his cell phone to communicate with someone or access important information.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "horse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 252625, "caption": "According to the picture, a street with shops and various buildings is occupied by people, giving a lively and vibrant atmosphere. There are multiple people walking and riding bicycles along the street, some carrying backpacks and bags. Additionally, there are a few shops along the street, including a restaurant and some stores with signs", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["bicycle", "traffic light", "potted plant", "person"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 493610, "caption": "According to the picture, there is a group of people riding in a white truck on a dirt road in a developing country. The truck is carrying several passengers, and it is traveling through a dirt street, possibly in a developing country. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 302789, "caption": " A small room is occupied by a bed, which has a suitcase and other bags packed on it. The suitcase is placed at the left side of the bed, while the other bags are scattered throughout the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bed", "bottle", "sink", "handbag", "suitcase", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 30667, "caption": " According to the picture, there is a brown dog lying on the floor of a boat. The dog appears to be resting or sleeping, and there are two individuals sitting near it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 114108, "caption": "According to the picture, it showcases a group of people sitting around a table, enjoying a homemade pizza. The pizza is cut into slices and served on a white plate. They are having a good time, engaging in conversation and socializing with each other while savoring their food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "spoon", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 328110, "caption": " The image shows a woman sitting on a bench with her baby in her arms, looking at her phone. She appears to be engaged in an important call or message", "mscoco_hallucinated_words": [["phone", "cell phone"]], "mscoco_gt_words": ["handbag", "bench", "backpack", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 366111, "caption": "According to the picture, a group of people is riding bicycles through the city streets, creating a lively scene. The bicycles are various in size, some of which are larger, and some are smaller. Some people wear hats, and a few of them are dressed in suits, suggesting that this is an old-fashioned photo of people biking in their finery", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 532481, "caption": "According to the picture, a man is flying on a kite, while another person is riding a wave in the ocean. In the background, there are several homes along the waterfront, likely belonging to nearby residents or beachgoers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 291634, "caption": "According to the picture, there is a large group of motorcycles parked in the middle of a busy city street. The motorcycles are lined up along the street, giving a sense of a community gathering", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 463678, "caption": "According to the picture, the main focus of the scene is a young tennis player wearing a white shirt and white shorts, preparing to serve the ball in a tennis match. The player is standing on the tennis court and holds a tennis racket, getting ready to swing and make the serve. The image captures a moment of action and excitement in the game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "According to the image, a small dog is sitting on a sidewalk in front of bikes, with a leash attached to it. The dog is looking up at the camera, possibly waiting for its owner or curious about the environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 367872, "caption": " The scene is set in a baseball field, with a player standing on the playing field with a baseball bat in his hand, preparing to swing at a ball. There are several other people in the image, possibly teammates, spectators, or coaches", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "sports ball", "umbrella", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 353898, "caption": " The scene features a man sitting on a stone bench in a park surrounded by a large number of white pigeons. The birds appear to be gathered around the man, possibly attracted to the sound of a nearby bird cage, or foraging for food or other reasons", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "bench", "person", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": "According to the picture, it presents a bowl of hot soup on a table. The soup is accompanied by a couple of slices of grilled cheese sandwiches, giving it a tasty, filling meal. One of the sandwiches has been topped with tomato sauce, adding an extra layer of flavor to the dish", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["spoon", "bowl", "sandwich"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 463211, "caption": "According to the image, there are two children standing on a sidewalk, a young girl and a boy, both wearing backpacks. The girl is walking with her backpack in a sideways position, and the boy is walking with his backpack towards the back of the image. They appear to be going towards or walking past a school bus, possibly after getting off a school bus or getting ready to board it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": " The scene is a large room with a chandelier hanging from the ceiling, a paneled wall, and several tiled floors. A golden chair is positioned near the center of the room, surrounded by a variety of vases filled with various colors and sizes of flowers", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "vase", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 98979, "caption": " The image features a black microwave sitting on top of a kitchen counter, with its door open. It is prominently displayed, taking up a significant portion of the kitchen scene. Several other appliances, such as a refrigerator and a sink, can be seen surrounding the microwave, indicating that it is placed in a kitchen setting", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["sink", "sink"]], "mscoco_gt_words": ["oven", "knife", "microwave"], "hallucination_idxs": [41, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 244455, "caption": "In the image, two motorcycle riders are traveling down the road. Both riders are dressed in uniform, likely being police officers. One of them is on the right side of the image, while the other is more to the left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 379443, "caption": "According to the picture, there is an image of a black mesh fabric with a pair of scissors and an orange pair of pinking shears sitting atop it. The pinking shears are also visible in the background, possibly resting on the surface under the black mesh fabric", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["scissors", "dining table", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 536154, "caption": "A young girl is sitting on a bed surrounded by a large variety of stuffed animals, including teddy bears, elephants, and puppies. The girl seems to be enjoying herself as she has stuffed animals all around her", "mscoco_hallucinated_words": [["bed", "bed"], ["elephant", "elephant"], ["puppy", "dog"]], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [7, 21, 24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 374061, "caption": "According to the picture, there is a black cat lying on a couch and holding a stuffed animal, possibly a teddy bear, close to its chest. The cat and the toy occupy different parts of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "cat", "bed", "couch", "bear"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 560018, "caption": " According to the picture, several people are enjoying a day at the beach, riding the waves and enjoying themselves. There are about 12 people scattered throughout the image, which is mainly in the ocean and on the sand. A few of them are wearing surfboards and riding the waves, while the others are either standing in the water or close to the shore", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353320, "caption": " A large guitar case, possibly with luggage, is sitting on a bench outside of an airport. Nearby, there are two suitcases. The guitar case and one of the suitcases are on the right side, while the other suitcase is placed further to the left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256628, "caption": " The scene shows a golden retriever dog lying on a wooden deck, resting comfortably and enjoying its surroundings. The dog appears to be very relaxed, possibly waiting for its owner to return", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bowl", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 158806, "caption": " The image features a brown dog eating a sandwich on a blue plate. The dog's mouth is open, as it is in the middle of consuming the sandwich", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 424270, "caption": "In the image, there is a young girl sitting on a bed in a room, possibly in a hotel or vacation setting. The girl seems comfortable and relaxed, sitting on the left side of the bed. The bedspread is slightly raised, and the girl appears to be surrounded by pillows", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tv", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": " The scene is set in a parking lot where a brown horse is walking along a parking spot between two cars. The horse is walking towards the cars, passing by them in a narrow space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 483227, "caption": "According to the picture, a group of young children is gathered around a tennis court, participating in a tennis lesson or playing a game of tennis. They are wearing shirts and holding tennis rackets, while some are holding a tennis ball. The scene shows a fun and energetic atmosphere, where kids are learning and enjoying the sport", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 228306, "caption": "There is a large, red double-decker bus in the scene, with its front and back clearly visible. The bus appears to be parked on the street, possibly at a bus stop or designated parking spot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 416384, "caption": "According to the picture, a young woman is sitting on a chair, reading a book. She is in the middle of the scene, surrounded by other people in the background", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 364783, "caption": "According to the picture, a small, red bathroom has a white toilet and a red sink, with a mirror reflecting the image of the bathroom. The bathroom is clean and well-maintained, and there is a toothbrush hanging from the sink. The bathroom is small and designed for the sole purpose of a toilet and sink, with no shower or bathtub present in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "chair", "bottle", "toothbrush", "sink", "toilet", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 335041, "caption": "The image features a man walking along a snow-covered sidewalk in the middle of a city street. The man is wearing skis and appears to be skiing down the sidewalk while navigating through a section of the street that has been covered with snow", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 531327, "caption": " The image features a white bowl filled with a variety of vegetables, including broccoli, carrots, green beans, and red peppers. The table where the bowl is located is a dining table, and there is a knife placed in the bowl as well.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["fork", "dining table", "spoon", "knife", "wine glass", "broccoli", "carrot"], "hallucination_idxs": [5, 29, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.5714285714285714}}, {"image_id": 451228, "caption": "According to the image, a busy city street is filled with a variety of buses and a red double-decker bus. There are several buses on the street, including a group of buses on one side of the road, and another group of buses on the other side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 256365, "caption": " The image shows a plate filled with multiple cupcakes, each topped with blue frosting and a swirl of white frosting. There are six cupcakes in total, and they appear to be positioned evenly on the white paper plate", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "dining table", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": "According to the picture, a man is playing tennis in a large courtyard, using his tennis racket and a tennis ball. He is standing at the net, ready to hit the ball back to his opponent. The courtyard is surrounded by buildings, creating an outdoor environment for the game. The man is focused and attentive, as he prepares to serve the ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, there is a small black and white dog sleeping in a bed covered in various books and papers. The dog is curled up in the bed, possibly enjoying the comfort of being surrounded by the books and papers", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 51054, "caption": "According to the picture, an elderly woman is sitting in a bedroom, talking on a cell phone. She is sitting on a bed, with her hands holding the phone. The bed is made and there is a bed, indicating that the woman may be in a comfortable and private space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bed", "bottle", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 1268, "caption": "According to the picture, there is a park scene featuring a woman sitting on a bench next to the water, looking at a white swan swimming in the water. The woman has a handbag close to her. There is also a bridge in the background, with a few people walking around or under it", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["cell phone", "backpack", "boat", "handbag", "person", "bird"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 62692, "caption": " In the image, a young woman is playing tennis on a court. She is in the middle of the court, holding a tennis racket and preparing to swing at the approaching tennis ball. Her hair is tied in a ponytail, creating a dynamic look in her hair", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": "The image features an old-fashioned black and white photograph of a sitting room with a single chair in the middle, surrounded by various items. There are two vases displayed, with one placed close to the chair and the other on the right side of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "dining table", "potted plant", "chair", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 575406, "caption": " The image shows a small red and white parked on the street in front of a building, surrounded by a variety of vehicles. There are several cars in the background, including some parked closely to the bus, a truck near the left side, and another car on the right side", "mscoco_hallucinated_words": [["car", "car"], ["truck", "truck"], ["car", "car"]], "mscoco_gt_words": ["bus", "motorcycle", "potted plant", "person"], "hallucination_idxs": [28, 42, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 244834, "caption": "According to the picture, two people, a man and a woman, are walking down a sidewalk in the rain, each holding umbrellas. They are standing next to each other and appear to be together, possibly walking to their destination or just out for a stroll. A couple of cars can be seen in the background, but they seem to be less prominent than the individuals walking on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "bicycle", "clock", "umbrella", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 151159, "caption": " In the image, a group of people, including two skateboarders, are hanging out in an outdoor park. The skateboarders are positioned next to each other on the left side of the park, both on their skateboards", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277073, "caption": " The image shows a busy city street with several people walking and interacting with each other. One person is sitting on a motorbike, a man and a woman, who appear to be riding together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "motorcycle", "bicycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 377879, "caption": "According to the picture, a large military tank is being transported on train tracks. The tank is on a metal frame, placed on top of the train cars. The trains are parked and seem to be waiting to move", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["train", "bed", "truck"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": " According to the picture, there is an office space containing a laptop and a chair. The room is cluttered with papers and books, while other office supplies, such as a bottle, can be found in the background. The office space is located in front of a door, which can be seen in the image", "mscoco_hallucinated_words": [["book", "book"], ["bottle", "bottle"]], "mscoco_gt_words": ["chair", "mouse", "laptop", "dining table"], "hallucination_idxs": [24, 34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 63934, "caption": " According to the picture, there is a person in black garb, dressed as a nun, skateboarding on the sidewalk in front of a building. The person is located near the middle of the scene, with several cars parked around them in different areas of the parking lot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 275611, "caption": "According to the picture, there is a delicious looking pizza placed on a metal tray, presumably ready to be served or consumed. The pizza appears to have numerous toppings, including cheese and vegetables, making it a tempting option for anyone in the vicinity. Additionally, the dining table upon which the pizza is sitting serves as a background element", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "dining table", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "The image depicts a person sitting in front of a computer, typing away on a keyboard while using a mouse. There are multiple computer components in the scene, including a keyboard, a computer mouse, and a laptop situated at different locations around the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": "A large group of sheep is herded along a road by a bus, with the bus occupying a middle position between the front and back of the herd. There are several sheep visible near the bus and some farther in the distance. The scene appears to be on a narrow road or a path, with a mix of open and enclosed spaces nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sheep", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 422326, "caption": "According to the image, there are three motorcycles and a sidecar parked next to each other in a parking lot. The three motorcycles include a black motorcycle, a red motorcycle, and a silver motorcycle, while the sidecar is a silver color", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 110449, "caption": "According to the picture, there are two young men in the image. One man appears to be the DJ, while the other man is possibly a bartender or a guest. The DJ is sitting down, with his arms outstretched in front of him, while the bartender is standing next to him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 45550, "caption": " The image features a man posing for a picture while holding a plate with fried chicken on it. The man appears to be smiling, enjoying the moment of having his picture taken. The scene takes place in a restaurant with multiple tables and chairs set around it", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["clock", "bowl", "sandwich", "person"], "hallucination_idxs": [44, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 444913, "caption": "According to the picture, there is a man standing in a grassy area, possibly a park or backyard, holding a baseball bat. He is standing near a tree and has his back to the camera, looking in the direction of the tree. The grass is slightly scattered around the man, possibly indicating a recent activity in the area", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["car", "person", "frisbee"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": " The image depicts a black and white dog in motion, leaping into the air to catch a pink frisbee, with a group of people gathered around to watch the event. The dog is standing in a field, with the frisbee flying towards him, and the people are dispersed across the field, observing the playful activity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "frisbee", "bottle", "dog", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 132796, "caption": "According to the picture, a group of people are riding on the backs of elephants. There are two elephants present in the image, with people seated on their backs and enjoying the experience. One person is on the left side of the first elephant, while another is on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "bench", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "According to the picture, there are two cats sitting on a toilet, looking at the camera as they pose together. The cats are placed on the toilet bowl, with one cat located closer to the front edge and the other cat positioned further back", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["bottle", "toilet", "cat"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": "According to the picture, there is a group of motorcyclists traveling down a tree-lined street. Several riders are visible, with their bikes in front of them. Some of the motorcycles appear to be traveling in a line, while others are trailing behind. There are multiple cars parked along the sides of the road", "mscoco_hallucinated_words": [["bike", "bicycle"]], "mscoco_gt_words": ["motorcycle", "car", "person", "truck"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 205193, "caption": " In the image, there is a toaster oven on the left side with a bag of black tea and a cup on top of it. The cup is full of hot tea", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"]], "mscoco_gt_words": ["oven", "microwave", "person", "toaster"], "hallucination_idxs": [21, 28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 272566, "caption": "The image features a man sitting on the ground with a bunch of bananas in front of him. He is surrounded by various baskets full of fruits, including bananas, apples, and oranges", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["banana", "person", "apple", "cup"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 170099, "caption": "According to the picture, a man stands in front of a living room couch, chairs, and other furniture. The man is wearing a suit and a tie, giving a speech or presenting in front of an audience. Another person can be seen standing close to the man, and there are also two cups visible on the couch", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"]], "mscoco_gt_words": ["tie", "orange", "couch", "person"], "hallucination_idxs": [16, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.75}}, {"image_id": 14940, "caption": "According to the picture, the scene features a cat and a stuffed animal cuddling together on a bed. The cat appears orange and white and the stuffed animal is a small, brown one. They both seem to be comfortably resting on the bed, enjoying each other's company.", "mscoco_hallucinated_words": [["bed", "bed"], ["bed", "bed"]], "mscoco_gt_words": ["cat", "chair", "couch", "orange", "dog"], "hallucination_idxs": [18, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 408449, "caption": "In the image, a young man is riding a skateboard down a city street, passing an empty fountain nearby. There are several other people present in the scene, dispersed around the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "handbag", "skateboard", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 140702, "caption": " In the image, a baby elephant, with its mother walking alongside it, is the main focus of the scene. The baby elephant is standing close to the left edge of the frame, while the adult elephant is located closer to the center of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 354540, "caption": "According to the picture, a man is sitting on a kitchen counter, likely eating or drinking while using a laptop. There are various cables and electronic equipment surrounding him, with a laptop placed on a wooden surface near him. A few other items are also in the kitchen, including a cell phone, multiple bottles, a microwave oven, and a chair", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["cell phone", "oven", "chair", "person", "toaster", "laptop", "microwave"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.8571428571428571}}, {"image_id": 19763, "caption": "According to the picture, there is a close-up image of a piece of deep-dish pizza sitting on a white plate. The pizza is covered in cheese and appears to be a delicious piece of food. The plate is surrounded by two glasses of beer, suggesting that the pizza is being enjoyed in a social setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 191301, "caption": "According to the picture, there are two people sitting at a table, enjoying their desserts. One of them is holding a spoon and eating a chocolate cake, while the other woman is eating an ice cream cone. They appear to be in a restaurant, as there is a cup and an additional utensil on the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 124236, "caption": "The image features a small, dimly lit kitchen with appliances such as a microwave, an oven, and a sink. The kitchen also has an assortment of pots and pans arranged on two tables. A few bowls are scattered on one table, while the other contains multiple knives, some of which are leaning to the right. The kitchen is lit by sunlight coming from the left side, making it appear brighter compared to the rest of the room", "mscoco_hallucinated_words": [["microwave", "microwave"], ["sink", "sink"], ["knife", "knife"]], "mscoco_gt_words": ["bowl", "dining table", "oven", "spoon", "toaster"], "hallucination_idxs": [14, 21, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6}}, {"image_id": 344930, "caption": "According to the picture, there is a red fire hydrant situated on the sidewalk, close to the edge of a street. It is surrounded by green grass, indicating that the area is possibly in a residential neighborhood. In the background, a car is parked, which could be the reason for the fire hydrant's presence", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "car", "fire hydrant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " The image captures a white, fluffy cat lying on a bed in a room. It appears to be a bedroom, with a small TV placed towards the left side of the room and several books spread throughout the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "cat", "bed", "chair", "book", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 82488, "caption": "According to the image, there is a room with two beds, a lamp, and a clock on the left side. Both beds are similar in size, but one has an orange blanket while the other has a light brown blanket. There is a painting or photograph on the wall behind the beds", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "bed", "clock", "book", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 348251, "caption": "According to the picture, a person, dressed in a green jacket, is walking down a ski slope. He is wearing skis and carrying a backpack, likely to ski or snowboard", "mscoco_hallucinated_words": [["snowboard", "snowboard"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 150703, "caption": " According to the picture, there is a close-up image of a delicious Chicago-style hot dog, wrapped in a bun and served in a dish. The hot dog is placed next to each other on a table, creating a combination that those who enjoy both hot dogs and beer may find appealing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "dining table", "sandwich", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 381615, "caption": "According to the picture, a large window in someone's home is filled with various lush green plants. There are several plants in different spots, creating a vibrant display. The plants are placed on a table, and a chair is also present", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["vase", "potted plant", "chair", "book", "cup"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 352703, "caption": " In the image, a woman is standing in front of a dining table preparing a dish. She is wearing an apron and holding a bowl in her left hand while stirring the contents in the bowl with a spoon. There are several chairs surrounding the dining table, one of which has a bottle resting on it", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["bowl", "dining table", "oven", "spoon", "bottle", "handbag", "person", "cup"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.625}}, {"image_id": 111109, "caption": "The image shows a young boy dressed in a white shirt, standing in a room with a chair nearby. He appears to be posing for the camera, looking confident and smiling. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "chair", "tie", "person", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 226802, "caption": "The scene takes place on a rainy city street with a large red \"Xing\" sign in the center. There is a bus, or a red double-decker bus, parked on the right side, partially blocking the view. Several people can be seen walking and standing along the street in different areas", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 49005, "caption": "According to the picture, the image shows a baseball game underway at a field with several players on the field. The main focus of the scene is a pitcher standing on the mound, pitching a ball. There is a catcher positioned behind home plate, watching the pitcher and waiting to catch the incoming baseball", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": " In the image, there is a large group of cows walking down the side of the road. They are spread out over a wide area and appear to be in motion, with some animals closer to the road's edge and others in the middle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 100274, "caption": "According to the picture, the scene is an empty train station or rail yard, with railroad tracks running through the center and several train cars scattered around the area. There are several railcars parked on the ground, with some placed close to the tracks and others at a bit of a distance. The image depicts an empty, somewhat quiet and unused environment, with numerous trains present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 317254, "caption": "The image captures a person riding a black motorcycle down a city street, wearing a mask. There are two cars in the scene, one white and the other one is smaller, also white", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": "According to the image, there is an empty, large, shiny kitchen with various pieces of equipment. Several silver refrigerators are visible in the scene, located along the edges of the kitchen. There are also silver stoves and several sinks in the kitchen, dispersed throughout the room", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 28842, "caption": "The image features a park area where a skateboarder is attempting a difficult ramp and a second person is nearby, possibly waiting or watching the skateboarder. The skateboarder is using the ramp located in the center of the scene, while the person next to them is standing further away to the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 319607, "caption": "The image features a busy street with multiple traffic lights, several tall buildings, and a person on a bike. The road intersects with other streets in the urban area, and several cars can be seen on the streets and near the buildings. People are walking on the sidewalk, contributing to the busy and active atmosphere in the area. The overall scene appears to be a typical city setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 210372, "caption": "According to the picture, it depicts a small red kitchen with white cabinets and a sink, a small window towards the upper part of the image, and a refrigerator or similar appliance at the side of the sink. Various dishes can be seen in the sink, creating a cluttered and dirty appearance in the kitchen", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bottle", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 156915, "caption": "According to the picture, a single red rose is sitting in a glass vase. The rose is located near the middle of the arrangement, accompanied by white and green flowers, creating a visually appealing bouquet. The arrangement is displayed on a wooden table, adding a touch of elegance to the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 349642, "caption": "According to the image, a man is sitting on a bench next to his dog, who is sitting next to him, in a park-like setting. The man appears to be wearing a dark brown hat, while the dog is a small brown one. They seem to be enjoying some time together, possibly taking a stroll outdoors. There are also several other benches in the scene, though they are not occupied", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": "According to the picture, there is a vibrant gathering of people and boats at the water's edge. The boats are spread across the scene, some larger and some smaller, with an array of umbrellas set up among them. The people in the scene appear to be engaged in conversation, enjoying the atmosphere at the water's edge", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "handbag", "person", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45435, "caption": "According to the picture, there is a person holding a cell phone in their hand. The phone displays a white and black screen, which can be seen from the image. The person appears to be holding the cell phone in one hand and possibly checking it or working with it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": "According to the image, there is a man sitting at a desk surrounded by various electronic devices such as a computer, two laptops, and a telephone. He appears to be typing on a keyboard while wearing a white shirt. The man is in the process of working, likely sitting at a desk working on his computer. There is another laptop positioned further away from the first, which seems to be connected to a TV", "mscoco_hallucinated_words": [["telephone", "cell phone"], ["keyboard", "keyboard"]], "mscoco_gt_words": ["dining table", "chair", "bottle", "person", "laptop", "tv", "cup"], "hallucination_idxs": [28, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5714285714285714}}, {"image_id": 72656, "caption": "According to the picture, a small outdoor gathering is taking place with two children sitting at a small wooden table with chairs, enjoying their lunch or snacks together. The wooden table has colorful chairs, and they are sharing a meal or playing together, smiling and having fun. On the table, there is a plate with food on it, and another plate nearby, indicating the children are enjoying food", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "chair", "person", "pizza", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 372087, "caption": "According to the picture, there is a green train on the tracks traveling towards a station. The train appears to be parked at the station, waiting for passengers to board or disembark. There are several people visible in the scene, some standing near the train and others walking along the platform.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 336209, "caption": " In the image, a young man is performing a trick on a skateboard, flying through the air while jumping over a bench. The man appears to be enjoying the action and is the main focus of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 46031, "caption": "According to the picture, there is a laptop sitting on a desk, with a mouse and a keyboard in front of it. The desk is also surrounded by several plants, including a couple of different species. In addition to the laptop and peripherals, there are a couple of monitors set up with screens, and another computer positioned atop the desk", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "mouse", "keyboard", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 283118, "caption": "According to the picture, a family of three is sitting at a table in a restaurant, enjoying their meal together. The little girl is smiling, and the father looking at her, possibly taking a photo of her. They are sharing a meal with hamburgers, creating a happy and memorable dining experience", "mscoco_hallucinated_words": [["hamburger", "sandwich"]], "mscoco_gt_words": ["cell phone", "backpack", "dining table", "chair", "bottle", "handbag", "car", "person", "pizza"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2222222222222222}}, {"image_id": 429598, "caption": " According to the image, there is a kitchen with vintage-style wooden cabinets, a stove, a refrigerator, and a sink. The sink is in the middle of the kitchen area, while the stove is on the left side. A refrigerator is placed to the right of the sink, and the wooden cabinets are above the stove and to the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "bottle", "refrigerator", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 333365, "caption": "According to the picture, there are two airplanes parked next to each other at an airport. One of the airplanes is red, and the other is white. They are positioned on a runway, which is likely to be a taxiway", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 424174, "caption": "The image depicts a large plate filled with a variety of fruit, including bananas, apples, oranges, and tangerines, all arranged together in a visually appealing manner. The table is wooden, and the plate containing the fruit is on top of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "banana", "orange", "apple"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 3035, "caption": "The image depicts an airport tarmac with two large airplanes parked on the ground. One airplane is closer to the viewer, while the other one is further away in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 177213, "caption": "According to the picture, there is a white plate with a cheese pizza on it sitting on a table. It appears to be the main focus of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 438535, "caption": "The image depicts a large city street with a large clock displaying time in the middle of the street, sitting on a black pole. Surrounding the clock, there are many cars parked on the street and a few are in the near vicinity. The clock is facing a busy traffic intersection with tall buildings visible in the distance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 490711, "caption": "The image displays a kitchen with white appliances, including a refrigerator on the left side and a sink on the right side. In the middle, there is a stove surrounded by white cabinets. There are two bottles visible, one near the refrigerator and another closer to the middle of the kitchen", "mscoco_hallucinated_words": [["stove", "oven"], ["bottle", "bottle"]], "mscoco_gt_words": ["sink", "banana", "refrigerator"], "hallucination_idxs": [31, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "According to the picture, there is a small black and white kitten sleeping on top of a table. It seems to be a domestic scene where the kitten is napping while also close to a mouse. There is another object, possibly a backpack, lying nearby", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["dining table", "cat", "mouse", "laptop", "cup"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 456756, "caption": " The image features a large giraffe standing in a park or enclosed space, likely a zoo or a wildlife exhibit. The giraffe appears to be enjoying its surroundings and taking a break", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 281028, "caption": "According to the picture, there is an image of a train interior featuring a pair of empty blue chairs placed against the wall and a pair of larger chairs arranged near the windows. The seats are designed to provide a comfortable space for passengers. The image suggests that the train could be a commuter vehicle or a modern train, as it features modern interior design and the chairs are situated in a way that provides ample view of the outside", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "chair", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 343937, "caption": " According to the picture, it features a young woman wearing a black and white snowboard standing on a snowy slope, preparing to ride down the slope. She is wearing a hat, indicating her readiness to navigate down the hill. There are also a few additional people in the image, spread out at various places", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": " The image features two small birds, one on the left side and the other on the right side, standing on a table in the center of the scene. They appear to be eating a piece of food on a plate placed on the table, which could be pizza", "mscoco_hallucinated_words": [["pizza", "pizza"]], "mscoco_gt_words": ["dining table", "chair", "bottle", "knife", "bird"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 145666, "caption": "According to the picture, the image depicts a large bowl filled with fresh vegetables on a stainless steel countertop. There is a mixture of various vegetables including broccoli and carrots. The broccoli and carrots are arranged in a neat, organized manner, making it easy to identify them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "broccoli", "dining table", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 304984, "caption": "The image showcases a meal consisting of various food items placed on a table. A sandwich and a serving of pickled onions are the main dishes on a white plate, accompanied by a bowl filled with orange juice", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["fork", "bowl", "cake", "sandwich", "wine glass", "orange"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 122467, "caption": "The image depicts a person, possibly a child, holding a video game controller in one hand, with the other hand placed on top of the controller. There is a white background surrounding the person, making it easier to focus on the controller and the hand. The controller appears to be a remote for a video game system, such as a Nintendo Wii", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "remote", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 518287, "caption": "The image features a white toothbrush with a big smile, as if it is enjoying a funny moment. The toothbrush is placed inside a cup or container, likely made to store it when not in use. The smile is quite prominent, and it's surrounded by a blue color that is not too dissimilar from the color of the toothbrush's handle", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["orange", "toothbrush", "dining table", "cup"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 145448, "caption": "According to the picture, a black cat is sitting on a motorcycle, taking a rest. There are other items present, such as a mug in the background, a bottle near the left edge, and a backpack slightly right of center", "mscoco_hallucinated_words": [["bottle", "bottle"], ["backpack", "backpack"]], "mscoco_gt_words": ["motorcycle", "dining table", "cat", "chair", "car", "person"], "hallucination_idxs": [33, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 352900, "caption": " The image features a plate of pasta topped with a variety of ingredients. A generous amount of broccoli is spread across the plate and mixed with various pasta noodles", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "broccoli", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 153607, "caption": " The image features a blue glass filled with water, and inside the vase, there is a flower. The flower is placed on top of the water and appears to be floating", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "bowl", "dining table", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 157581, "caption": "According to the picture, there is an array of wine bottles lined up on a dining table, showcasing various wine types. The bottles are arranged horizontally on the table, with one bottle in the center of the display and several other bottles surrounding it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 227438, "caption": "The image showcases a large, luxurious bedroom with a comfortable bed. The bed is situated at the center of the room, positioned against a wall", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "clock", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 116037, "caption": "According to the picture, a young woman is participating in a softball game. She is playing at the plate and has a bat in hand. The game is being played in a field, with a large crowd watching from the stands. Many people, both young and old, are in various positions around the field, focused on the action taking place at the plate", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 202658, "caption": "According to the picture, a white toilet is sitting next to an unknown object in an open area. The white toilet has a pink character on it. It is not certain if this character is associated with a particular brand or product", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bowl", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 18150, "caption": " According to the picture, it appears to be a family setting with a young boy and a man sitting on a bed, possibly in a living room. The boy is eating a piece of pizza, while the man is sitting next to him and eating some as well", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["backpack", "couch", "bottle", "person", "pizza"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 418711, "caption": "According to the picture, the scene depicts a man riding a bicycle on the street, carrying a large basket of bananas on the back of the bike. The bananas appear to be a significant amount, potentially going to market or to sell", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image shows a bowl of soup with corn and a spoon next to it on a table or surface. The soup has a light, creamy appearance, and there is corn visible on top of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "bowl", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 491000, "caption": " A group of people are walking down a street in a city, possibly in Latin America. There are three adult men walking on the sidewalk, and another person can be seen walking nearby. The street is lined with old buildings, some of which have the same color as the clothing of the people, likely a brown color", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "car", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 401163, "caption": "According to the picture, two men are sitting on a bench in a park, surrounded by trees and greenery. One of the men is elderly and is enjoying the park's scenery while sitting on a bench with his dog by his side. The other man is also enjoying the park, likely resting and taking in the peaceful atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": "According to the image, there is an arrangement of various items on a floor, including a backpack, notebooks, pens, pencils, and a cell phone. These items appear to be neatly placed and organized. There are three notebooks arranged in a row, with the middle one located slightly closer to the cell phone", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "scissors", "backpack", "book", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 268293, "caption": "The image shows a young man playing tennis on a court, with a tennis racket in his hand. He is swinging a tennis racket and has a ball in front of him, possibly getting ready to hit it or after a successful serve", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": "According to the picture, there is a kitchen with a white refrigerator, a dishwasher, and a stove. Various items are scattered throughout the kitchen, including a bottle of water, a glass bottle, and a jar filled with ketchup. There is also a wine bottle on the counter and multiple bottles in the oven", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["vase", "oven", "spoon", "bottle", "clock", "sink", "apple"], "hallucination_idxs": [12], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.2857142857142857}}, {"image_id": 358086, "caption": "The image features a group of people, likely playing soccer together, with several men on the field and a few more spectators watching the game. The players are spread out across the field, with some in the foreground and others in the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "bottle", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 89541, "caption": "In the picture, there is a yellow container filled with various types of food, including tofu, mushrooms, and a variety of fruits such as bananas, oranges, and strawberries. The fruit is spread out across the dish, creating a colorful mix", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["banana", "cake", "bowl", "dining table"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 239445, "caption": " In the picture, there is a young man with a frisbee in his hands, posing for a photo. He is smiling and appears to be happy while holding the frisbee", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": "According to the picture, there is a young kangaroo standing on a grassy field. The kangaroo appears to be enjoying its time in the grassy area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 487990, "caption": "The image depicts a group of people seated at a picnic table, surrounded by various food items. There are several individuals, including a young woman, who is holding a guitar, playing a song", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "donut", "spoon", "bottle", "cake", "knife", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 138834, "caption": "A person is walking through the snowy terrain while holding ski equipment. He is wearing skis and a backpack, and there is a car parked in the background", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["skis", "car", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 492057, "caption": " According to the picture, a man is taking a selfie in a public restroom, wearing a tie and smiling. The reflection of the man is visible in a mirror behind him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388215, "caption": "According to the picture, a young man in a white shirt is playing a tennis match, holding his tennis racket as he serves the ball. The man is on a tennis court, possibly practicing or playing a game with his friends. In addition to the main player, there are a few other people in the background, possibly observing the game or engaging in other activities", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 111409, "caption": "According to the image, a man is riding a motorcycle on a cement surface, possibly participating in a motorcycle race or enjoying a leisurely ride. He is wearing a helmet and riding a white and orange motorcycle, and he appears to be enjoying riding the motorcycle. A handbag is placed on the ground, and there are a couple of bottles visible on the ground as well, possibly belonging to the man or other individuals in the scene", "mscoco_hallucinated_words": [["orange", "orange"], ["handbag", "handbag"], ["bottle", "bottle"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [38, 52, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": "The image features a person holding a large bouquet of green broccoli in front of them. The broccoli has numerous florets in different stages of growth and is quite substantial", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92462, "caption": "According to the picture, a group of cows is walking down a dirt road, likely on their way to graze or get some exercise. The cows are of various sizes, with some closer to the camera than others", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "car", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 397268, "caption": "According to the picture, there is a storefront with bicycles and chairs displayed outside. The bicycles appear to be on display, with one in the foreground and another in the background. The store is located in a busy city, where people passing by can see the bikes", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["bicycle", "chair", "bench", "potted plant"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 127659, "caption": "The scene depicts a train, which appears to be a commuter or regional train, on the tracks as it travels on the left side of the image. Two people are present in the vicinity, with one person standing near the center and the other walking closer to the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 251857, "caption": "According to the picture, there is a group of young men in a skate park, enjoying a sunny day. There are three men standing in the park, with two of them wearing striped outfits. Among them, one man is on a skateboard, enjoying a skate session", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 243212, "caption": "According to the picture, a young boy is standing in a grassy field holding a blue frisbee. There are several vehicles parked outside the field, likely in a parking lot or driveway", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "car", "truck", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 444304, "caption": "According to the picture, there is a man sitting on a couch, watching a television with a remote control in his hand. Near the man, there is a chair, and to the right of the TV, there is another chair", "mscoco_hallucinated_words": [["couch", "couch"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["person", "remote", "tv", "bed"], "hallucination_idxs": [12, 32, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.75}}, {"image_id": 305000, "caption": "According to the picture, there are various fruits and vegetables on display at a produce market. A woman stands at the market, surrounded by an array of produce. She is wearing a black and white striped shirt and black pants", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "apple", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 44136, "caption": " The image shows two men dressed in tuxedos seated at a table while attending a formal event. Both of them are smiling and seem to be enjoying themselves. One of the men is holding a beer bottle, and the other man has a wine glass with him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 154846, "caption": "The image features a metal pan filled with several types of pizzas. The pizzas appear to be sliced and arranged across the pan in a variety of sizes", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 567254, "caption": " According to the picture, the scene features a young girl sitting in front of a TV, watching a show or movie. She has a stuffed animal in her hand, possibly holding it while watching TV. The TV is placed further into the room, suggesting that it is a large or flat-screen model.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bottle", "tv", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 79261, "caption": "According to the picture, there is a man standing on a street in the middle of a large fountain filled with water. The man appears to be cleaning the water using a mop. There is a sign nearby that says \u201cplease walk,\u201d indicating the man may be promoting the importance of clean water", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "backpack", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": "In the image, a man is sitting in a chair, using his laptop in the dining room. The scene shows two tables: one located closer to the man and the other placed towards the back of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "chair", "bottle", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 415828, "caption": "According to the picture, there are several people gathered on a city street, likely enjoying a conversation or just spending time together. Some people are wearing backpacks, suggesting they might have just come from work or school. They stand in front of a house or building, which has a red roof and white walls", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "stop sign", "umbrella", "handbag", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 543231, "caption": "According to the picture, there is a woman playing a video game using a remote, with several people in the background. She is seated and surrounded by others who are likely watching her play. The remote she is using is located right in the middle of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "bowl", "bed", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 495357, "caption": "The scene depicts a man riding a motorcycle down a city street, surrounded by several pedestrians on the sidewalk. Some of the pedestrians are wearing ties, possibly indicating they are dressed professionally or attending a formal event", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 523175, "caption": "According to the picture, it appears to be a bowl filled with soup. The soup is made with a variety of ingredients, including carrots and a variety of vegetables. A spoon can be seen in the picture, which is likely used for scooping up the soup or serving it. There are two oranges and one carrot present in the image as well", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["spoon", "bowl", "carrot", "dining table"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 184945, "caption": " The image features a line of motorcycles parked next to each other in a parking lot, creating a busy scene. There are three motorcycles visible in the middle of the image, while two more motorcycles are located closer to the right side of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 49942, "caption": "The image shows a young man wearing a tie and holding a black umbrella up to his face. He is standing upright, with his head positioned below the umbrella", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "According to the image, a blue train has arrived at a train station, where multiple people are standing on a platform. They seem to be waiting for the train to depart or simply observing the train while it stops at the station. The train appears to be an old-fashioned train with a white color scheme, and it has a prominent \"T\" shape in the middle of its front", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "train", "handbag", "car", "person", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 42889, "caption": "The image features a teddy bear sitting on a table, with glasses and a wireless keyboard beside it. The teddy bear appears to be wearing a pair of glasses, while the keyboard is placed on the table. In the background, there is another electronic device, possibly a computer or a laptop, covered in various items like books, pens, and a cup, likely representing someone working with technology or using it for personal interests", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["computer", "laptop"], ["laptop", "laptop"], ["book", "book"], ["cup", "cup"]], "mscoco_gt_words": ["teddy bear", "bear", "keyboard"], "hallucination_idxs": [9, 39, 53, 56, 63, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": "According to the picture, there is a wooden table full of various objects including a couple of potted plants. One of the potted plants is a blue vase with green plants, and another vase features a small plant inside it. There is also a skull-shaped item on the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 247587, "caption": " The image features a man standing on a surfboard on a sandy beach. The surfboard is spread out in front of the man, and he appears to be waving to someone in the distance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "There is a large city bus parked at the end of a street. The bus has several passengers, including a woman standing at the bus's front and back of the bus. The bus has several front and rear doors open, giving access to the passengers", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 545056, "caption": "The image shows a vast, green field with a herd of grazing cows. Many of them are black, while others are brown or possibly a mixture of the two colors. The cows are spread out throughout the field, with some of them lying down in various positions, while others are standing up and grazing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sheep", "dog", "car", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": "According to the image, there is a man standing in a kitchen surrounded by a variety of food items. He appears to be preparing food, possibly a meal, as he has multiple foods in front of him, and he is attentive to his task.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "oven", "book", "person", "carrot", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": "According to the picture, a group of snowboarders are gathered around a sign, getting ready to ski or snowboard down the slope. There are five people in total in the scene, and they all appear to be wearing backpacks and snow gear. Some of the snowboards can be seen on the ground, likely belonging to the people in the group", "mscoco_hallucinated_words": [["snowboard", "snowboard"], ["snowboard", "snowboard"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [20, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 246963, "caption": "According to the picture, a group of motorcyclists is riding down a road, with some of them wearing helmets. There are several motorcycles visible, including some near the stop sign, which is located on the side of the road. A few of the motorcyclists appear to be holding hands as they ride, forming a social bond", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["stop sign", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498225, "caption": "A group of people is standing in front of a display of desserts in a store. They appear to be admiring and enjoying the selection of desserts, with some looking more closely at the cakes and others just taking in the scene. There are several cakes on display, varying in size and shape", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "donut", "cake", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 124601, "caption": "According to the picture, it is an image of a small kitchen with a white, stainless countertop filled with various items. A laptop is placed on the countertop, while a couple of bottles are situated on different areas", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "keyboard", "sink", "wine glass", "microwave", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 581749, "caption": " The image displays a variety of vegetables, including broccoli, carrots, green beans, and oranges. The broccoli is placed in the upper right corner of the image, followed by the carrots and green beans on the left and right sides", "mscoco_hallucinated_words": [["carrot", "carrot"], ["carrot", "carrot"]], "mscoco_gt_words": ["orange", "broccoli", "dining table"], "hallucination_idxs": [11, 35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 227960, "caption": " The image captures a dining table that is filled with various items. At the table, there are multiple pieces of silverware, including two forks and two spoons, as well as two carrots.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["fork", "fork"]], "mscoco_gt_words": ["bowl", "spoon", "wine glass", "carrot", "cup"], "hallucination_idxs": [5, 15, 26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.4}}, {"image_id": 286503, "caption": "According to the picture, there is a young elephant standing on the ground at a petting zoo or other outdoor environment. The elephant is close to a tree and appears to be looking at something. Surrounding the elephant are several people, who might be visitors observing the animal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": "The image is of a modern kitchen featuring two stainless steel sinks and granite countertops. The kitchen is filled with various cooking utensils and is well-appointed with various cabinets. There are three chairs present in the kitchen area, as well as a large couch in the dining area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "potted plant", "oven", "chair", "couch", "sink", "tv", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 142108, "caption": "According to the picture, there is a group of young people, including adults, standing around a dining table with several bananas on it. They appear to be engaged in a fun and educational activity, possibly blindfolded, where they are learning about various foods and working on [IDK]. Some of the bananas are placed closer to the edge of the table while others are situated more towards the middle of the dining area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": "A baseball game is in progress with a batter, ready to swing, in the batter's box. The batter is holding a baseball bat and is prepared for the pitch. A catcher and an umpire are also present in the scene. In total, there are 12 people in the picture, which is likely a group of players and spectators", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "chair", "sports ball", "bench", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 475043, "caption": "According to the picture, there are a few people in a room, sitting on couches or chairs, possibly having a relaxed gathering or social event. Some of the individuals are sitting on the floor, and one person is drinking from a cup. A laptop is also present in the scene, either being used or brought as an additional element to the event", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"]], "mscoco_gt_words": ["bowl", "spoon", "couch", "bottle", "person", "laptop"], "hallucination_idxs": [18, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 456784, "caption": "In the image, a man is holding a knife and slicing a cake, cutting it into smaller portions. The man's position and the knife's movement suggest that he is in the process of serving the cake to others", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": "According to the picture, it depicts a grey bench in a park setting. A remote control is sitting on the bench, appearing to be a key component in the design", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "remote", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 448694, "caption": " The scene shows a group of people actively playing in a park with a frisbee. The frisbee is being thrown by a man in the foreground, and two more people are in close pursuit", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": " The image displays a bowl of food filled with noodles and vegetables. A spoon can be seen within the bowl, suggesting the dish is ready to be enjoyed", "mscoco_hallucinated_words": [["spoon", "spoon"]], "mscoco_gt_words": ["fork", "bowl", "dining table", "broccoli", "carrot"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 503210, "caption": "According to the picture, a spacious and bright living room with a wooden floor and yellow walls is furnished with several items. There are two couches in the room, one closer to the middle and the other in the right corner. A piano can be seen in the left corner, and a television is positioned near the couch in the middle of the room", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["vase", "potted plant", "chair", "couch", "clock", "book", "orange"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.14285714285714285}}, {"image_id": 142257, "caption": "According to the picture, the scene is a busy city street with multiple cars driving down the road. There are four cars present, with a red car, a blue car, and two other cars, one in the center of the street and the other towards the left side. People are present as well, moving around the city. Some people can be seen in the vicinity of the cars, traveling to various destinations", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}], "overall_metrics": {"CHAIRs": 0.244, "CHAIRi": 0.09349593495934959, "Recall": 0.5015945330296128, "num_hallucinated_caps": 122.0, "num_caps": 500.0, "hallucinated_word_count": 207.0, "coco_word_count": 2214.0, "length_response": 52.864, "hallucinated_caps_ls": [144305, 164091, 54870, 324008, 52700, 39053, 493862, 128119, 401027, 507171, 231534, 209347, 12375, 2154, 104326, 300784, 409346, 344942, 133146, 185327, 348654, 454161, 529256, 368367, 295656, 401010, 313928, 346940, 278627, 122602, 24104, 59710, 527497, 411564, 61959, 5934, 131138, 258209, 285820, 245664, 453731, 87052, 418692, 259345, 449798, 375769, 426172, 333998, 443259, 301837, 309071, 240681, 47781, 14773, 261318, 134768, 526414, 188136, 357, 561967, 151081, 134223, 301266, 475813, 558784, 314016, 564629, 53431, 381031, 9628, 163518, 423984, 118124, 252625, 328110, 88269, 98979, 536154, 416384, 531327, 1268, 575406, 377879, 436814, 45550, 444913, 303099, 271565, 205193, 272566, 170099, 14940, 354540, 124236, 348251, 381615, 352703, 49005, 425569, 35827, 283118, 490711, 163728, 277051, 304984, 518287, 145448, 18150, 474499, 89541, 138834, 111409, 397268, 444304, 523175, 42889, 2867, 581749, 227960, 475043, 99053, 503210]}}}