{"overall_metrics": {"CHAIRs": 0.338, "CHAIRi": 0.11631944444444445, "Recall": 0.5480637813211845, "num_hallucinated_caps": 169.0, "num_caps": 500.0, "hallucinated_word_count": 335.0, "coco_word_count": 2880.0, "length_response": 72.166, "hallucinated_caps_ls": [144305, 164091, 54870, 324008, 52700, 39053, 493862, 128119, 401027, 507171, 231534, 227031, 209347, 12375, 13550, 2154, 104326, 300784, 409346, 344942, 275929, 465601, 490491, 133146, 185327, 348654, 482432, 454161, 129707, 203110, 451943, 529256, 37907, 368367, 295656, 401010, 313928, 490284, 346940, 278627, 400851, 122602, 24104, 59710, 527497, 411564, 285302, 61959, 5934, 131138, 258209, 285820, 245664, 453731, 87052, 418692, 259345, 119709, 378831, 449798, 375769, 426172, 152499, 460866, 333998, 443259, 301837, 309071, 240681, 47781, 14773, 261318, 134768, 526414, 441363, 188136, 89378, 99874, 357, 561967, 151081, 134223, 572081, 301266, 475813, 558784, 314016, 564629, 53431, 381031, 9628, 330050, 163518, 423984, 499486, 118124, 288729, 252625, 328110, 367872, 88269, 98979, 536154, 416384, 531327, 1268, 575406, 277073, 377879, 436814, 45550, 444913, 303099, 271565, 205193, 272566, 170099, 14940, 140702, 354540, 191301, 124236, 82488, 348251, 381615, 352703, 226802, 49005, 425569, 423008, 35827, 336209, 283118, 429598, 490711, 163728, 456756, 277051, 145666, 304984, 518287, 145448, 352900, 18150, 418711, 39663, 474499, 89541, 239445, 105782, 138834, 111409, 397268, 444304, 305000, 44136, 301402, 415828, 543231, 523175, 42889, 2867, 124601, 581749, 227960, 475043, 456784, 99053, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "The image features a computer with a laptop open on top of a mouse. The mouse is connected to the laptop, and there are multiple computer peripherals, including a keyboard, a monitor, and two mice placed at different heights. The laptop is using a cable and a second mouse, suggesting that it is connected to the computer through a network or a wireless connection.", "mscoco_hallucinated_words": [["mouse", "mouse"], ["mouse", "mouse"], ["monitor", "tv"], ["mouse", "mouse"], ["mouse", "mouse"]], "mscoco_gt_words": ["keyboard", "laptop", "dining table"], "hallucination_idxs": [13, 16, 35, 39, 54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4166666666666667, "Recall": 0.6666666666666666}}, {"image_id": 164091, "caption": "According to the picture, a doll is sitting in a baby carriage, surrounded by beds. There is a chair on the left side of the scene and two other dolls, one of which appears to be a baby, are located around the baby carriage. The baby carriage, surrounded by the beds, is designed to resemble a traditional, old-fashioned type, with the baby sitting on the carriage. Some dolls are placed on the beds, and there is a bed close to the baby carriage. The arrangement of the furniture is likely part of a display or a toy collection.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "bed", "person"], "hallucination_idxs": [21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.6666666666666666}}, {"image_id": 308003, "caption": "According to the picture, a wooden bench is sitting in a grassy area, surrounded by trees. The bench has a green and white striped pattern on it, and it appears to be in a shaded area, providing a comfortable place to sit and enjoy the outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": "The image shows a sandwich, lettuce, and tomato sitting on a white table. The sandwich consists of a meat and a vegetable, while the sides feature a plate of salad. The salad has vegetables such as pickles and tomatoes arranged in a pleasing fashion. A knife can be seen in the foreground, likely used to cut into the sandwich. A fork is also present, possibly for serving or consuming the sandwich. The plate of salad appears fresh and appealing.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "fork", "knife"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 95908, "caption": "The image showcases a silver cup or mug placed on a keyboard, likely a computer keyboard. This makes it appear as if the cup has been placed in a place where someone might usually sit down with a computer or other electronic device. There are multiple keyboards around the cup, including a mouse pad close by. The computer, mouse, and keyboard create a scene with technology and a cozy, sitting area ambiance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 329614, "caption": "According to the image, there is a church situated near a street with two lanes and a sidewalk. A car is driving down the street in front of the church, and a red bus is further down the street. There are also some pedestrians visible in the area, with some closer to the car and others scattered around. The overall scene suggests a busy street in a city with a religious structure nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 324008, "caption": "The image depicts a tray of food, with various items placed on it. The tray includes a banana, a couple of oranges, yogurt, a sandwich, and a muffin. The food is arranged in an assortment of containers, including cups and bowls. There are also two bottles and two cups on the tray.", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["container", "bowl"], ["bowl", "bowl"], ["bottle", "bottle"]], "mscoco_gt_words": ["cup", "dining table", "train", "orange", "airplane", "person", "banana"], "hallucination_idxs": [29, 43, 48, 54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.42857142857142855}}, {"image_id": 52700, "caption": "According to the picture, there is a group of people standing around a table filled with a variety of cakes and desserts, all on different plates and decorated differently. The cakes range from chocolate cake to vanilla, carrot, and poppyseed ones. There are also several forks and knives on the table, ready to be used for serving the food. ", "mscoco_hallucinated_words": [["carrot", "carrot"], ["knife", "knife"]], "mscoco_gt_words": ["handbag", "dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [41, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 501311, "caption": "According to the picture, there is a baseball player pitching a baseball on a grassy field. The pitcher is in the middle of the field, while a crowd of people stands around the pitcher watching the game. There are a total of 15 people visible in the scene, some of whom are likely fans attending the game. The baseball game is set on a baseball field with a green infield and outfield, with a baseball being thrown during the action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": "According to the image, it appears to be a street scene during the day, possibly during a cultural event or a parade. There are three elephants, two of which are decorated, in the scene, accompanied by several people. Some of the people are walking down the street, while others are standing next to the elephants.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "clock", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": "According to the picture, the bathroom features a large white tub sitting underneath a window, surrounded by light-colored walls. The bathroom also includes a sink and a mirror, providing an attractive and functional space for bathing and grooming. The sink is situated on the right side of the image, while the mirror is found on the left side of the scene. The tub and sink occupy the center of the bathroom, allowing for efficient use of the limited space. Overall, the bathroom appears to be clean and well-maintained.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toothbrush", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 196341, "caption": "The image features a group of people standing in front of a large clock located in the interior of a historical building. The clock, which can be seen at the very bottom of the image, is positioned in a prominent location within the room. The clock's numerals are in white and can be seen quite well, even from this angle. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "A couple is sitting on a cement path near the street, enjoying a sunny day. In the foreground, there is a pole on which a traffic sign is mounted, indicating a bike ride. The pole is also adorned with a \"left right\" and \"none\" sign. A few people can be seen in the background, walking around and interacting with one another, while a few vehicles are parked nearby on the street. The scene captures a pleasant atmosphere with a couple enjoying a bike ride and other people going about their day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 455610, "caption": " In the image, there is a large red and green motorcycle parked on the street in front of a building, catching the attention of people around it. There are several individuals visible in the scene, some of them standing closer to the motorcycle while others appear further away. They might be enjoying the sight of the motorcycle or simply passing by.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 39053, "caption": "The image displays a table filled with various food items. There is a pita bread and a bowl of creamy sauce, along with an assortment of cups, bowls, and a vase arranged on the dining table. Among the cups, there is one placed near the bowl of sauce, while two more are positioned closer to the left side of the table.", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "cell phone", "apple"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.42857142857142855}}, {"image_id": 493862, "caption": " The image depicts a person standing on a grassy hill, holding a tennis racket and looking up at the sky. The person appears to be wearing a red shirt and has a backpack resting at their back. There is another person visible in the distance, but they are quite far away from the main subject. The overall scene appears to be an open outdoor space with a person enjoying their surroundings.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["person", "tennis racket", "dog"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 269873, "caption": " The image depicts a black and white photo of a group of people walking down the sidewalk in front of a bus stop. There are many people scattered across the image, some standing closer to the bus stop, while others are walking further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 524656, "caption": "In a small kitchen, a microwave oven is placed at the top, along with a toaster oven. There is a refrigerator located near the oven, which is also situated at the top. There are two sinks in the kitchen, both situated at the front, one slightly to the left and the other closer to the right side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toaster", "cup", "knife", "oven", "refrigerator", "bowl", "spoon", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.45454545454545453}}, {"image_id": 369547, "caption": "The scene features a black and white dog sitting on a beach, surrounded by pebbles. The dog is looking at its owner who is standing behind it, as if waiting for attention. The background reveals a vast ocean, and other birds can be seen scattered throughout the area. The presence of these birds gives the impression that the dog is not alone, and they are enjoying the time together on the beach. Overall, the image captures a charming scene on the sandy beach.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "dog", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": "According to the picture, a man is standing in front of a refrigerator, holding a bottle. He is wearing a black shirt and glasses, and he has a big smile on his face. He is looking at the refrigerator with an open bottle, possibly because he forgot to close the bottle or left it open for later use. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bowl", "bottle", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 128119, "caption": "According to the picture, a white city bus is parked on a city street, with people standing nearby. The bus has a few passengers on board, and it is stopped at a corner. The street is quiet, with no cars in sight. There are two traffic lights along the street, one closer to the left side and another further to the right. The bicycle can be seen parked alongside the bus, and a few bicycles are parked in the vicinity, with one close to the front of the bus.", "mscoco_hallucinated_words": [["car", "car"], ["traffic light", "traffic light"]], "mscoco_gt_words": ["bicycle", "bus", "handbag", "cell phone", "backpack", "person"], "hallucination_idxs": [45, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 401027, "caption": "According to the picture, there is a sandwich made with cheese, cucumber, and lettuce on a plate. The sandwich appears well-prepared and looks appetizing. A variety of vegetables, including carrots, are scattered throughout the plate. The table is adorned with a vase, and there are candles located next to the vase, as well as a wine glass near the bottom of the table. In total, there are six cups positioned around the table, which could be used for drinking or serving purposes.", "mscoco_hallucinated_words": [["carrot", "carrot"], ["vase", "vase"], ["vase", "vase"], ["wine glass", "wine glass"]], "mscoco_gt_words": ["cup", "knife", "dining table", "sandwich", "fork"], "hallucination_idxs": [35, 49, 59, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6}}, {"image_id": 253630, "caption": "According to the picture, a living room is situated next to a large window. In the center of the room, there is a dark brown dining table sitting on a hardwood floor. The table has a glass top, and it is surrounded by various items like a television on the right side and another piece of furniture nearby. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "chair", "tv", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 326021, "caption": "According to the image, a woman is sitting at a table with a teddy bear placed on it. There is another teddy bear nearby, suggesting it might be part of the event. The table is covered with a red tablecloth, and the woman is sitting on the left side of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317320, "caption": "The image portrays a busy street scene with a man and a bicycle passing by. Several people are walking on the street, and a couple of motorcycles are also present in the scene. One of the motorcycles is situated behind the man on the bicycle, while the other one is closer to the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 507171, "caption": "According to the image, a man is seen riding a surfboard in the ocean, attempting to catch a wave. He is partially submerged in the water as he rides the wave, demonstrating his surfing skills. The waves are caused by a boat in the background, creating some excitement in the scene. ", "mscoco_hallucinated_words": [["boat", "boat"]], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 161227, "caption": "A chair is sitting in the middle of the image, with a remote control placed on it. A second remote control can also be seen on the side of the couch, partially covered by a pillow. There is a laptop visible in the picture, placed near the pillow. The laptop appears to be closed, but a keyboard can be seen on top of it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "keyboard", "laptop", "cell phone", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 25812, "caption": "According to the picture, there is a person skiing down a snowy slope wearing a suit. Another person, possibly on a snowboard, is also visible in the background, heading up the same slope. The first person appears to be enjoying the snowy terrain, and both skiers are wearing appropriate clothing for the conditions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267537, "caption": "According to the picture, several people are sitting on steps in the image. There are three women sitting together and holding umbrellas. They are dressed in long coats and appear to be enjoying the event or outing together. The three women are holding their umbrellas and possibly having a conversation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 195645, "caption": "According to the picture, a group of people, including adults, are gathered together on a grassy hillside. They are sitting at various spots, including several benches, with some people seated closer to the stroller and others more dispersed. The scene appears to be a relaxed, social occasion where people are chatting and enjoying each other's company.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "dining table", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 155671, "caption": "A small town is depicted in the image with numerous cars and trucks on the road at the intersection. There are a total of four vehicles on the scene, including a police car, a few cars, a taxi, and a truck. The police car and taxi are located near the intersection, while the other cars and trucks are spread out in different locations on the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 412894, "caption": "According to the picture, there is a crowd of people walking on the sidewalk in front of a building. Some of them are standing in front of a traffic light at the corner. They are spread out across the scene, covering a large area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "handbag", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 32625, "caption": " There is a table in the image with a variety of food on it. To the right of the table, there is a bowl containing a delicious looking meal, and a plate with some food on it is placed closer to the left side of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "fork", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 18688, "caption": " The image captures a baseball game in progress, with several players involved. A batter is at home plate, holding a bat and getting ready to swing. There's also a pitcher, standing at the mound, as well as several other baseball players standing in different positions around the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 343534, "caption": "According to the picture, there is a small doll or toy next to two stuffed animals. The small doll is dressed in a pink outfit and boots, and there is a teddy bear and another stuffed animal nearby. The teddy bear appears to be the same size as the other stuffed animal. Both dolls are placed near the stuffed animals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "potted plant", "dog", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 370657, "caption": "According to the picture, a group of people is riding on motorcycles. There is a man and a woman in the front, with a young girl behind them. Several baskets can be seen, suggesting that the riders are carrying things or groceries.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": "According to the picture, there is a man sitting in a baseball field, surrounded by a crowd of people. He is holding a hot dog in his hand and eating it while watching the game. The field appears to be an outdoor venue, and many fans are sitting or standing nearby. There are several people in various positions around the field, watching the man eat and enjoying the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460347, "caption": "According to the picture, a city street or highway features multiple lanes, with a combination of buses and cars traveling through them. There are several cars dispersed across the scene, driving in different directions, while buses can be spotted traveling both in front of the buses and in the background. The presence of various vehicles on the street suggests that it is a bustling transportation route.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 398753, "caption": "The image shows a large crowd of people gathered around a baseball field, watching a baseball game in progress. The field is filled with baseball players, with a baseball bat held in one player's hand as they stand on the field. The crowd, consisting of many spectators, watches the game with excitement and engagement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 67833, "caption": "According to the picture, a large pizza with various toppings, such as fish, olives, and onions, is being served on a table. The pizza is placed on a white plate, and several bowls and cups are also placed on the table. A bottle of red wine is positioned in the middle of the table, along with several glasses. There are two forks and two knives on the table, and two people can be seen in the background, potentially enjoying the food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8888888888888888}}, {"image_id": 272880, "caption": " In the image, there is a pizza with the label \u201cMystic Pizza\u201d, which appears to be the only item on the plate. The pizza is cut into various pieces, with one piece close to the right side of the plate and another one close to the center of the plate. There is also a fork on the table, possibly used for serving the pizza. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 254368, "caption": "The image features a man in bed eating food. He is sitting on the bed and holding a plate with food on it. The man is eating while the food is on the plate in front of him. Additionally, there is another piece of food placed on the bed, potentially placed there before the man finished his meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": "According to the picture, there are multiple baseball players involved in a game, including a young man in the center, who is swinging at a baseball while holding a bat. Two other players are positioned behind the batter, one closer to the left and another closer to the right, watching the batter's movements.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 459141, "caption": "According to the image, a woman is riding a bicycle while wearing a conical hat, carrying a large bag on the bicycle's back rack. The woman appears to be in the middle of a street or sidewalk, possibly on her way to or from a store. There is also a small pile of items near her bicycle, which could be additional grocery items or items she is carrying with her.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image depicts a tarmac filled with multiple vehicles, including an airplane parked near a ramp. A variety of vehicles are involved in loading or unloading the airplane, including trucks, vans, and cars. There are five trucks in the scene, with three on the left side of the airplane, one near the middle, and one on the right side. A van is also located on the right side of the airplane, while a car is positioned towards the middle of the tarmac.", "mscoco_hallucinated_words": [["van", "car"], ["car", "car"], ["van", "car"], ["car", "car"]], "mscoco_gt_words": ["truck", "suitcase", "handbag", "airplane", "person"], "hallucination_idxs": [34, 37, 70, 84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 227031, "caption": " The image portrays a man and woman, dressed in formal attire, standing next to a multi-tiered cake, likely during a special occasion like a wedding or anniversary. They appear to be joining the cake together, likely cutting it for the first time as a couple. The cake is positioned on a dining table, with a bottle visible in the background. ", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["knife", "dining table", "tie", "potted plant", "cake", "person"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 209347, "caption": "According to the picture, a woman is sitting on a subway train, looking at her cell phone. She is wearing a black jacket and has a purse, and she is surrounded by other passengers. Some people are nearby, while others are sitting further back in the train car. A couple of bottles are also visible, likely placed on the train's surfaces or held by some of the passengers.", "mscoco_hallucinated_words": [["car", "car"], ["bottle", "bottle"]], "mscoco_gt_words": ["handbag", "tie", "train", "cell phone", "person"], "hallucination_idxs": [53, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.8}}, {"image_id": 523292, "caption": "According to the picture, a fluffy cat is sitting on a table near a laptop, looking at something on the screen of the laptop. The cat is spread out on the table and the laptop is placed in front of it. It seems like the cat might be peering closely at the laptop's screen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "dining table", "laptop", "cat", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 12375, "caption": "According to the picture, the image features a white plate filled with a variety of foods. The plate has carrots, potatoes, and some other vegetables, along with some meat and rolls. The food is placed on a table, with multiple meat balls and rolls on the dining surface. A cup can also be found on the table, completing the meal.", "mscoco_hallucinated_words": [["ball", "sports ball"], ["cup", "cup"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [48, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 13550, "caption": "According to the picture, it shows a woman sitting on a rock next to an elephant, petting it while it is standing on the rock. The woman seems to be enjoying the company of the elephant and has her arms around it. There are several bottles on the ground, spread throughout the scene, some of which are near the woman and elephant. The overall atmosphere of the scene suggests a relaxing or joyful environment.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["person", "handbag", "elephant"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": "According to the image, a young man is performing a trick while riding a skateboard on a city street. The man is actively in motion, catching some air and likely trying to show off his skills to the bystanders. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": "A man is standing in a long, narrow corridor, looking at his cell phone as he walks. He appears to be concentrated and focused on the device, possibly reading messages or checking for updates. There are also two other people in the scene, one near the middle of the hallway and another closer to the right side. They both appear to be in the background and are not as prominent as the cell phone user.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 209925, "caption": "According to the picture, an image depicts a baseball player standing on a field, getting ready to pitch a baseball. The player is wearing a baseball uniform, and has a baseball in his hand. He is shown on the mound, getting set to throw the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 2154, "caption": "According to the image, a shopping cart is parked near a bench and a dining table. There is a trash can and a few other items scattered around the area. There are also several people nearby, sitting on the bench and possibly enjoying a meal at the table. Additionally, there are multiple dining chairs placed around the dining table. The scene appears to be located in a public space, possibly a street or a sidewalk.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["chair", "chair"], ["table", "dining table"]], "mscoco_gt_words": ["car", "umbrella", "bench", "person"], "hallucination_idxs": [16, 51, 59, 64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.5}}, {"image_id": 274931, "caption": "The image is a black and white photograph of a street scene featuring several parked cars, all of which are old cars. They are all lined up along the curb of the street, creating a line of parked cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "According to the picture, there is a long dining room table situated between two rooms: a kitchen and a living room. There are chairs of various heights on each side of the table, and they can seat more than six people. Several potted plants are situated around the dining area, adding a touch of natural beauty. The kitchen contains a dishwasher, oven, and a microwave, along with multiple bowls placed on the counter. The living room features a television set in the center of the room, and a vase on the left side of the screen. The rooms appear clean and organized, making it a comfortable and inviting space for various activities.", "mscoco_hallucinated_words": [["person", "person"], ["bowl", "bowl"], ["television", "tv"], ["vase", "vase"]], "mscoco_gt_words": ["clock", "knife", "oven", "dining table", "potted plant", "sink", "microwave", "chair"], "hallucination_idxs": [44, 78, 89, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.36363636363636365, "Recall": 0.625}}, {"image_id": 233888, "caption": "According to the picture, there is a young boy playing baseball in front of a television. The boy is holding a baseball bat and has an outfielder's glove on his left hand. The TV is placed on the right side of the image and appears to be showing a baseball game, perhaps providing a virtual experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "potted plant", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 213905, "caption": "According to the picture, there are two women standing on a grassy field. They appear to be friends and are smiling at each other, possibly in the middle of a conversation. Both women are wearing different outfits \u2013 one is wearing a white and blue top, while the other is wearing a black and white outfit. They seem to be enjoying their time on the tennis court, possibly during a social or informal match before a formal event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "tennis racket", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 277479, "caption": " The image depicts a young girl, possibly a toddler, walking towards a refrigerator in a kitchen. She is holding a bottle in her hand and has a toy in her other hand, possibly about to drink or eat. The refrigerator has several bottles in it, with one located near the middle and others further down towards the left side. There are two people present in the scene, a mother figure on the left and another person on the right, possibly a caregiver. They appear to be observing the young girl in the process of approaching the refrigerator.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 300784, "caption": "According to the picture, it features a group of children sitting on a bench in front of a large dog. The dog seems to be a focus of attention for the children, as they appear to be looking at the animal. One of the children is holding a drink, while the other two are holding something that appears to be a book. The dog is seated next to them. It appears to be a fun and engaging environment for the children.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["person", "bench", "dog", "couch"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image displays a dessert, likely a cheesecake, decorated with fresh berries and on a plate. There are two pieces of cake on the plate \u2013 one larger one and another smaller one. The dessert is likely to be perfect for a special occasion or as a treat to indulge in. In the background, there is a table with a couple of glasses on it, indicating that this might be a party setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 169169, "caption": " According to the image, there is a street scene showing a sign advertising an event, likely the Springs in the City event, with a green banner and a poster. The banner is visible above the sign and the poster appears to be promoting the event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "potted plant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 409346, "caption": "According to the picture, a group of people, including a woman and a man, are standing around a table, enjoying a meal. The woman is holding a knife while the man is holding a fork. There is a cake in front of them, and various snacks and appetizers, such as cupcakes, are spread out on the table. The table is covered with plates, cups, and silverware to ensure that the guests can enjoy their food. The scene portrays a relaxed social gathering or a business meeting over a meal.", "mscoco_hallucinated_words": [["knife", "knife"], ["fork", "fork"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "spoon", "cake", "person"], "hallucination_idxs": [32, 39, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.23076923076923078, "Recall": 0.6}}, {"image_id": 522262, "caption": " The image captures a sidewalk where many bicycles are parked, lined up in a row parallel to the sidewalk. There are different types of bicycles, such as two small bikes, two medium-sized bikes, and two large bikes. Some of the bikes are closer to the sidewalk, while others are spaced further away. The bikes are parked along the sidewalk, and there is a path or walkway on the left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "According to the image, a man is holding a yellow ball in his hand and preparing to throw it against a wall. He is standing in front of glass, and the ball is floating in front of the glass, giving the illusion of being held up by the glass. The man is in the foreground, visible through the glass.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "According to the picture, there is a small wooden toy, or \"cube,\" positioned on the edge of a laptop. The toy is perched above the keyboard and is looking down at the laptop screen. The image appears to be taken in front of a window, adding a natural and relaxing background.", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": "According to the picture, there are two porcelain toilets sitting on the stairs. They are placed side-by-side on the steps, creating an unusual and interesting sight. The toilets appear to be old and in need of maintenance or repair, which is likely why they are sitting on the stairs. There are no other objects or people in the scene, focusing the attention solely on the two porcelain toilets on the stairs.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["sink", "bowl", "toilet"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 465601, "caption": " According to the picture, a group of people, including women, is standing on a sidewalk, looking at a building in the background. The people are wearing hats and holding umbrellas, suggesting that they might be protecting themselves from the rain. The sidewalk appears to be lined with streetlights. In addition to the people, there are several handbags scattered around the scene, belonging to some of the individuals in the group. One of the people is wearing a gray jacket and has a hat and an umbrella.", "mscoco_hallucinated_words": [["streetlight", "traffic light"]], "mscoco_gt_words": ["handbag", "umbrella", "bench", "person"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 300124, "caption": "The image depicts a small green chair sitting on the side of a road, near a curb. The chair appears to have a piece of plywood resting on top of it, likely serving as a makeshift table. There is a microwave located in front of the chair, placed on the ground and possibly on a base, suggesting a small sitting area or a makeshift kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "couch", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 46919, "caption": "According to the image, a man is walking uphill on a snow-covered hill. He is wearing a backpack and using a pair of snowshoes to traverse the snow-covered surface. The hill has a relatively steep incline. The man appears to be alone in the scene, with no one else present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 434022, "caption": " The image portrays a white train engine traveling down a train track, with smoke billowing from the stack. The long train consists mainly of black and gray railroad tracks. Some smaller, individual train cars can be seen dispersed along the length of the train, including two on each side of the main engine. Overall, it appears to be a significant train traveling through a rural area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 490491, "caption": "The image captures a city street scene with a cement wall and a building situated near the sidewalk. Two people can be seen standing in front of the cement wall, possibly passing by the area or having a conversation. A street light can be seen at the left side of the scene, while a few signs are scattered around, indicating various signs and road regulations. The area appears to be quiet and peaceful, with no cars or other vehicles visible on the street.", "mscoco_hallucinated_words": [["street light", "traffic light"]], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 175878, "caption": " In the image, there is a group of four people, including a young woman and two men, who are standing and talking near a balcony. They appear to be having a good time together, engaged in a casual conversation. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 12669, "caption": "In this image, a young woman is standing in a room surrounded by people, holding a wine glass. She appears to be smiling and engaged in conversation with the other attendees. The room looks like a social gathering or an event. The scene captures a moment in the day at a wine-themed gathering, as people interact and enjoy their time together. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "handbag", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 192496, "caption": "The image shows a blue train traveling on train tracks, sitting between two passenger cars in front of it. The train appears to be parked and waiting for another train to continue on its journey. The passenger cars, one positioned closer to the left side and the other near the right side, are not as large as the train itself, but they still contribute to the overall scene. The train track extends across the entire image, with portions of it near the front and back of the trains.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 133146, "caption": "According to the picture, there is a spacious living room with a fireplace and chairs in it. The living room is filled with comfortable furniture such as a couch situated near the fireplace and two chairs placed around it. The room appears to be tidy and well-maintained, with books and a clock visible on a table. There are three vases of different sizes, with one on the left side of the room, one in the center, and another on the right side. The room has a hardwood floor and is decorated with plants and a book collection.", "mscoco_hallucinated_words": [["clock", "clock"], ["table", "dining table"]], "mscoco_gt_words": ["tv", "vase", "book", "chair", "couch"], "hallucination_idxs": [55, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.8}}, {"image_id": 53744, "caption": "In the image, there is a plate with sliced tomatoes and lettuce. Two sliced tomatoes are placed on the plate, and a knife is next to them, possibly being used to prepare or eat the vegetables. Additionally, a sandwich can be seen in the background, likely being the source of the vegetables.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375902, "caption": "The scene depicts a man, dressed in a white ski suit, crossing the finish line after participating in a skiing competition. He is wearing blue ski boots. The man is holding blue skis, which he is crossing over with the finish line.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": "According to the picture, the scene is set in a crowded room where multiple people are present. The focus of the image is a young man, standing at the right side of the room, holding a cell phone to his ear. He is possibly talking on the phone or answering it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bird", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 52435, "caption": "A group of people is spending time at a beach, with some of them paddleboarding in the ocean. The scene features a few surfers, including one paddleboarding person, and many other people watching them from the shore or walking along the beach. The water level seems to be fairly high, with a couple of individuals standing near the waterline, while others are walking along the beach. The scene showcases a lively beach atmosphere, with people enjoying various activities and the sandy terrain.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "umbrella", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": " In this image, a cow is standing in front of a store, possibly waiting to be sold. The store is filled with a variety of sodas, with some bottles closer to the foreground and others positioned further back. The sodas appear to be sold in a store or market, and there are multiple bottles visible in the picture. The cow is the main focus of the scene, positioned close to the display of sodas.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 261097, "caption": " The image features a tennis court with a lot of chairs arranged around the court. Several people are present in the scene, including a tennis player with a blue shirt and white shorts who is walking across the court. He appears to be holding a tennis racquet and may be either the referee or one of the players. The other people in the scene are spectators or teammates, some sitting near the chairs and others standing around the court. Overall, it looks like an event involving a sport, possibly a tennis tournament, where people are watching or participating.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 227204, "caption": " In the image, four women are sitting around a table, enjoying a meal together. They are laughing and smiling as they chat and enjoy the food they are sharing. The table is filled with various cups, bowls, and utensils, which suggests that they are indulging in a variety of breakfast items like bananas, milk, and cereal. The breakfast items can be seen scattered around the table, including multiple bananas, a bowl with an unknown item in it, and a cup of milk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cell phone", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 353981, "caption": "The image is of an old airplane parked on the runway. There are three people standing near the plane. It appears to be a black and white photograph. The people are not wearing uniforms or any other distinctive clothing. The airplane is parked on the right side of the image. The people are standing on the left and right sides of the plane.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 185327, "caption": "The image depicts a group of elephants walking down a street. There are five elephants walking in a line, with a vehicle in the background and another elephant in the far right of the scene. The elephants appear to be walking toward a bus or an adjacent building. There are also people in the scene, with three individuals visible. Two are walking on the sidewalk to the left of the elephants, while the other person is closer to the parked car and walking on the street near the animals.", "mscoco_hallucinated_words": [["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"], ["elephant", "elephant"]], "mscoco_gt_words": ["horse", "bus", "cow", "car", "person"], "hallucination_idxs": [6, 15, 29, 39, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.625, "Recall": 0.4}}, {"image_id": 348654, "caption": "A white cell phone is sitting on a desk in front of a computer. The phone has various buttons and a speaker, and it appears to be connected to a keyboard. The keyboard has its own screen and has a mouse nearby, placed to the right of the phone. The phone seems to be the focus of the scene, with the computer serving as a backdrop.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["keyboard", "keyboard"], ["mouse", "mouse"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [32, 35, 43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.75}}, {"image_id": 482432, "caption": "According to the picture, there is a black and white image showcasing a toothbrush and toothpaste sitting next to a sink. The toothbrush is positioned closer to the center, while the toothpaste is to the left. The sink is located near the top of the image, and a cell phone can be seen on the left side, just above the toothpaste.", "mscoco_hallucinated_words": [["cell phone", "cell phone"]], "mscoco_gt_words": ["toothbrush", "sink", "bowl"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": "The image features a delicious pizza that is being enjoyed by a couple at a table. The pizza is cut into slices, placed on a white plate in the center of the table. Near the pizza is a fork, which may be used for serving or eating. Additionally, the table also has a knife positioned close to the plate of pizza. The atmosphere is casual and relaxed, with the couple enjoying their food and wine.", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["wine glass", "pizza", "fork", "dining table"], "hallucination_idxs": [59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 144723, "caption": "According to the picture, a small dog is flying through the air while catching a frisbee. The dog appears to be leaping from the ground into the air, excitedly catching the flying disc. The action takes place in a large, open area, allowing for the dog's movements to be showcased. There are also a few other dogs in the scene, although they are not quite as prominent as the main dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "frisbee", "car", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "The image shows a person holding a cell phone in their hand, displaying a large screen. The phone appears to be black, with buttons on the front. The cell phone is being used, and the person may be speaking or receiving a call. There are several chairs visible in the background, indicating that multiple people may be present in the space.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["book", "cell phone", "person"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "The image depicts a couple of video games laid out on a table. There is a small white Nintendo Wii game controller, along with a book, which could be a guide or manual for the gaming system. There is a remote next to the controller, possibly belonging to another device.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "remote", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 203110, "caption": "The image captures a large, orange-colored trolley traveling down a city street, passing by buildings on both sides. There are several people visible in the scene, some walking on the street and others standing nearby. There are also a couple of traffic lights in the picture, indicating that this street may be a main thoroughfare.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["truck", "bus", "train", "orange", "car", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 556478, "caption": " According to the image, two pizzas are being cooked on a large oven surface, with one being more finished than the other. The finished pizza is placed in a pan, and the other pizza is still on the tray, indicating that the second pizza is being cooked further.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "oven", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 103815, "caption": "The image captures a moment of action in a baseball game, with a baseball player on the mound, pitching. There is a baseball player wielding a baseball bat, likely trying to hit the pitched ball during the game. In addition to the main players, there are other people visible in the scene, either waiting for their turn to play or cheering from the sidelines.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": " The image features a bed or beds in a room. A white and grey pillow sits atop the mattress. A cup and a saucer are placed nearby on a table or surface.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["backpack", "suitcase", "bed", "dining table"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 268854, "caption": "According to the picture, the scene is of a group of people standing on the beach near water, enjoying a sunny day. There are multiple people, possibly a group or a family, spread out in various positions along the beach. They appear to be moving around the area, possibly interacting with each other. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "person", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 34464, "caption": "In the image, a group of people, including children and adults, are standing on the beach enjoying themselves. One person is holding onto a red inflatable kayak, while a surfboard can be seen floating near the water. The group appears to be having a good time, interacting with each other and playing in the ocean. They are likely spending a fun day at the beach, enjoying the outdoor activities that the coastal area has to offer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 495513, "caption": " The image shows a group of people gathered outside at night, enjoying themselves at a party or event. There are two main individuals, a man and a woman, standing close together and holding hot dogs over an open fire. They are dressed in warm clothing, with the woman wearing a blue coat and the man wearing a jacket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": "The image depicts a dog sleeping on one side of the bed, while a cat is sleeping on the other side, creating a cozy atmosphere in the room. The bed is adorned with a red blanket, adding a warm and inviting ambiance.", "mscoco_hallucinated_words": [["cat", "cat"]], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": " The scene features a cat sitting on a flat-screen television in a room. The cat is looking towards the television, which is on. To the left of the cat, there is a small table that may be holding various electronics, such as a television remote. A few books are placed in different parts of the room, including one in the upper right corner and another closer to the cat. There is also a plant visible in the scene, located to the right of the cat and above the table.", "mscoco_hallucinated_words": [["table", "dining table"], ["remote", "remote"], ["book", "book"], ["table", "dining table"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [37, 49, 53, 97], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The scene takes place on a street, with an intersection at the end of the road. There is a large, white statue of a wolf near the intersection. Also present is a traffic light at the intersection. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "traffic light", "cat", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 406611, "caption": "According to the picture, there is a large group of people gathered around a ski slope. Many of them are watching skiers and snowboarders as they navigate down the mountain. The people are standing near the snowy slope, chatting and enjoying themselves. Some are wearing helmets, further highlighting the exciting experience for those who enjoy winter sports. The scene depicts a lively and energetic atmosphere, with people interacting and participating in various activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": " According to the picture, a white truck, or possibly an RV, is parked on a street next to other vehicles. There is an air conditioning unit on the truck, and three satellite dishes are located on top of the truck and behind it. Several people are standing around the truck, likely waiting for work to be done on the RV, or watching the workers as they go about their tasks. They appear to be positioned at various distances from the truck.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "remote", "tv", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "According to the picture, a man and a woman are sitting on a patio, enjoying a meal together. They are seated around a table filled with various food items, including slices of pizza and a couple of beverages, likely drinks. The woman appears to be eating a slice of pizza, while the man is holding a piece of food. The setting seems relaxed, with a comfortable atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "dining table", "cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 69757, "caption": "According to the picture, the scene features a group of people gathered around a table celebrating a special occasion. Three individuals can be seen, including a woman and a man, as well as a child sitting at a chair nearby. A delicious cake has been placed on the table, decorated with colorful candies. The woman appears to be cutting the cake, as she is holding a knife in her hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "cake", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 432213, "caption": "The image features a plate of food, which appears to consist of a sandwich and French fries. The sandwich is likely a cheeseburger, as it appears with a burger-like shape and features a couple of meat items. The French fries appear as a side dish or part of the burger.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "sandwich", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 368367, "caption": "According to the picture, there is a young woman sitting at a table in a restaurant, smiling as she talks on a cell phone. The phone is positioned on the right side of her face, and she appears to be talking to someone while enjoying her meal. In the background, there are a couple of other tables that are closer to the camera, one on the left side and the other on the right side. There are also a couple of chairs placed nearby, one closer to the left and another one towards the right side.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [13, 62, 89], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "The scene features a stretch of sand, possibly dirt or mud, piled along the side of a street. The sand appears to be covering a significant portion of the roadway, making it difficult to see the actual road surface. Several traffic lights can be observed scattered around the scene, as well as street signs, one of which can be seen between the traffic lights and the sand pile.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 92131, "caption": "In the image, there are several people standing near a large green bus parked on a street. A woman is among the group, and a man can also be seen in the scene. The group of people appears to be waiting for the green bus to start or simply observing it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "handbag"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": " The image shows a young woman, laying on the floor, and holding a red backpack. She is smiling and appears to be happy, perhaps after unpacking or taking a break from traveling.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "A baseball game is being played in the image. In the foreground, a baseball player is standing at home plate, holding a baseball bat in his hands and ready to hit the ball. A baseball glove can be seen nearby, placed on the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 160183, "caption": " According to the picture, a fire truck is parked in front of a building with a large fire hose extended from the back of the truck. The truck is surrounded by a group of people, and there are a few individuals standing near the fire truck.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 555456, "caption": " According to the image, it is snowing at a ski resort with a group of people enjoying the slopes. There are at least six people on the slope, some of them are skiing and some are just standing. Two snowboards can be seen on the slopes, which may indicate that some of the skiers have decided to use snowboards instead. The scene appears to be set against a forest, with some trees visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 235595, "caption": "According to the picture, there is a group of motorcycles parked along a busy street. The motorcycles are lined up in a row, with some of them having sidecars attached. The scene captures the busy street and the parked motorcycles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "motorcycle", "potted plant", "car", "parking meter", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 489909, "caption": "According to the picture, there is a variety of different fruits and vegetables placed together in a basket. The basket includes oranges, bananas, and a couple of medium-sized potatoes, all laid out in a visually appealing and inviting arrangement. The oranges are placed towards the left side of the image, while the bananas and potatoes occupy different parts of the basket. Overall, the image captures the essence of a colorful and bountiful harvest.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": " The image portrays a large vase or a flower pot filled with a variety of pink and purple flowers, creating a beautiful arrangement. The flowers are spread out evenly across the vase, with some larger flowers standing out more prominently. The vase is placed on the surface, most likely a table or a countertop. It appears to be the main focus of the image, capturing a festive, colorful atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 405183, "caption": "According to the picture, the scene depicts a young girl standing in front of a group of people, holding a remote, possibly a Wii remote. The group of people, which consists of both men and women, is gathered around her, watching her. It seems that she is about to demonstrate something, such as playing a video game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "handbag", "tv", "potted plant", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 409138, "caption": "In the scene, there is a white and red bus parked on the street, surrounded by variously sized trees growing along both sides of the road. A number of cars are parked nearby, with some closer to the sidewalk and others further back along the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 411840, "caption": "According to the picture, there are two women walking on a grass tennis court. One of the women is wearing a white tennis outfit, while the other is wearing a blue tennis skirt. Both women are holding tennis rackets in their hands, one of them is close to the right edge of the court and the other closer to the middle. They appear to be walking along the court in a comfortable pace, possibly walking back and forth during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "backpack", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": " The image depicts a close-up view of a pizza slice on a plate. The slice is covered with cheese, greens, and bacon. There is another piece of pizza visible in the upper left corner of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": "In the image, there is a teddy bear wearing headphones, placed on top of a table or surface. The teddy bear is surrounded by various items, including books, a few located near the teddy bear itself and the rest scattered across the room. There are also multiple bottles, likely containing drinks, scattered in different locations in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "teddy bear", "bear", "dining table", "bottle", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 399058, "caption": "The scene takes place on a road with an elephant riding on the street, with a person on its back. The elephant is quite large, and the person is sitting on its back, facing towards the right side of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "elephant", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 183571, "caption": "According to the picture, a man is sitting on a boat and playing with a dog, which is sitting on the floor of the boat. The man and dog appear to be interacting, possibly playing games or having fun together. There is another person on the boat, but they are not part of the activity with the dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "The scene features a large, beautifully-designed motorcycle parked on a red carpet. The motorcycle is prominently displayed, taking up a significant portion of the display area. There are several people in the background, likely viewing the exhibit and discussing their interest in the motorcycle. Some are closer to the motorcycle, while others are located further away, making the atmosphere feel lively and interactive. The arrangement gives a sense of the event being a public showcase or a trade show, with a mix of people and fascinating vehicles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437910, "caption": "According to the picture, the scene depicts a bathroom situated within a larger room. The bathroom is small and has a toilet and sink, along with a door on both sides. The sink is located closer to the toilet, and both the toilet and sink are within the larger room. The room appears cluttered, making it appear messy and disorganized.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": " The image features a picturesque scene of a village situated on the water, with several boats docked along the shore. The village is surrounded by houses and a few birds can be seen flying above, creating a lively atmosphere. There are a couple of seagulls perched on a roof, providing a further sense of life in the scene. Overall, the image captures the essence of a coastal village, with boats docked and seagulls flying overhead.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 467887, "caption": " In the image, a man wearing a red shirt sits at a desk using a laptop, which is displayed in front of him on the table. There is another laptop in the background, situated on a lower part of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "mouse", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 174402, "caption": "The image features a man wearing a blue and white tennis shirt and holding a tennis racket, standing on a tennis court. He is posing for the camera, looking focused on his upcoming game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 352734, "caption": " The image depicts a young boy sitting at a dining table, eating ice cream or other sweet treats. The child is enjoying his meal with a fork in hand, biting into a piece of chocolate or another dessert. There are several people in the scene, including other children, a man, and a woman. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 346716, "caption": "According to the image, there are two red double-decker buses driving down a city street in the city. The buses seem to be running parallel to each other as they travel along the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": "According to the picture, there are two men standing on a snowy slope, looking at the view. One of the men is wearing a backpack and the other is wearing a pair of red pants. They are both holding ski poles, possibly while skiing or preparing to ski down the slope. A person with a camera is also present in the image, likely capturing the moment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 295656, "caption": "According to the picture, there is a group of people standing together, dressed in formal attire. They appear to be enjoying a meal, as one man is holding a plate of food. One woman can be seen holding a slice of pizza, while another is sharing a piece of cake with someone else. Another slice of cake is present, and there are two other men in the group, adding to the social atmosphere.", "mscoco_hallucinated_words": [["cake", "cake"], ["cake", "cake"]], "mscoco_gt_words": ["pizza", "clock", "tv", "tie", "car", "chair", "person"], "hallucination_idxs": [55, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.2857142857142857}}, {"image_id": 129886, "caption": "The image shows a group of people skiing down a snowy slope, surrounded by snow-covered mountains. There are at least nine people in the group, with some standing at the base of the hill, while others are skiing down the slope. Some of the people are wearing skis and are skiing down the hill, while others are dressed in warm clothing and seem to be enjoying the snowy scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, a young woman is sitting at a wooden table, enjoying a bottle of wine and a bottle of juice. She appears to be drinking and possibly writing a letter or journal entry. The table is located in a green, lush environment, such as a park or a garden, which adds to the natural atmosphere of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 401010, "caption": " The image features a room with a large bed positioned in front of a window. The bed is neatly placed and appears to have white sheets. There is a desk or counter close to the bed, possibly for a student or office setting. The room also has a television set, and there is a chair on the other side of the bed. Additionally, a mirror can be seen hanging on the wall.", "mscoco_hallucinated_words": [["desk", "dining table"], ["student", "person"]], "mscoco_gt_words": ["sink", "chair", "bed", "tv"], "hallucination_idxs": [31, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 567562, "caption": "According to the picture, there is a family of three, a woman and two young girls, sitting at a table and enjoying a meal together. They are surrounded by various pizzas, salad, and drinks. There are two bottles of soda on the table. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "dining table", "bowl", "spoon", "bottle", "fork", "broccoli", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.36363636363636365}}, {"image_id": 313928, "caption": "The image depicts a balcony overlooking the ocean with a beach umbrella and surfboard, creating an enjoyable atmosphere on the balcony. There is a small dining table and a chair set out, offering a comfortable seating arrangement to view the scenery. In the foreground, there are two chairs positioned next to each other and facing towards the ocean view, providing a pleasant seating area for people to enjoy their time on the balcony.", "mscoco_hallucinated_words": [["umbrella", "umbrella"], ["person", "person"]], "mscoco_gt_words": ["surfboard", "dining table", "bottle", "wine glass", "chair"], "hallucination_idxs": [11, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6}}, {"image_id": 490284, "caption": " A black dog is sitting on a couch, leaning on a person's lap and resting its head on the person's arm. The dog is positioned towards the left side of the image, while the person is on the right. A few more people are also visible in the scene, sitting in chairs or couches at different angles. The image has a warm, cozy feel, and the presence of others suggests a shared living space or a social gathering.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["couch", "person", "dog"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 346940, "caption": "In this image, there is a hot water bottle steaming on the counter of a kitchen. The bottle is located in the middle of the counter, surrounded by various kitchenware. There are two kettles in the scene, one near the hot water bottle and the other further to the right, possibly containing water for boiling. Additionally, there are several bottles spread throughout the kitchen area, adding to the overall clutter and atmosphere of the image.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [9, 19, 47, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 222782, "caption": "The image is of a black and white scene featuring a sign that says \"AMOCO\" in the middle of a road. There is a person in the background walking towards the left side of the image, and there are several people standing near the sign. The street appears to be empty, and there are no cars or other vehicles in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 255934, "caption": " According to the picture, there are two elephants in the scene, one larger and one smaller. The larger elephant is walking alongside the smaller ones, possibly leading the way. The elephants are in a dirt-covered area, possibly a zoo or a preserve. There is a handbag in the scene, which may be the handbag of the person in the picture or just an object belonging to someone else. The image captures the close relationship between the large and small elephants.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 278627, "caption": "According to the image, there are multiple photos showing different dining experiences. In one photo, a man and a woman are sitting at a table, surrounded by various food items. There are bowls of soup, various fruit dishes, a plate of seafood, and a large salad with vegetables. Another photo features a man holding food, possibly chicken or fish. A third image displays a group of people, including a man and a woman, eating various dishes. ", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "potted plant", "bottle", "fork", "chair", "person"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.25}}, {"image_id": 400851, "caption": "In the image, there is a large group of people enjoying a sunny day at the beach. Some are lounging on the sand, while others are relaxing on a bench. There is also a table set up near the beach, where people are standing around or sitting on chairs. The group is spread out, with individuals scattered across the beach and the area around the table.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["bench", "umbrella", "chair", "backpack", "boat", "sports ball", "person"], "hallucination_idxs": [38, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.42857142857142855}}, {"image_id": 472610, "caption": "A woman and her dog are standing near a field surrounded by cows. The woman appears to be smiling, and the dog is wearing a leash. There are at least eight cows scattered around the field, with a few of them located in the background. The woman and the dog seem to be enjoying the scene together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": " According to the image, a group of people is gathered around the beach, enjoying a day at the water. There are multiple boats in the water, including a small boat with four men on it, and a larger boat with a sail, likely a windsurfing sailboat, with a dog on it. The scene captures a moment at the beach with the people spending time there and exploring the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388237, "caption": "The image depicts a group of young people participating in skateboarding activities in the street, with several skateboards present. There is a man in the center of the scene, riding his skateboard while others are skating around him. Some of the skateboards are located closer to the bottom of the image, while others are placed in different locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 289640, "caption": "According to the picture, a group of people is gathered at a dining table with a frisbee on top of it. They appear to be enjoying a drink at a party, laughing and creating a festive atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "frisbee", "orange", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 122602, "caption": "According to the picture, there is a person working on a craft project on a table. He is using a variety of tools such as scissors, a bottle, and a cutting mat. The table is covered in a collection of various items, including scissors and a cutting mat. In addition, there is a few books and a few bottles around the table, indicating a well-stocked craft space.", "mscoco_hallucinated_words": [["person", "person"], ["book", "book"]], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [8, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 24104, "caption": "According to the picture, a person is sitting at a desk with a computer set up in front of them. The desk is equipped with three monitors displaying content: two computer monitors and one TV monitor. There are two cups on the desk, likely for coffee or other beverages, and a book in front of them.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "bed"], "hallucination_idxs": [6], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 0.625}}, {"image_id": 183709, "caption": "According to the picture, there is a group of people gathered at an intersection. The people, estimated to be around 15, are standing in the middle of a crosswalk, looking at a man who is standing near a traffic light. There are cars parked and driving on the streets around the intersection. Some of the people are standing on the sidewalk, and others are standing on the street. The scene is captured during a daytime moment in a city, likely in a downtown area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 59710, "caption": "The image features a black refrigerator with its doors open, showcasing the interior. The refrigerator has several bottles and cups inside, providing a view of a kitchen setup. Additionally, there are a few more bottles visible in the scene, further enhancing the refrigerator's contents. The refrigerator and bottles are set in a bedroom, which can be seen in the background of the scene.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["chair", "refrigerator", "dining table"], "hallucination_idxs": [19, 21, 39, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "According to the picture, a young boy, possibly a baseball player, is in the middle of a field wearing a green and yellow baseball uniform. He is holding a baseball in his hand and appears to be smiling or laughing. There are multiple other people in the scene, but their exact roles or connections to the main subject are not evident.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 314729, "caption": "According to the picture, it is a vintage baseball image featuring a man in a baseball uniform. The man is positioned in a batting stance, holding a baseball bat in both hands. The image appears on a white background, and the man is prominently displayed. The image is a black and white photo, capturing a historic baseball scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": "According to the picture, there is a desktop sitting on a desk with a keyboard and mouse sitting next to it. The computer setup is complete with two monitors, one on the left side and another on the right side of the desk. A cell phone is also present, placed near the top of the desk, to the right of the left monitor. A telephone handset can be seen, too, situated on the right side of the desk, close to the computer. In the background, there are some bottles, including one on the left side and another on the right side, as well as a couple of chairs, placed close to the desk.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["telephone", "cell phone"], ["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse"], "hallucination_idxs": [48, 71, 100, 121], "metrics": {"CHAIRs": 1, "CHAIRi": 0.26666666666666666, "Recall": 1.0}}, {"image_id": 411564, "caption": "According to the picture, a young man is sitting on a chair near a window, looking down at his phone. The window is open, allowing natural light to enter the room. The chair sits near a dining table, which contains various items such as a cup, a bottle, and a bowl. The man is positioned in front of a refrigerator, with a few additional bottles placed around the table. The scene features a mix of kitchen and living room elements, giving it a casual, at-home ambiance.", "mscoco_hallucinated_words": [["cup", "cup"], ["bowl", "bowl"]], "mscoco_gt_words": ["refrigerator", "dining table", "bottle", "cell phone", "chair", "person"], "hallucination_idxs": [51, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 320350, "caption": "According to the picture, there are two people standing on snowy ground with two dogs, one closer to the left and another on the right. The man is wearing skis, and the woman is wearing skis as well. The dogs are wearing skis, too. It appears that they are enjoying a snowy day and outdoor winter activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": "The image captures a baseball game with several players on the field, all wearing baseball uniforms. The batter, wearing a number 10 jersey, is up to bat, holding a baseball bat. Standing near the batter are a catcher, wearing a number 33 jersey, and an umpire. Other players are spread throughout the field, including two on the left side of the image, one on the right side, and another on the far left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bench", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 285302, "caption": " According to the picture, the image features a young boy sitting at a dining table, surrounded by many pieces of paper and various types of scissors. He appears to be engaged in making a paper craft, possibly a card or a sculpture. The dining table is a significant aspect of the scene, along with various chairs positioned around it, some closer to the boy and others further away. The environment suggests a living room or a children's playroom setting, with a couch in the background.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["person", "chair", "scissors", "dining table"], "hallucination_idxs": [91], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 61959, "caption": "The image captures a large city bus driving down the street past various businesses at an intersection. The bus is surrounded by other vehicles on the road, and there is a stop sign on the corner. The intersection appears to be a busy part of the city, as there are several cars and a person crossing the street nearby. The image depicts a typical day in the city with the bus, cars, and pedestrians all moving about.", "mscoco_hallucinated_words": [["stop sign", "stop sign"], ["person", "person"]], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [33, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 229659, "caption": " Two people are sitting in a cafe, one at a table with a laptop in front of him, looking at the screen, while the other people gather around him in a restaurant. The laptop screen shows an image, likely a website or a social media platform, as the individual focuses on his device. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 473118, "caption": "According to the picture, there's a man riding a skateboard on a ramp, performing an aerial trick by jumping off the ramp and flying through the air. The skateboarder appears to be in his mid-twenties and is wearing a blue shirt. There are other people around, either enjoying the ramp or waiting their turn.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 391222, "caption": "According to the image, two people are standing outdoors, wearing skis and standing in a snow-covered area. They appear to be discussing something together, possibly at a signpost, possibly navigating their path through the snowy terrain. The signpost is positioned near the center of the image, attracting their attention. The people seem to be focusing on the information provided on the signpost or using it to help guide their snow-skiing activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416104, "caption": "According to the picture, there is a group of people gathered around various blue dining tables outside, creating a relaxed atmosphere. The people are sitting, standing, and walking around the area, enjoying their time together under large outdoor umbrellas. Some of the people are talking, while others seem focused on their drinks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 511852, "caption": "According to the image, there are two people skiing down a snowy slope, enjoying their time together. One person is wearing a blue jacket and the other is wearing a red jacket. The two skiers appear to be a little distant from each other. There are two sets of skis visible in the image, one pair belonging to the person wearing the blue jacket and another pair belonging to the person wearing the red jacket. They are both equipped for their outing, and it seems that they are having a good time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 59386, "caption": "In the image, a group of giraffes is gathered together in a grassy area, likely a park or zoo exhibit. They seem to be eating leaves off a tree or from the surrounding vegetation, providing them with nourishment. There are at least six giraffes in the group, some close together and some more spread out, with two giraffes standing near a post.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "According to the picture, a street is empty with only a few cars, and a green traffic light is positioned in the middle of the road, near a city streetlight. The light is mounted on a post or a streetlight pole, signaling a green traffic light is present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": "According to the picture, a man and his dog are crossing a street at an intersection, which is heavily populated with cars. The man is riding a bicycle while the dog is walking alongside him, crossing the street at the same time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "dog", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 265628, "caption": "According to the picture, a man is sitting on a toilet located in a small room with a round window. There is another toilet in the room, close to the man, suggesting it might be shared. The man is wearing two pairs of shoes, one closer to the camera and the other slightly lower in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499594, "caption": "According to the picture, a plate of food has been prepared, containing a piece of grilled steak with broccoli. The steak is nicely cooked and accompanied by the green vegetables, which add a nice contrast to the dish. This meat meal is being served on a white plate, likely with a salad or side dishes. The plate of food is positioned on a table, possibly a dining table, where it is ready to be enjoyed by someone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bottle", "fork", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 5934, "caption": "According to the picture, there is a baseball glove, baseball, and baseball bat sitting on a bench. The baseball glove is holding the baseball, creating an ambiance of a baseball game. The baseball and glove are placed close together, emphasizing their connection and essence of a baseball game.", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "According to the image, it is a dark night with blue lights shining on a body of water. A group of people are walking along the river's edge, some of them on the shore side while others are out on the water, likely on a boat. The river or canal appears to be quite long and may be a popular attraction in the city. The blue lights give a unique and lively atmosphere to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": " In the image, several dogs are running together in a grassy field. There are two large dogs, one closer to the left and the other closer to the center, with a small dog, one on the left and another on the right, playing together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bench", "dog", "frisbee", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 255483, "caption": "The image is a black and white photo of a man sitting at a desk, possibly working on a piece of writing. There are several pieces of paper spread out in front of him. One of them has an image of a person on it. There are pens, pencils, and other writing tools near the man. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "person", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 131138, "caption": "According to the picture, a busy office setting is shown with a computer placed on a desk. The computer has a screen, a keyboard, and mouse pad, all situated on the left side of the desk. On the right side, there is a laptop open with a keyboard and mouse, as well as a few additional items placed throughout the scene. There is a cell phone placed at the top of the desk, and another cell phone placed near the bottom of the desk. Additionally, a remote control can be found in the lower left corner of the desk.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"], ["remote", "remote"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "spoon", "laptop", "potted plant", "mouse", "person"], "hallucination_idxs": [73, 85, 98], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4444444444444444}}, {"image_id": 258209, "caption": "According to the picture, a man is sitting on a bench, leaning against a building, and looking down at his phone. He is in a city setting, possibly a street in the background. The man appears to be focused on his phone, enjoying a moment of leisure.", "mscoco_hallucinated_words": [["phone", "cell phone"], ["phone", "cell phone"]], "mscoco_gt_words": ["bicycle", "book", "bench", "person"], "hallucination_idxs": [23, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 347453, "caption": "According to the picture, the city scene is bustling with multiple buses and cars. The street is lined with buses going in opposite directions, creating a busy cityscape. There are also several cars driving along the street, adding to the traffic on the city road. Among the vehicles, there is a traffic light in the middle of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 285820, "caption": "According to the picture, there is a large living room filled with furniture, including a couch located on one side of the room. There are three cats present in the image, sitting in different parts of the space. A large TV is situated in the center of the room, dominating the view. The living room features large windows that allow natural light to enter and illuminate the space. The entire room is well-lit and clean, with several items placed around the area.", "mscoco_hallucinated_words": [["cat", "cat"]], "mscoco_gt_words": ["bench", "tv", "dining table", "bowl", "cell phone", "book", "couch"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2857142857142857}}, {"image_id": 245664, "caption": "According to the picture, an individual is seated in front of a wooden table that has a laptop, a computer monitor, and other computer-related items placed on it. There is a keyboard and a mouse on the table, suggesting that the individual is using the laptop and the computer for work or other purposes. Additionally, there is a cup and a wine bottle visible on the table. The presence of these items and the man sitting in front of them suggest a working environment or an area for personal or professional tasks.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["man", "person"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "wine glass"], "hallucination_idxs": [69, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.75}}, {"image_id": 453731, "caption": "According to the picture, a dog is laying on a wooden floor, surrounded by various items like a barrel, a bottle, and a wine glass. The dog is likely enjoying the floor the dog is laying on, with the wine glass and possibly even sleeping. There is also another bottle and an umbrella nearby.", "mscoco_hallucinated_words": [["wine glass", "wine glass"], ["wine glass", "wine glass"], ["umbrella", "umbrella"]], "mscoco_gt_words": ["cup", "dog", "bottle"], "hallucination_idxs": [27, 45, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": " According to the picture, there is a group of four people in a room playing a video game together. Three of the people are actively participating in the game, while one person is watching and laughing. The participants consist of two women and a man, while the observer appears to be the third woman.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "refrigerator", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 253262, "caption": "The scene shows a group of people standing next to an airplane on the tarmac. There are six people in the image, with a man and a woman being the most prominent figures. They are spread across the scene and seem to be watching the airplane or performing some activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "suitcase", "tie", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 87052, "caption": "A young couple is standing in a kitchen, looking at each other while they work together at the counter. There is a microwave on the counter, which is near the refrigerator. A bottle of water is on the counter, close to the couple. The couple is situated in the right corner of the room, with the man wearing a gray shirt and the woman wearing a white shirt. They appear to be sharing a meal or prepping one together. The kitchen has white cabinets and a white sink in the center of the room.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["sink", "sink"]], "mscoco_gt_words": ["cup", "oven", "refrigerator", "bowl", "bottle", "vase", "person"], "hallucination_idxs": [24, 96], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.42857142857142855}}, {"image_id": 357578, "caption": "According to the picture, the scene shows a fire truck parked on a city street, likely waiting to respond to a call. The fire truck is red and has a ladder on its side, indicating its role as a fire engine. There are multiple cars driving on the street around the fire truck, with one white car in the foreground, a few cars further back, and another white car visible on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " A couple is sitting on a couch in a living room. The man is reading a book, while the woman is sleeping on the couch. The living room is filled with comfortable furniture, such as a couch, chairs, and potted plants. There are also two tables placed in the room - one near the couch and the other near the back wall.", "mscoco_hallucinated_words": [["potted plant", "potted plant"], ["table", "dining table"]], "mscoco_gt_words": ["remote", "cell phone", "book", "chair", "person", "couch"], "hallucination_idxs": [45, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 259345, "caption": "According to the picture, a man is wearing a colorful, rainbow-patterned shirt and standing under a pink umbrella. He is smiling at the camera while he holds the umbrella with one hand. The image is set outdoors, with a few cars parked in the background, adding context to the scene.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 119709, "caption": "According to the picture, the scene features a woman wearing an apron and working at a table dotted with numerous donuts. She seems to be preparing the donuts, possibly by spreading a mixture of flour and sugar onto them. The table is placed near the left edge of the image, and several donuts are spread around it. There is a second donut on the right side of the table, and another one placed near the top. A cup is situated towards the right side of the table.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["chair", "donut", "person", "dining table"], "hallucination_idxs": [85], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, a man and a woman are sitting on a bench in a public place. They are both wearing jackets, and they look calm and focused as they use their devices. They are holding their smartphones and appear to be engaged in activities using their devices.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": "According to the picture, a group of people is gathered on a large outdoor patio, likely sitting outside a building. They are situated under multiple umbrellas that line the patio, with some of the umbrellas in front of a fountain. The people appear to be engaged in conversation and spending time together. The scene features a total of nine people scattered around the patio, enjoying the outdoor ambiance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": "The image features a young man playing tennis on a tennis court. He is in the middle of the court, holding a tennis racket and swinging it to hit the ball during his serve. Several other individuals are present in the scene, scattered around the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "A group of people is walking down a city street at night. They are in a hurry and seem to be heading to their destinations, which are located along the street. The street is lined with various shops on either side, making the area a bustling commercial district. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "handbag", "traffic light", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 276719, "caption": "According to the picture, there is a dog walking down a street with several classic car, creating a unique scene. There are people around the street as well, likely observing or interacting with the dog and the cars.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "handbag", "motorcycle", "dog", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": " The image depicts a horse standing in the snow wearing a harness. The horse is next to a snow-covered pile of hay and nearby, there are two vehicles parked, one on the left and the other on the right. The horse seems to be the center of attention, as it is pulling a sled along with the plowing of the snow.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "snowboard", "car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 354559, "caption": "According to the picture, a young woman is standing in front of the background, holding a teddy bear. She is wearing a black and red costume, which adds a playful touch to her overall appearance. The woman is smiling, and hugging the teddy bear, creating a whimsical and joyful scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": "According to the picture, a group of people is gathered around a table with cake on display. The cake, cut into a couple of pieces, is on a plate placed on the table. The people in the image appear to be enjoying the cake, indicating that they are socializing and having a good time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 468005, "caption": " The image shows a large elephant walking down a pathway surrounded by a crowd of people. There are many people walking with the elephant, some closer to the front and others near the back. Many of them appear to be taking pictures, enjoying the sight of the elephant. The scene appears to be taking place in a park or on a beach, as there is water nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 545594, "caption": "In the image, there are several people on a large, public bus. The bus has multiple rows of seats, and some of the seats are occupied. Some people are standing, while others are sitting. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 219546, "caption": " The image showcases a kitchen table filled with a variety of different foods, including meat and other assorted dishes. There are several bowls placed across the table, some with salad dressing, a couple with vegetables, and a few others containing meats.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "spoon", "bottle", "fork", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 386645, "caption": "According to the picture, there is a large, old-fashioned building with an asphalt parking area in a city street. A blue truck and a white vehicle are parked outside the building. The building has a sign above the entrance that reads \"King's Chinese Restaurant\" in Chinese characters. In addition to the truck and car, there are several people on the street, with one person standing near the building, another walking further down the street, and a third one closer to the left side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 155131, "caption": "According to the picture, a woman is playing tennis on a grass tennis court. She is wearing a tennis outfit and has leaned her hand back to hit a tennis ball. She is in the act of hitting the ball, with her racquet and her hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tie", "tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 188752, "caption": "According to the picture, a man is standing in a grocery store surrounded by numerous bananas hanging from the ceiling. The man is looking at the bananas, which are hanging from various points, possibly considering purchasing one or several of them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": "According to the picture, a group of people is seated at a restaurant, enjoying their meal. The people are sitting at various tables, with several chairs placed around the tables. There are multiple umbrellas set up around the restaurant, offering shade from the sun. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "umbrella", "dining table", "bowl", "tie", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437720, "caption": "According to the image, a naked woman sits at a desk with a computer in front of her. There are several red chairs in the scene. In one of the chairs, there is a vase with a bouquet of flowers. On the desk, there is a painting, likely created by the person sitting in the chair. The image captures a moment of intimacy, with the naked woman typing on her computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 378831, "caption": "According to the picture, a woman is walking down the street front of a yellow fire hydrant, carrying grocery bags. There are several other pedestrians also visible in the scene, with one person walking towards the woman, another individual farther away, and a couple approaching from the left side of the image. In the background, there are a couple of cars and a fence that might be located near the street.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["fire hydrant", "handbag", "person"], "hallucination_idxs": [68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 375503, "caption": "The image depicts a grand staircase with ornate handrails leading up to a room with a large clock on the wall. The clock has pink and yellow decorative walls and carved details. There is a table near the clock in the room, which is positioned in the middle of the image. A person is standing near the table, and a chair can be seen to the right of the image. The room is likely a living area or reception area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 330369, "caption": "The scene depicts a red fishing boat, docked on a large body of water. The boat is situated in the foreground, with a second boat visible in the background, docked to the right of the primary vessel. Both boats appear to be similar in size and shape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, there is a man in a green field and a child nearby. They are standing next to a tree, and the man is tossing the frisbee over the child. The scene captures an interactive moment between the two individuals, showcasing their playful activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 32190, "caption": "According to the picture, there is a brown and white colored toy horse on the sidewalk in front of a business. The toy horse is placed on the sidewalk, giving it a unique position in the street scene. There are several other small toy horses, possibly arranged by the business. A car is parked close by, while several other vehicles, including a police car, are visible in the distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 535403, "caption": "According to the picture, there is a small bathroom with a sink and a bathtub. In the room, there is also a mirror, giving the illusion of a bigger space. The bathroom has multiple grooming tools, including a hair brush, a hair dryer, and several hair brushes, displayed on the counter next to the sink. Additionally, a hair dryer is hanging on a rack nearby. The bathroom features a white sink, with a collection of cosmetic items and various grooming tools, adding to the aesthetics of the bathroom.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "potted plant", "bottle", "toilet", "sink", "hair drier"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 449798, "caption": "The image features a workspace with a desk containing various items, including a keyboard and a mouse, sitting on top of the table. A laptop is also placed on the desk, indicating the presence of a computer setup. A TV set is visible nearby, possibly serving as a screen for the computer.", "mscoco_hallucinated_words": [["tv", "tv"]], "mscoco_gt_words": ["keyboard", "dining table", "laptop", "mouse", "cell phone", "book"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6666666666666666}}, {"image_id": 375769, "caption": "According to the picture, there is a young child wearing a helmet sitting at a table, eating a sandwich, while looking at the bikes outside. The bikes are placed around the dining area, with some visible behind the table and others in the background. The child is seated in front of the dining table and appears to be enjoying their meal.", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["bicycle", "cup", "umbrella", "dining table", "hot dog", "car", "chair", "person"], "hallucination_idxs": [20], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.375}}, {"image_id": 426172, "caption": " The image showcases a table with a display stand full of cakes, cheese, and other food items. A variety of dishes, including some cake slices and different types of cheese, are placed on the table and are available for customers to browse and purchase. ", "mscoco_hallucinated_words": [["cake", "cake"], ["cake", "cake"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "book", "chair"], "hallucination_idxs": [11, 27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 20650, "caption": "According to the picture, there is a sandwich on a plate accompanied by broccoli. The sandwich appears to be filled with a variety of ingredients, such as lettuce, tomato, and cucumber, making it a tasty and healthy meal. The broccoli is also visible, likely contributing to the nutritious value of the meal. A fork is also present on the table, suggesting it could be used for consuming the sandwich or other foods on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "broccoli", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 138078, "caption": "The image portrays a young baseball player standing on the field, holding a baseball glove. He is wearing a red baseball uniform and glasses, and is looking down at a ball held by another player.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 152499, "caption": "According to the image, there is a group of people gathered around a swimming pool. They appear to be participating in a fun activity, possibly involving pool noodles. There are adults and children in the group, with adults holding various pool noodles. A bottle is placed near the left edge of the pool. The atmosphere seems to be friendly and engaging. ", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["surfboard", "umbrella", "dining table", "potted plant", "chair", "person"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.16666666666666666}}, {"image_id": 180764, "caption": "According to the picture, a man is eating a doughnut while sitting in a room. There is another person in the room, but it's not clear what their relation is to the man eating the doughnut.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["microwave", "donut", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 147762, "caption": "According to the picture, there is a white plate on a table, topped with a piece of cake. The cake has an orangish color, covering part of the plate. Additionally, a small, spherical item is placed atop the cake, possibly a scoop of ice cream or a sweet treat. The dessert is decorated with some yellow and brown features, giving it a nice presentation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "cake", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 113860, "caption": "According to the picture, a black and white boat is floating on the water. The boat is quite large and is positioned near the middle of the scene, traveling down a canal. It appears to be the main focus of the image, as it stands out among the various boats in the vicinity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 460866, "caption": "The image features various oral hygiene items, including toothbrushes and toothpaste, arranged in a sink or a cup. There are eight toothbrushes inside the sink, with some placed together and others scattered throughout the cup. Additionally, there are two separate toothpaste bottles, one towards the left and the other towards the right side of the image. The arrangement suggests a collection of used toothbrushes and toothpaste within the sink.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["toothbrush", "cup", "dining table", "bowl", "sink"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 177015, "caption": " According to the picture, a man is sitting on a couch with his laptop open, and a cat is sitting on his lap, both engaged in their respective activities. The man is working on his computer, and the cat seems to be enjoying the warmth and attention from the man. In the scene, the couch covers a significant part of the background, and other furniture can be seen in the room, such as a chair positioned further back on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "laptop", "cat", "apple", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "According to the picture, the scene features a small kitchen with oak cabinets and chairs. The kitchen is clean and organized, with several appliances visible. A refrigerator and a microwave can be found in the room, with a sink located in the middle.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "suitcase", "oven", "refrigerator", "bowl", "bottle", "sink", "microwave", "person"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 47131, "caption": "The image shows a group of people gathered around an old train wreck. There are at least 13 men in the picture, with some of them on the ground and others standing around the wreck. The train appears to be a large, metal structure, which has been destroyed in some places. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425447, "caption": "According to the picture, there is a white pickup truck parked on the street, with water used to clean the area. The truck is parked next to a tree, and there are a few other people in the area, including two individuals walking towards the left of the scene, a man near the right side, and another person standing towards the center of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "fire hydrant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295105, "caption": "According to the picture, a man is standing in a kitchen, looking at the refrigerator with a curious expression on his face. He appears to be checking the contents of the refrigerator, deciding what to eat. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "bowl", "bottle", "apple", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 443259, "caption": "According to the picture, there is a spacious modern kitchen with white countertops, a range, a sink, a white refrigerator, and a freezer. The room is well-lit and clean, showcasing a well-organized space. Two apples can be seen, one on the left side and one on the right side of the kitchen, possibly arranged as a decoration or as part of a snack. The area has plenty of room, with the oven, sink, and refrigerator located near the center and along the sides.", "mscoco_hallucinated_words": [["apple", "apple"]], "mscoco_gt_words": ["oven", "refrigerator", "dining table", "bowl", "potted plant", "bottle", "orange", "sink", "chair"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.3333333333333333}}, {"image_id": 537861, "caption": "According to the picture, a train is parked at a train station, possibly due to a delay or a stop. The train has a yellow and blue color scheme, and it is being pulled by an engine. The train is traveling on the tracks, and it appears to be on a bridge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "According to the image, there is a life-sized donald duck statue placed in front of a large Christmas tree. The donald duck character is surrounded by a large group of people in the surrounding area. The image appears to be from a Disneyland park, possibly at Christmas time, with the festive decorations on the street and in the park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bird", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 249969, "caption": "The image features a delicious and elaborate meal served on a white plate. The meal features a roasted chicken along with a variety of vegetables, including broccoli and carrots. The vegetables are accompanied by numerous pieces of meat, likely beef. This well-rounded and attractive meal is accompanied by a drink, likely wine or a beverage.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 146504, "caption": " In the image, a man is sitting at a desk and typing on a computer keyboard, using his mouse. The keyboard and mouse are placed on a dining table, and the man's hand can be seen positioned over the keyboard, possibly about to click a button or mouse click. In the background, there are two cups on the table, one closer to the keyboard and mouse and the other on the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 40037, "caption": "According to the picture, there is a large, white clock tower situated in front of a building with a green roof. The clock tower has a tall and slender design, while the building has a similar, grey appearance. The clock tower is situated in front of the building with a green roof, which could be a restaurant, a hotel, or another public building.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 343218, "caption": "The image shows a young man playing tennis on a court, holding a tennis racket and attempting to hit a ball with it. There are two other individuals in the scene, standing on the left and right sides of the court. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 301837, "caption": "The image shows a spacious and modern kitchen, featuring white cabinets and countertops. The kitchen is equipped with several appliances such as a stainless steel refrigerator, an oven, a microwave, and a dishwasher. There are two large windows in the kitchen, allowing plenty of natural light to enter and making the space feel brighter. The long, narrow kitchen is adorned with various objects such as knives, spoons, and bowls. Among these, a cell phone is also visible in the picture. Additionally, there are a few bottles placed on the countertops, likely containing beverages or liquids.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["knife", "knife"], ["spoon", "spoon"], ["bowl", "bowl"], ["cell phone", "cell phone"], ["bottle", "bottle"]], "mscoco_gt_words": ["clock", "oven", "refrigerator", "apple", "sink"], "hallucination_idxs": [33, 74, 76, 79, 85, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.75, "Recall": 0.4}}, {"image_id": 309071, "caption": "According to the picture, a man is standing next to a parked bicycle, adjusting it. He is wearing a blue shirt and pants, and has a tie. The man is also wearing a hat, which might suggest that he is working on the bicycle. In addition, there is a bench located nearby, along with a cup resting on it.", "mscoco_hallucinated_words": [["tie", "tie"], ["cup", "cup"]], "mscoco_gt_words": ["bicycle", "bench", "potted plant", "backpack", "person"], "hallucination_idxs": [30, 64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6}}, {"image_id": 240681, "caption": "The image depicts a person standing next to a red motorcycle on the side of the road. This individual is holding a cell phone and appears to be taking a picture of the motorcycle or possibly taking a call. There are no other significant elements in the scene, just the person, motorcycle, and cell phone.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [23, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 178753, "caption": " According to the picture, the scene shows a young girl sitting in a red chair, wearing braces and smiling while holding a cell phone. She appears to be in a seated position, possibly at an event or stadium. The image captures the girl in an active and cheerful moment while using her cell phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278890, "caption": " According to the picture, the scene features two people, one man and one woman, standing near each other. Both individuals are looking down, likely at their phones, as they are holding their devices in their hands. The man appears to be wearing a blue shirt, while the woman wears a shirt that is not visible in the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": "According to the image, there is a modern bedroom with a red-colored bed sitting in the middle of the room. The bed is made of dark wood, adding a touch of elegance and sophistication to the room. The furniture consists of a desk with various books and a chair, both of which can be found on the sides of the room. There is a chair in the room that appears to be slightly closer to the desk, and a vase is placed on a table. The overall atmosphere of the room suggests a relaxing and refined setting.", "mscoco_hallucinated_words": [["desk", "dining table"], ["chair", "chair"], ["chair", "chair"], ["desk", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["tv", "vase", "book", "bed", "person"], "hallucination_idxs": [46, 52, 70, 82, 91], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.6}}, {"image_id": 542792, "caption": " A large baseball bat is positioned high up on a building, creating an attention-grabbing and iconic image. The baseball bat sits atop a tall building, likely located near a city. There are several other buildings nearby, likely situated in the same city. Some of these buildings have shorter heights, while others are taller and more prominent in the landscape. Overall, the scene captures the distinctiveness of the baseball bat, which draws attention and creates an eye-catching sight in the cityscape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 293703, "caption": "According to the picture, there are three men sitting around a dining table. Two of the men are located on the right side of the table, and the third person is seated closer to the center. Each man has a plate filled with food, suggesting they are eating a meal together. The table features several bowls and cups placed around the table, with cups on the right side and bowls on the left. Additionally, there are a few pizzas on the table, indicating that they are enjoying Italian food during their dining experience. The atmosphere appears to be casual and social, as the men are smiling while dining together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 318777, "caption": "According to the picture, a group of baseball players is on the field, playing a game. They are all wearing their respective team's uniforms and are positioned around the baseball field, ready to make their next move. The field is covered with grass and dirt, and spectators are scattered across the stadium, watching the game and preparing for their viewing experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 14773, "caption": " The image shows a dining table filled with a variety of food. A hot dog, hamburgers, and french fries are visible, arranged in a car-shaped container. There are additional food items on the table, including a bottle, two cups and a spoon. The dining table is located in the center of the scene, with chairs on each side of it.", "mscoco_hallucinated_words": [["hamburger", "sandwich"], ["container", "bowl"], ["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "fork", "hot dog", "car", "person"], "hallucination_idxs": [17, 29, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.625}}, {"image_id": 521863, "caption": "According to the picture, there is a person on a skateboard riding down a sidewalk through a large, glass window. He appears to be skating down the sidewalk, enjoying the skateboarding experience. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": "According to the picture, there is a group of men on motorcycles in a city setting, with several people on them. One of the motorcycles is on the right side of the scene, while another motorcycle is in the middle, and a third one is on the left side. There are five people visible, including four seated on the motorcycles and one standing nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 211604, "caption": " The image features a young woman laying on a wooden floor, wearing a white tennis outfit. She has her head towards the right side of the picture, and appears to be relaxed while resting on the ground. She is holding a tennis racket beside her, possibly taking a break from the game or resting after a match. The woman's position can be seen in the middle of the scene, with various sports equipment and a few other objects placed around the wooden floor.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "This image features a close-up view of a plate with some chicken pieces on it, and there are several beers placed in the background. A bottle of beer is located close to the edge of the table, creating an interesting and visually appealing setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "cup", "dining table", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 563938, "caption": " The image shows three people sitting on a couch, with one man and two women. The man and one woman are in the foreground, while the other woman sits further back. They all appear to be engaging in conversation and looking at a piece of paper, which might be their tickets to an event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tie", "bottle", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 461311, "caption": "The scene depicts a large group of people, including children, gathered in a park during a kite flying event. The sky is filled with numerous kites of various shapes and sizes, with some of them being flown by the people in the park. There are kites of different sizes, with one slightly larger kite flying over a smaller kite. The people in the park appear to be enjoying the event, possibly flying their own kites or cheering on others. Overall, the image captures the lively atmosphere of a kite flying festival.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": " The image depicts a person lying on their bed with their legs dangling off the side. A variety of books can be seen spread out on the bed around them, providing reading material for this relaxed moment. There is another stuffed animal positioned near the person's lower body, and a laptop is placed close to the person's head. Overall, it is a casual and comfortable scene featuring a mix of leisure and study materials.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "laptop", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261318, "caption": "In the image, a man is sitting on a chair, using a cell phone to make a call. He is surrounded by a number of suitcases and backpacks, suggesting that he is traveling or possibly returning from a trip. The cell phone he is using appears to be a flip phone, which is commonly used for making calls and also for texting.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["suitcase", "handbag", "bench", "cell phone", "chair", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 134768, "caption": "According to the picture, there is a keyboard sitting on a desk in front of a computer monitor, with a pair of headphones and another set of speakers nearby. The person seems to be using the computer and enjoying the music that's being played.", "mscoco_hallucinated_words": [["monitor", "tv"], ["person", "person"]], "mscoco_gt_words": ["keyboard", "laptop", "mouse", "dining table"], "hallucination_idxs": [18, 33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 526414, "caption": "According to the picture, the scene is a home office with a table located in the center of the room. On the table is a sewing machine, a pair of scissors, and a bottle. The sewing area contains a desk with some books and a laptop. There is a couch located near the wall on one side of the room. On the other side, a pair of sneakers and a chair can be seen. The table is positioned near the couch, suggesting it could be used for sewing or other activities. The space is well-organized, with various items placed around the room.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["remote", "cup", "tv", "dining table", "bowl", "laptop", "cat", "book", "scissors", "couch"], "hallucination_idxs": [37, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5}}, {"image_id": 273132, "caption": " According to the image, there is a woman sitting on a park bench in an older black-and-white photo. She seems to be wearing a cap and looking at something outside the frame, possibly towards a court nearby. The woman appears to be elderly.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": "According to the picture, there is a man in a crowded area eating hot dogs, likely having fun with the scene. One of the hot dogs appears to be in his mouth, with others arranged around him. There are a total of five hot dogs visible, one located to the right, another one in front of the man, and three on the left side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": "According to the picture, there is a young man standing on a baseball field, wearing a black and white baseball uniform. He is holding a baseball and is likely waving or signaling to someone off the field. The man is in the middle of the image, showcasing his baseball-related activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 272728, "caption": " The image features a beautiful bouquet of colorful flowers placed in a wooden vase, creating a captivating visual display. The bouquet contains various flowers with different colors and shapes, such as pink, purple, and red. There are a total of 10 flowers in the bouquet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 509811, "caption": "According to the picture, there is a group of people sitting in a classroom or study area surrounded by desks and chairs. Multiple laptops are placed on some of the desks, suggesting that they are being used for studying or working. The people in the scene appear to be students or employees working on their laptops.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "apple", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 412407, "caption": "According to the picture, there is a woman holding a cell phone up to show it to someone in the room. The phone appears to be open, and it has an activated camera for taking pictures. Several people can be seen in the background, suggesting a crowded or busy environment. One person, likely the woman, is also holding a cell phone in their hand, which might be a different one than the one she is showing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "According to the picture, a woman is walking down a street in a city, carrying a pink umbrella to protect herself from the rain. There are several other people present in the scene, one of them being a man. Some of them appear to be shopping, browsing, and carrying handbags.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 441363, "caption": "According to the picture, there is a young man wearing a red shirt standing on a tennis court, playing tennis with a racket. He is swinging the racket to hit a ball in the air, likely during a game. Another person is also in the scene, standing closer to the right side of the court. There are multiple chairs in the background, possibly for spectators or players waiting to participate in the game.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 331084, "caption": " A group of people, including two adults and two kids, are gathered around a table in the image, preparing food for a pizza party. On the table, there are several ingredients, such as vegetables like tomatoes, onions, and mushrooms, which can be seen placed on a kitchen board.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table", "bowl", "spoon", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 282553, "caption": "According to the image, a man is walking down a path through a grassy area. The path is setting in the background, casting a shadow on the grass. The sun is setting in the background, creating a beautiful scene. There are two other people walking in different directions on the path.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, a young woman is sitting on a bench while putting on her shoes. She is in a park setting with a bicycle nearby, indicating that she is likely preparing to go for a walk some other day. The image captures a casual moment, with the woman possibly getting ready to enjoy a bike ride or a walk in the park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 3103, "caption": "According to the picture, a man in a yellow helmet and black pants is performing a trick while wearing a helmet, showing off his skills on a skateboard. The man is actively involved in the activity, with a crowd of people in the background observing or possibly waiting for their turn to ride the ramps. The skateboarder seems to be riding a handrail or jumping on a downward sloping ramp.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "According to the picture, there is a kitchen area with various items arranged on the countertop. There are a few items such as a pitcher, a bowl, a large pot, and a bottle, giving the scene a clean, white atmosphere. A few apples and oranges are visible on the counter, creating a simple yet appealing setting. Two refrigerators, a stove, a sink, and multiple vases can also be seen in the room, giving it a more complete look.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["apple", "apple"], ["refrigerator", "refrigerator"], ["stove", "oven"], ["vase", "vase"]], "mscoco_gt_words": ["dining table", "bowl", "orange", "sink", "carrot", "person"], "hallucination_idxs": [37, 50, 67, 70, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5555555555555556, "Recall": 0.6666666666666666}}, {"image_id": 449675, "caption": "According to the image, there is a white plate sitting on a table filled with food. The plate contains a sandwich, french fries, and a fork. The sandwich appears to be a grilled cheese sandwich, which is a popular item at many fast food restaurants. There are two slices of bread, cheese, and vegetables in the sandwich. The plate also features a fork, suggesting the meal is being enjoyed as a quick bite. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "spoon", "bottle", "sandwich", "fork", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 191691, "caption": "According to the picture, the man is shaving in a bathroom. He is standing in front of a large mirror, using a toothbrush to clean his teeth. In the background, there is another person in the same room, possibly a friend or family member watching him. The image captures a moment of the man getting ready for his day, possibly preparing for a special event or just his normal routine.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 89378, "caption": "According to the picture, there is a man standing next to a motorcycle in front of a red building. The motorcycle appears to be a sporty bike with a white and black color scheme, and the man is wearing a black helmet. Another person is visible in the background, possibly taking a picture. The man appears to be standing in front of a large garage door with multiple trucks parked beside it. Overall, the image features a scene involving two people and a motorcycle in front of a red building.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["bicycle", "motorcycle", "backpack", "person"], "hallucination_idxs": [74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.75}}, {"image_id": 99874, "caption": " According to the picture, a young woman is sitting at a table in a restaurant, wearing sunglasses and talking on a cell phone. She looks relaxed, possibly chatting with a friend or family member. The woman is seated in front of a cup, likely her drink during the conversation.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["handbag", "dining table", "bottle", "cell phone", "cat", "person"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 437895, "caption": "The image is of a bed with a colorful, ornate quilt covering it. The bed has a frame or metal canopy on top, which gives the image an artistic and unique look. The quilt is placed on top of the bed, adding a touch of elegance and warmth to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": "The image features a man, possibly an executive, sitting at a table wearing a suit and tie. He appears to be in a meeting room and is focused on a document in front of him. He points at the document with his finger, drawing attention to it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "tie", "spoon", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 159979, "caption": " The image captures a group of people sitting on a wooden boat located in the water, surrounded by various umbrellas. There are several umbrellas, each with a different size and color, protecting the individuals from the sun. In total, there are five umbrellas spread out across the boat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 357, "caption": "According to the picture, there is a baseball game taking place on a green field. Several players are actively engaged in the game. A batter is standing near home plate, ready to swing at the ball. The catcher is also in place, preparing to catch the ball when it is pitched. ", "mscoco_hallucinated_words": [["ball", "sports ball"], ["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [39, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 538891, "caption": "According to the picture, there are multiple people in different positions around the young woman, but they appear to be scattered and not particularly close to each other. A young woman is standing on a tennis court, holding a tennis racket and looking up. She is wearing a white tennis shirt and has her hair tied back. She seems to be ready to hit a ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "In the image, a train has come to a stop on the railroad track. The train is carrying shipping containers, with several visible along the length of the train. There are multiple trains traveling in different directions throughout the image, including the train in the foreground that has stopped, and a couple of others that can be seen in the background. This scene showcases the transportation and distribution of these shipping containers on the train tracks.", "mscoco_hallucinated_words": [["container", "bowl"], ["container", "bowl"]], "mscoco_gt_words": ["traffic light", "train", "car"], "hallucination_idxs": [21, 78], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.3333333333333333}}, {"image_id": 565778, "caption": "The image shows a blue train traveling along a track next to a tunnel. There are several people in various positions on the train, including one at the front, a few near the middle, and others at the back. The scene features multiple traffic lights along the track, adding to the train's visibility as it moves through the environment. Overall, it's a typical scene of a train traveling through the railroad tracks, enhanced by the presence of people on board.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 7125, "caption": "The image portrays a sunny day with several people enjoying the scenic view while riding bicycles down a path next to the water. There are four bicycles visible in the picture, one located towards the left, one in the center, another one near the right side, and a final bicycle closer to the right edge of the image. A group of riders is dispersed along the path, with one person riding a bicycle near the left edge, two individuals close together, and another person on a bicycle a little further back. The path runs along the water, and it seems like a popular spot for riding bicycles on a pleasant day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "bottle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": "According to the picture, an older man is wearing a red and white tie, and he is holding the end of the tie on his shirt collar. He appears to be in a formal setting, possibly preparing for a meeting or event. The scene captures the person's attire and actions, showcasing a stylish yet professional appearance. The tie is tied neatly and the man is wearing a white shirt beneath it. The tie itself is red and white, adding a vibrant touch to his overall attire.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "tie", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 280930, "caption": " The image shows a young girl standing in a kitchen, looking surprised while holding a pan with baked goods. She is smiling and laughing, making the scene amusing. The girl is wearing a blue shirt, and the kitchen has a stove nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 151081, "caption": " The scene is a field located near a road with a small airplane flying above. The plane can be seen passing over the field, capturing a bird's eye view of the rural area. There is a person standing near the road watching the plane, and several other people, cars, and a truck are seen scattered throughout the field. One car is parked near the road, and another is farther away in the field. A tractor can be seen moving along the road, adding to the rural setting.", "mscoco_hallucinated_words": [["bird", "bird"], ["car", "car"], ["car", "car"]], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [28, 54, 67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 122232, "caption": " In the image, a woman is standing on a tennis court holding a tennis racket. She is wearing a yellow raincoat and is surrounded by several tennis balls placed at various positions on the court. The balls range from closer to the woman, further from her, and also on opposite ends of the court. Some balls are positioned at the edge of the court while others are closer to the woman. The scene captures the woman's presence on the tennis court as she stands there holding a tennis racket, ready for action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": " The image displays a sandwich served with french fries on a plate. The sandwich is made with a hamburger, with various toppings, including lettuce and ketchup. The fries are accompanied by two sauces, one on the left side and the other on the right side of the plate. Two more sauces are on the right side, closer to the edge of the plate. The arrangement is arranged on a table in a dining area.", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["sandwich", "sandwich"], ["hamburger", "sandwich"]], "mscoco_gt_words": ["bowl", "hot dog", "broccoli", "dining table"], "hallucination_idxs": [4, 14, 19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.75, "Recall": 0.25}}, {"image_id": 198163, "caption": "According to the picture, a man is sitting in front of a table covered with small bowls filled with various colors of paint. The man is painting or glazing the items on the table. The scene showcases a man working in an art studio or workshop, where he is creating a collection of hand-painted bowls.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 3742, "caption": "According to the picture, it displays a window filled with a variety of apples, arranged in a bowl and placed on a windowsill. There are numerous apples visible, with some in the front and others scattered across the scene. Additionally, a vase can be seen in the image, set in a window, where a flower is also visible. This image captures a window, showcasing a beautiful display of fruits and a flower, likely creating a welcoming atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "apple", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 217593, "caption": "The scene features a young boy dressed in a baseball uniform and standing in a grassy field, possibly practicing for a game or waiting to play. There are several people in the area, but none of them seem to be involved with the child. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "chair", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 279078, "caption": "According to the picture, it shows a young girl holding a bottle of beer and looking at an open refrigerator full of many bottles of beer. The bottles are arranged in multiple positions inside the refrigerator, creating an interesting display. The girl seems excited and curious about the selection of beers available.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": "According to the picture, a man is playing tennis on a court with a net in the background. Another man is watching him. The two tennis players are participating in a tennis match, and the court is surrounded by a crowd of people watching and possibly cheering them on. The court is set up for an event in a large arena. People are standing around and sitting on benches in various positions, possibly engaged in conversation or observing the game.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["tennis racket", "chair", "sports ball", "person", "couch"], "hallucination_idxs": [74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.2}}, {"image_id": 477016, "caption": "According to the picture, the image features a small girl, likely around four or five years old, with her mother walking towards a decorative kitchen. The kitchen has various features, including a sink, an oven, and a stove. The oven is colored in a pink and white design, and the stove appears to be in a light blue color.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "oven", "backpack", "potted plant", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301266, "caption": "According to the picture, a group of young people, including a man, are gathered around a table in a kitchen, engaged in a conversation. Some are holding baskets, while others are working on whatever they are doing in the kitchen. There are also a few bowls, such as knives and spoons, visible on the table.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["knife", "knife"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "book", "fork", "sink", "person"], "hallucination_idxs": [52, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.42857142857142855}}, {"image_id": 475813, "caption": "The image features a small child standing in a room, holding onto a cell phone with both hands. She appears to be walking around while holding the phone. The room is spacious, with a couch positioned near the child on the right side of the room and a chair on the left side of the room.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["phone", "cell phone"], ["couch", "couch"], ["chair", "chair"]], "mscoco_gt_words": ["remote", "handbag", "person"], "hallucination_idxs": [14, 29, 38, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.3333333333333333}}, {"image_id": 558784, "caption": "According to the picture, a man is playing golf on a field, standing near the water holding his golf club. He is dressed in clothes, and a baseball bat placed on the ground next to him. He is captured in a serene landscape, with water in the background.", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["backpack", "frisbee", "person"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": " The image depicts a person's hand holding two silver spoons, which appear to be twins. The individual is standing over a sink, possibly washing the utensils. The sink is filled with water, indicating that the process of cleaning the spoons is about to take place. The image creates an interesting scene of a person's hand holding two spoons and standing over a sink.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "spoon", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 359164, "caption": "According to the picture, there are two dogs - a golden retriever and a small dog - running around on a grassy area, playing and having fun together. The golden retriever is seen picking up something, possibly a toy, while the small dog follows close behind.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "chair", "dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 539951, "caption": "There is a small, Lego-built kitchen on display, featuring a stove, sink, and refrigerator. The kitchen has a black and white checkerboard pattern on the countertop and walls. There are two different types of ovens in the kitchen, one on the left side and the other on the right side, and both are part of the Lego build. The refrigerator is also built from Lego blocks and can be found in the center of the kitchen. The entire kitchen is made out of Lego bricks, showcasing the creativity of the builder.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "clock", "oven", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 143479, "caption": "According to the picture, there is a man, possibly an older man, holding a tray full of hot dogs in front of a woman. They are standing in a grassy field, and the woman appears to be looking at the food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": "The image features a vase with flowers and a mirror behind it. The vase has a combination of different flowers and leaves, with a mix of red and orange colors, arranged throughout it. The flowers are concentrated near the top and bottom parts of the vase. The mirror is located just behind the vase, reflecting the arrangement on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 238065, "caption": "The image depicts a young man skateboarding on a wooden ramp, possibly performing tricks in the air. He is wearing a blue shirt and appears to be enjoying the activity. There are a few other skateboards scattered around the scene, including one close to the main subject, another towards the left, and another one near the top left corner. The overall atmosphere appears to be playful and filled with excitement, showcasing the fun of skateboarding.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, there is a small bathroom containing various items. A toilet can be seen in the bathroom, along with a sink filled with alcohol, a glass vase containing various items, and a book. The sink has been filled with items such as a bottle, books, and other items. A cup can be spotted in the bathroom, and some books can be seen on a shelf. The bathroom seems to be used for its intended purpose, with some items being displayed on a shelf and others being stored within the space.", "mscoco_hallucinated_words": [["vase", "vase"], ["cup", "cup"]], "mscoco_gt_words": ["sink", "book", "toilet", "bottle"], "hallucination_idxs": [33, 61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 322163, "caption": " A group of people, including a chef, are gathered around a table in a commercial kitchen, preparing food. The chef is cutting food, while the other people watch and assist. The scene suggests that the chef is about to serve a meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "pizza", "oven", "dining table", "bowl", "potted plant", "bottle", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 230983, "caption": "According to the picture, a young man is seen performing a skateboard trick on the sidewalk. He is jumping off a green skateboard, and extending his arms out to fly through the air. A building can be seen in the background, and multiple other people are also present in the scene, possibly observing the skateboarder's trick.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "According to the image, the scene features a large kitchen with wood cabinets, multiple ovens, and a sink. There is also a dining table with chairs in the kitchen. The kitchen is decorated with wood cabinets and a range hood. The kitchen appears to be well-organized and functional, providing ample space for cooking and dining.", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["sink", "microwave", "oven", "refrigerator"], "hallucination_idxs": [27, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 53431, "caption": "According to the picture, a man is sitting on a bench in front of a brick wall. He is wearing a blue shirt and a tie and is using a laptop while resting his feet on a chair. He is sitting in a chair with a pillow placed on it. The location could be a building or a park, given the brick wall and bench.", "mscoco_hallucinated_words": [["bench", "bench"], ["tie", "tie"], ["bench", "bench"]], "mscoco_gt_words": ["laptop", "chair", "person"], "hallucination_idxs": [11, 27, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 1.0}}, {"image_id": 441840, "caption": " The image features a young boy playing baseball in a field, holding a baseball bat and swinging it at a baseball. He is in the center of the image, surrounded by other individuals who are likely watching or observing the play. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53465, "caption": "According to the picture, there is a large, decorated cake placed on a table in front of a group of people, likely a gathering of friends or family celebrating a special occasion. The cake itself is adorned with the words \"Welcome to [IDK],\" with an [IDK] design added to it. This unique cake style adds a touch of fun to the celebration. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 381031, "caption": " A table is filled with different types of foods, including a dozen donuts, a knife, an apple, and a glass of orange juice. The dozen donuts are placed on a white plate, while the apple and glass of orange juice are also present on the table. Various bottles and cups, containing additional beverages, can be seen nearby. The setup looks like a casual and whimsical breakfast setup.", "mscoco_hallucinated_words": [["knife", "knife"], ["orange", "orange"], ["orange", "orange"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "apple", "microwave", "donut"], "hallucination_idxs": [16, 25, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.7142857142857143}}, {"image_id": 530520, "caption": "The image captures a group of people sitting outside on the grass, using their laptops. There are four men and one woman, all seated together in a casual setting. They seem to be enjoying each other's company while working on their laptops. The laptops are placed on the grass in front of them. Some of the people have one or two hands touching the grass.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "motorcycle", "laptop", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 539555, "caption": "The image captures an exciting moment where a young man in a white t-shirt is performing a back flip while holding a green frisbee. There are several people in the background, possibly observing the man's impressive acrobatics. The man is caught mid-air, with one foot off the ground and another one extended, providing a thrilling display of physical skill and agility.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": " According to the picture, two young men are standing next to a subway train, looking at it. One of them is wearing a jacket, while the other has a backpack. They are standing near a sidewalk, discussing something related to their plans, possibly waiting for a train or planning their next move.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 560439, "caption": "According to the picture, there are four people in the scene who appear to be walking along a path in the snow, enjoying themselves and spending time together. The main focus of the scene is the group of people walking the snowy path, with two of them wearing snowshoes. The three individuals are spread out along the path, with one at the front, another in the middle, and the last one trailing slightly behind the others.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "snowboard", "backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": "According to the picture, there is a group of people skiing down a snowy slope. There are 13 people visible in the scene, most of them wearing skis. There is a mixture of male and female skiers, creating a diverse skiing environment. People are spread out across the slope, skiing and enjoying the day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 9628, "caption": " The image depicts a man riding on water skis, performing a trick in the middle of the water while a friend observes from shore. The main man is in the air, leaping over the water and enjoying the thrill of the ride. He is accompanied by another person, who appears to be watching the action from the shore. The water skiing activity is happening on a lake with a significant amount of water visible around the participants. The scene captures the excitement and thrill of the water skiing adventure.", "mscoco_hallucinated_words": [["ski", "skis"]], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 330050, "caption": "According to the image, there is a man and a woman standing next to each other on a tennis court. They appear to be smiling and posing for the camera, suggesting they might have been taking a group photo. Both individuals are holding tennis rackets. The man has a racket in his right hand, while the woman is holding a racket with both hands. There are also two chairs in the scene, but it's unclear whether they are part of a nearby sports event or just casual seating for the setting.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["tennis racket", "bench", "person"], "hallucination_idxs": [75], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " According to the picture, it is a warm summer day as several people enjoy an outdoor brick plaza. One man is standing near a tall building, flying a large kite. He appears to be in the middle of the plaza, surrounded by a couple of other individuals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "According to the picture, a group of people are standing in a small room, possibly a living room or a dorm room. The people are chatting, laughing, and interacting with each other, creating a casual and social atmosphere. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "mouse", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 163518, "caption": "According to the picture, a woman in a chef's uniform is standing in a kitchen, placing prepared vegetables into plastic containers. The woman is positioned at the middle of the scene, in front of a table filled with different types of food, including vegetables, as well as a bottle and a cup. The image also includes a sink in the back and a refrigerator further back in the room, near the woman. The woman is focused on her work, making it clear that she is preparing food for customers or workers in the facility.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["sink", "sink"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["oven", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [55, 58, 65, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6}}, {"image_id": 156696, "caption": " There are two trains at a train station, with one train being longer than the other. Both trains are painted in a blue color. The longer train is parked on the track, while the shorter one is next to it on the same track. The people in the scene are standing close to both trains, possibly waiting for them to move.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "train", "handbag", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 525202, "caption": " The image displays a group of people sitting at a table in a restaurant, enjoying a delicious pizza. The pizza is placed on a pizza pan and is likely being shared by the group. There are multiple people at the table, with a man and woman being the main figures in the foreground, while others are in the background. It appears that they are having a meal, likely a casual evening out with friends or family.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 423984, "caption": "The image features a white plate with a bowl of pasta, a slice of bread, and carrots on it. The pasta is covered in a creamy sauce, and there are two carrots visible on the plate. The carrots are positioned closer to the center of the dish, with the bread placed next to them. There is another set of carrots and a knife on the dining table nearby, but they don't appear to be part of the dish on the plate.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["knife", "knife"]], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [8, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 499486, "caption": "According to the picture, two cats and a kitten are sitting on a bed, laying down comfortably. The bed is covered with a messy sheet, and the cats are surrounded by various items on the bed. The cat at the left is larger, occupying most of the left side of the bed, while the cat at the right is smaller, taking up only a portion of the right side. The kitten is situated between the two other cats. Apart from the cats and the kitten, there are no other people or animals in the scene.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["book", "cat", "bed"], "hallucination_idxs": [100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07142857142857142, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": "According to the picture, it appears to be an image of a young man sitting inside a car, presumably in the driver's seat. The man is dressed in a black shirt with a tie and is wearing glasses. He is the main focus of the image. The scene is also showing a steering wheel, possibly suggesting that the man might be using it while driving his car.", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 288729, "caption": "According to the picture, a cake is in the shape of a circus tent, and it is placed in front of a sign. The cake is decorated with clowns, as well as a balloon and a bear. Additionally, there is a figure standing next to the cake, and another sitting nearby. A clock is seen on the wall in the background, and a toy truck is visible nearby.", "mscoco_hallucinated_words": [["clock", "clock"], ["truck", "truck"]], "mscoco_gt_words": ["teddy bear", "bear", "elephant", "cake", "person"], "hallucination_idxs": [60, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 177721, "caption": " The image captures a young man riding a skateboard on a ramp. He is in the middle of performing a trick, with his feet off the ground as he showcases his skateboard skills. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "The image shows a man in a black suit, standing near a large stone wall and looking at his cell phone. He is wearing a beard and appears to be using his cell phone to communicate with someone or access important information.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "horse", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 252625, "caption": "According to the picture, a street with shops and various buildings is occupied by people, giving a lively and vibrant atmosphere. There are multiple people walking and riding bicycles along the street, some carrying backpacks and bags. Additionally, there are a few shops along the street, including a restaurant and some stores with signs. The overall scene is bustling with activity and people going about their day.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["traffic light", "potted plant", "person", "bicycle"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 493610, "caption": "According to the picture, there is a group of people riding in a white truck on a dirt road in a developing country. The truck is carrying several passengers, and it is traveling through a dirt street, possibly in a developing country. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 302789, "caption": " A small room is occupied by a bed, which has a suitcase and other bags packed on it. The suitcase is placed at the left side of the bed, while the other bags are scattered throughout the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "refrigerator", "backpack", "bottle", "sink", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 30667, "caption": " According to the picture, there is a brown dog lying on the floor of a boat. The dog appears to be resting or sleeping, and there are two individuals sitting near it. One of the people is closer to the left side of the image, while the other person is to the right.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 114108, "caption": "According to the picture, it showcases a group of people sitting around a table, enjoying a homemade pizza. The pizza is cut into slices and served on a white plate. They are having a good time, engaging in conversation and socializing with each other while savoring their food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 328110, "caption": " The image shows a woman sitting on a bench with her baby in her arms, looking at her phone. She appears to be engaged in an important call or message. The woman is surrounded by a group of people in the background. These people can be seen walking down the street or along a path.", "mscoco_hallucinated_words": [["phone", "cell phone"]], "mscoco_gt_words": ["backpack", "handbag", "bench", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 366111, "caption": "According to the picture, a group of people is riding bicycles through the city streets, creating a lively scene. The bicycles are various in size, some of which are larger, and some are smaller. Some people wear hats, and a few of them are dressed in suits, suggesting that this is an old-fashioned photo of people biking in their finery. They ride on different types of bicycles, which creates a diverse range of activities on the streets. Overall, it is a vibrant and engaging scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 532481, "caption": "According to the picture, a man is flying on a kite, while another person is riding a wave in the ocean. In the background, there are several homes along the waterfront, likely belonging to nearby residents or beachgoers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "surfboard", "kite", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 291634, "caption": "According to the picture, there is a large group of motorcycles parked in the middle of a busy city street. The motorcycles are lined up along the street, giving a sense of a community gathering. There are numerous motorcycles of different sizes and colors, some are standing tall and some are laid down on the street. Additionally, a parked bicycle is positioned at the far right edge of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463678, "caption": "According to the picture, the main focus of the scene is a young tennis player wearing a white shirt and white shorts, preparing to serve the ball in a tennis match. The player is standing on the tennis court and holds a tennis racket, getting ready to swing and make the serve. The image captures a moment of action and excitement in the game. Additionally, several other people can be seen in the background, suggesting they might be fellow players or observers in the event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "According to the image, a small dog is sitting on a sidewalk in front of bikes, with a leash attached to it. The dog is looking up at the camera, possibly waiting for its owner or curious about the environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "person", "truck", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 367872, "caption": " The scene is set in a baseball field, with a player standing on the playing field with a baseball bat in his hand, preparing to swing at a ball. There are several other people in the image, possibly teammates, spectators, or coaches. They are positioned along the outfield, around the bases, and within the playing area. One player is located close to the batter, and another is slightly further away. Others are scattered throughout the scene. The background of the image includes a building, possibly a dugout, and a bench nearby. Overall, the atmosphere of the scene suggests a competitive baseball game or practice.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["umbrella", "person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [104], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 353898, "caption": " The scene features a man sitting on a stone bench in a park surrounded by a large number of white pigeons. The birds appear to be gathered around the man, possibly attracted to the sound of a nearby bird cage, or foraging for food or other reasons. The man is engaged in an activity, such as reading or resting, while the birds seem to be content, scattered, and in a state of rest.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "cell phone", "backpack", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": "According to the picture, it presents a bowl of hot soup on a table. The soup is accompanied by a couple of slices of grilled cheese sandwiches, giving it a tasty, filling meal. One of the sandwiches has been topped with tomato sauce, adding an extra layer of flavor to the dish. The soup spoon can be seen in the bottom right corner of the image, ready for consumption. The scene appears to be an appetizing meal or snack.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "spoon", "bowl"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 463211, "caption": "According to the image, there are two children standing on a sidewalk, a young girl and a boy, both wearing backpacks. The girl is walking with her backpack in a sideways position, and the boy is walking with his backpack towards the back of the image. They appear to be going towards or walking past a school bus, possibly after getting off a school bus or getting ready to board it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": " The scene is a large room with a chandelier hanging from the ceiling, a paneled wall, and several tiled floors. A golden chair is positioned near the center of the room, surrounded by a variety of vases filled with various colors and sizes of flowers. A table is situated at the right side of the room, with a paneled wall on the opposite side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 98979, "caption": " The image features a black microwave sitting on top of a kitchen counter, with its door open. It is prominently displayed, taking up a significant portion of the kitchen scene. Several other appliances, such as a refrigerator and a sink, can be seen surrounding the microwave, indicating that it is placed in a kitchen setting. A car can be seen in the background, partially on the left side of the image.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["sink", "sink"], ["car", "car"]], "mscoco_gt_words": ["microwave", "knife", "oven"], "hallucination_idxs": [41, 44, 64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.3333333333333333}}, {"image_id": 244455, "caption": "In the image, two motorcycle riders are traveling down the road. Both riders are dressed in uniform, likely being police officers. One of them is on the right side of the image, while the other is more to the left. There are also a few other vehicles in the scene, with some cars positioned in the background and a truck further down the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 379443, "caption": "According to the picture, there is an image of a black mesh fabric with a pair of scissors and an orange pair of pinking shears sitting atop it. The pinking shears are also visible in the background, possibly resting on the surface under the black mesh fabric. The scissors and pinking shears are positioned near the center of the image, creating a distinct visual display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 536154, "caption": "A young girl is sitting on a bed surrounded by a large variety of stuffed animals, including teddy bears, elephants, and puppies. The girl seems to be enjoying herself as she has stuffed animals all around her. In the picture, there are 15 or more stuffed animals, with some in the background and others in the foreground. The teddy bears are placed in various locations, such as next to the girl and scattered around her. Some elephants can be found on either side of the girl, and a few puppies are situated closer to the left side of the image.", "mscoco_hallucinated_words": [["bed", "bed"], ["elephant", "elephant"], ["puppy", "dog"], ["elephant", "elephant"], ["puppy", "dog"]], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [7, 21, 24, 86, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.45454545454545453, "Recall": 0.6666666666666666}}, {"image_id": 374061, "caption": "According to the picture, there is a black cat lying on a couch and holding a stuffed animal, possibly a teddy bear, close to its chest. The cat and the toy occupy different parts of the image. The teddy bear can be seen in the bottom left corner, while the cat is in the center of the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "cat", "bed", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 560018, "caption": " According to the picture, several people are enjoying a day at the beach, riding the waves and enjoying themselves. There are about 12 people scattered throughout the image, which is mainly in the ocean and on the sand. A few of them are wearing surfboards and riding the waves, while the others are either standing in the water or close to the shore.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353320, "caption": " A large guitar case, possibly with luggage, is sitting on a bench outside of an airport. Nearby, there are two suitcases. The guitar case and one of the suitcases are on the right side, while the other suitcase is placed further to the left.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256628, "caption": " The scene shows a golden retriever dog lying on a wooden deck, resting comfortably and enjoying its surroundings. The dog appears to be very relaxed, possibly waiting for its owner to return. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 158806, "caption": " The image features a brown dog eating a sandwich on a blue plate. The dog's mouth is open, as it is in the middle of consuming the sandwich. There are two slices of bread visible on the plate, with one being closer to the dog's mouth. The scene also showcases another person, possibly the owner or a friend, in the background, standing behind the plate and watching the dog eat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 424270, "caption": "In the image, there is a young girl sitting on a bed in a room, possibly in a hotel or vacation setting. The girl seems comfortable and relaxed, sitting on the left side of the bed. The bedspread is slightly raised, and the girl appears to be surrounded by pillows.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": " The scene is set in a parking lot where a brown horse is walking along a parking spot between two cars. The horse is walking towards the cars, passing by them in a narrow space. The cars are of different makes, with one on the left and the other on the right. There is also another car, located further to the left, which might have been driving past the area. In the background, there is a truck, parked and ready to be used.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 483227, "caption": "According to the picture, a group of young children is gathered around a tennis court, participating in a tennis lesson or playing a game of tennis. They are wearing shirts and holding tennis rackets, while some are holding a tennis ball. The scene shows a fun and energetic atmosphere, where kids are learning and enjoying the sport.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 228306, "caption": "There is a large, red double-decker bus in the scene, with its front and back clearly visible. The bus appears to be parked on the street, possibly at a bus stop or designated parking spot. There is another, smaller bus behind the main bus in the background. The buses are on a busy street, with multiple other vehicles in the scene, including cars parked on the right side of the image and a few additional buses further down the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416384, "caption": "According to the picture, a young woman is sitting on a chair, reading a book. She is in the middle of the scene, surrounded by other people in the background.", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 364783, "caption": "According to the picture, a small, red bathroom has a white toilet and a red sink, with a mirror reflecting the image of the bathroom. The bathroom is clean and well-maintained, and there is a toothbrush hanging from the sink. The bathroom is small and designed for the sole purpose of a toilet and sink, with no shower or bathtub present in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "potted plant", "bottle", "toilet", "sink", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 335041, "caption": "The image features a man walking along a snow-covered sidewalk in the middle of a city street. The man is wearing skis and appears to be skiing down the sidewalk while navigating through a section of the street that has been covered with snow. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 531327, "caption": " The image features a white bowl filled with a variety of vegetables, including broccoli, carrots, green beans, and red peppers. The table where the bowl is located is a dining table, and there is a knife placed in the bowl as well.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot"], "hallucination_idxs": [5, 29, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.5714285714285714}}, {"image_id": 451228, "caption": "According to the image, a busy city street is filled with a variety of buses and a red double-decker bus. There are several buses on the street, including a group of buses on one side of the road, and another group of buses on the other side. The scene is very lively with several pedestrians walking in different parts of the street. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 256365, "caption": " The image shows a plate filled with multiple cupcakes, each topped with blue frosting and a swirl of white frosting. There are six cupcakes in total, and they appear to be positioned evenly on the white paper plate. Some of the cupcakes are placed closer to the edge of the plate, while others are more towards the middle. The frosting adds a creative touch, making the pastries look appealing and appetizing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "cup", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": "According to the picture, a man is playing tennis in a large courtyard, using his tennis racket and a tennis ball. He is standing at the net, ready to hit the ball back to his opponent. The courtyard is surrounded by buildings, creating an outdoor environment for the game. The man is focused and attentive, as he prepares to serve the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, there is a small black and white dog sleeping in a bed covered in various books and papers. The dog is curled up in the bed, possibly enjoying the comfort of being surrounded by the books and papers. A few other books are visible on shelves nearby, adding to the cluttered atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 51054, "caption": "According to the picture, an elderly woman is sitting in a bedroom, talking on a cell phone. She is sitting on a bed, with her hands holding the phone. The bed is made and there is a bed, indicating that the woman may be in a comfortable and private space. She appears to be enjoying herself while talking on the phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bottle", "cell phone", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 1268, "caption": "According to the picture, there is a park scene featuring a woman sitting on a bench next to the water, looking at a white swan swimming in the water. The woman has a handbag close to her. There is also a bridge in the background, with a few people walking around or under it.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["handbag", "cell phone", "backpack", "bird", "boat", "person"], "hallucination_idxs": [16], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5}}, {"image_id": 62692, "caption": " In the image, a young woman is playing tennis on a court. She is in the middle of the court, holding a tennis racket and preparing to swing at the approaching tennis ball. Her hair is tied in a ponytail, creating a dynamic look in her hair. There are also two tennis balls in the scene, one closer to her and the other farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": "The image features an old-fashioned black and white photograph of a sitting room with a single chair in the middle, surrounded by various items. There are two vases displayed, with one placed close to the chair and the other on the right side of the room. Two flower pots are also visible, one on the left side and another close to the vase in the right corner.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 575406, "caption": " The image shows a small red and white parked on the street in front of a building, surrounded by a variety of vehicles. There are several cars in the background, including some parked closely to the bus, a truck near the left side, and another car on the right side. In addition to the bus, there is also a taxi and a motorcycle present in the scene.", "mscoco_hallucinated_words": [["car", "car"], ["truck", "truck"], ["car", "car"], ["taxi", "car"]], "mscoco_gt_words": ["motorcycle", "bus", "potted plant", "person"], "hallucination_idxs": [28, 42, 50, 66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.8, "Recall": 0.25}}, {"image_id": 244834, "caption": "According to the picture, two people, a man and a woman, are walking down a sidewalk in the rain, each holding umbrellas. They are standing next to each other and appear to be together, possibly walking to their destination or just out for a stroll. A couple of cars can be seen in the background, but they seem to be less prominent than the individuals walking on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "clock", "handbag", "umbrella", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 151159, "caption": " In the image, a group of people, including two skateboarders, are hanging out in an outdoor park. The skateboarders are positioned next to each other on the left side of the park, both on their skateboards. There are at least three other people in the scene, standing in various locations around the park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277073, "caption": " The image shows a busy city street with several people walking and interacting with each other. One person is sitting on a motorbike, a man and a woman, who appear to be riding together. There is another person standing next to the motorbike, close to the traffic light. Apart from them, there are four more people in the street, one close to the traffic light and the other three scattered in different parts of the scene. One person can also be seen holding a cell phone near the edge of the image.", "mscoco_hallucinated_words": [["traffic light", "traffic light"], ["traffic light", "traffic light"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["bicycle", "motorcycle", "car", "backpack", "person"], "hallucination_idxs": [51, 71, 93], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 377879, "caption": "According to the picture, a large military tank is being transported on train tracks. The tank is on a metal frame, placed on top of the train cars. The trains are parked and seem to be waiting to move. This image showcases the transporting of heavy military vehicles, which is essential for logistics and military operations.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["truck", "train", "bed"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": " According to the picture, there is an office space containing a laptop and a chair. The room is cluttered with papers and books, while other office supplies, such as a bottle, can be found in the background. The office space is located in front of a door, which can be seen in the image. Additionally, there is a computer front of the desk, making it easy to work on various tasks.", "mscoco_hallucinated_words": [["book", "book"], ["bottle", "bottle"]], "mscoco_gt_words": ["laptop", "mouse", "chair", "dining table"], "hallucination_idxs": [24, 34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 63934, "caption": " According to the picture, there is a person in black garb, dressed as a nun, skateboarding on the sidewalk in front of a building. The person is located near the middle of the scene, with several cars parked around them in different areas of the parking lot. There are also a few people visible in the background, potentially enjoying a sunny day outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 275611, "caption": "According to the picture, there is a delicious looking pizza placed on a metal tray, presumably ready to be served or consumed. The pizza appears to have numerous toppings, including cheese and vegetables, making it a tempting option for anyone in the vicinity. Additionally, the dining table upon which the pizza is sitting serves as a background element. The image captures the essence of a tantalizing meal, making it an appealing sight.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "The image depicts a person sitting in front of a computer, typing away on a keyboard while using a mouse. There are multiple computer components in the scene, including a keyboard, a computer mouse, and a laptop situated at different locations around the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": "A large group of sheep is herded along a road by a bus, with the bus occupying a middle position between the front and back of the herd. There are several sheep visible near the bus and some farther in the distance. The scene appears to be on a narrow road or a path, with a mix of open and enclosed spaces nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sheep", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 422326, "caption": "According to the image, there are three motorcycles and a sidecar parked next to each other in a parking lot. The three motorcycles include a black motorcycle, a red motorcycle, and a silver motorcycle, while the sidecar is a silver color. The motorcycles are positioned close to each other, creating a lively scene in the parking lot. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 110449, "caption": "According to the picture, there are two young men in the image. One man appears to be the DJ, while the other man is possibly a bartender or a guest. The DJ is sitting down, with his arms outstretched in front of him, while the bartender is standing next to him. Both men seem to be attending to their duties in a club or bar setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "dining table", "laptop", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 45550, "caption": " The image features a man posing for a picture while holding a plate with fried chicken on it. The man appears to be smiling, enjoying the moment of having his picture taken. The scene takes place in a restaurant with multiple tables and chairs set around it. There is another person, or at least a second individual in the scene, standing in front of one of the tables.", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"], ["table", "dining table"]], "mscoco_gt_words": ["sandwich", "clock", "person", "bowl"], "hallucination_idxs": [44, 46, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 444913, "caption": "According to the picture, there is a man standing in a grassy area, possibly a park or backyard, holding a baseball bat. He is standing near a tree and has his back to the camera, looking in the direction of the tree. The grass is slightly scattered around the man, possibly indicating a recent activity in the area.", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["frisbee", "person", "car"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": " The image depicts a black and white dog in motion, leaping into the air to catch a pink frisbee, with a group of people gathered around to watch the event. The dog is standing in a field, with the frisbee flying towards him, and the people are dispersed across the field, observing the playful activity. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dog", "bottle", "frisbee", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 132796, "caption": "According to the picture, a group of people are riding on the backs of elephants. There are two elephants present in the image, with people seated on their backs and enjoying the experience. One person is on the left side of the first elephant, while another is on the right side. On the second elephant, two people are seated close to each other on the right side. All individuals appear to be happy and enjoying the elephant ride through the forest or jungle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "elephant", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "According to the picture, there are two cats sitting on a toilet, looking at the camera as they pose together. The cats are placed on the toilet bowl, with one cat located closer to the front edge and the other cat positioned further back. The toilet is white and located in a bathroom setting, surrounded by various colorful decorative items and a potted plant. The two cats appear to be relaxed and comfortable while posing for the camera.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [30, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": "According to the picture, there is a group of motorcyclists traveling down a tree-lined street. Several riders are visible, with their bikes in front of them. Some of the motorcycles appear to be traveling in a line, while others are trailing behind. There are multiple cars parked along the sides of the road. The motorcycles appear to be traveling through the neighborhood.", "mscoco_hallucinated_words": [["bike", "bicycle"]], "mscoco_gt_words": ["truck", "motorcycle", "car", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 205193, "caption": " In the image, there is a toaster oven on the left side with a bag of black tea and a cup on top of it. The cup is full of hot tea. A brown sugar can be seen next to the toaster oven as well. A mixture of green tea and black tea is prepared in a beaker in front of the stove, which appears to be full of hot water.", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"]], "mscoco_gt_words": ["toaster", "microwave", "oven", "person"], "hallucination_idxs": [21, 28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 272566, "caption": "The image features a man sitting on the ground with a bunch of bananas in front of him. He is surrounded by various baskets full of fruits, including bananas, apples, and oranges. Some of these baskets are placed on the side and one is positioned behind him. There is also a bottle placed on the side near the man. The scene has a lively atmosphere with people walking past, including a person with a handbag.", "mscoco_hallucinated_words": [["orange", "orange"], ["bottle", "bottle"], ["handbag", "handbag"]], "mscoco_gt_words": ["cup", "apple", "person", "banana"], "hallucination_idxs": [35, 57, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.75}}, {"image_id": 170099, "caption": "According to the picture, a man stands in front of a living room couch, chairs, and other furniture. The man is wearing a suit and a tie, giving a speech or presenting in front of an audience. Another person can be seen standing close to the man, and there are also two cups visible on the couch.", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"]], "mscoco_gt_words": ["tie", "orange", "person", "couch"], "hallucination_idxs": [16, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.75}}, {"image_id": 14940, "caption": "According to the picture, the scene features a cat and a stuffed animal cuddling together on a bed. The cat appears orange and white and the stuffed animal is a small, brown one. They both seem to be comfortably resting on the bed, enjoying each other's company.", "mscoco_hallucinated_words": [["bed", "bed"], ["bed", "bed"]], "mscoco_gt_words": ["dog", "cat", "orange", "chair", "couch"], "hallucination_idxs": [18, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 408449, "caption": "In the image, a young man is riding a skateboard down a city street, passing an empty fountain nearby. There are several other people present in the scene, dispersed around the street. The street is lined with tall buildings on both sides, creating a distinct cityscape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "bus", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 140702, "caption": " In the image, a baby elephant, with its mother walking alongside it, is the main focus of the scene. The baby elephant is standing close to the left edge of the frame, while the adult elephant is located closer to the center of the scene. There are a couple of other objects in the image, including a bird that is positioned on the right side and a bottle that is towards the upper part of the scene.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bird", "person", "elephant"], "hallucination_idxs": [74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 354540, "caption": "According to the picture, a man is sitting on a kitchen counter, likely eating or drinking while using a laptop. There are various cables and electronic equipment surrounding him, with a laptop placed on a wooden surface near him. A few other items are also in the kitchen, including a cell phone, multiple bottles, a microwave oven, and a chair. The kitchen itself has wooden surfaces and multiple chairs. There are also a couple of knives and a bowl present in the scene.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["knife", "knife"], ["bowl", "bowl"]], "mscoco_gt_words": ["toaster", "oven", "laptop", "cell phone", "microwave", "chair", "person"], "hallucination_idxs": [60, 86, 89], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.8571428571428571}}, {"image_id": 19763, "caption": "According to the picture, there is a close-up image of a piece of deep-dish pizza sitting on a white plate. The pizza is covered in cheese and appears to be a delicious piece of food. The plate is surrounded by two glasses of beer, suggesting that the pizza is being enjoyed in a social setting. The dining table is a significant element in the scene, as it takes up most of the space, extending to the left and right of the plate. The scene appears to be from a restaurant or eatery, as the pizza, beer, and dining table suggest a culinary experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 191301, "caption": "According to the picture, there are two people sitting at a table, enjoying their desserts. One of them is holding a spoon and eating a chocolate cake, while the other woman is eating an ice cream cone. They appear to be in a restaurant, as there is a cup and an additional utensil on the table. A second spoon is also placed in front of the woman, and there is a bottle near the far end of the table. The table itself is made of wood, and two chairs are located next to the table, suggesting a dining setup.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [79, 98], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 124236, "caption": "The image features a small, dimly lit kitchen with appliances such as a microwave, an oven, and a sink. The kitchen also has an assortment of pots and pans arranged on two tables. A few bowls are scattered on one table, while the other contains multiple knives, some of which are leaning to the right. The kitchen is lit by sunlight coming from the left side, making it appear brighter compared to the rest of the room. Overall, the space appears to be a typical residential kitchen, possibly in the process of being prepared for cooking or cleaned up after cooking.", "mscoco_hallucinated_words": [["microwave", "microwave"], ["sink", "sink"], ["knife", "knife"]], "mscoco_gt_words": ["toaster", "oven", "dining table", "bowl", "spoon"], "hallucination_idxs": [14, 21, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6}}, {"image_id": 344930, "caption": "According to the picture, there is a red fire hydrant situated on the sidewalk, close to the edge of a street. It is surrounded by green grass, indicating that the area is possibly in a residential neighborhood. In the background, a car is parked, which could be the reason for the fire hydrant's presence. There are also a few other vehicles visible in the scene, including a smaller vehicle parked a bit further away from the fire hydrant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "orange", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " The image captures a white, fluffy cat lying on a bed in a room. It appears to be a bedroom, with a small TV placed towards the left side of the room and several books spread throughout the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "cat", "chair", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 82488, "caption": "According to the image, there is a room with two beds, a lamp, and a clock on the left side. Both beds are similar in size, but one has an orange blanket while the other has a light brown blanket. There is a painting or photograph on the wall behind the beds. The room also features an orange chair and a clock, and there is a window with drapes at the top of the scene.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["remote", "clock", "dining table", "orange", "book", "bed"], "hallucination_idxs": [65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.5}}, {"image_id": 348251, "caption": "According to the picture, a person, dressed in a green jacket, is walking down a ski slope. He is wearing skis and carrying a backpack, likely to ski or snowboard. The scene is captured with snow surrounding the skier and the buildings in the background. The presence of other people on skis is indicated by several pairs of skis nearby, scattered throughout the image. There are two pairs of skis visible in the foreground, and at least one other pair is positioned further back in the scene.", "mscoco_hallucinated_words": [["snowboard", "snowboard"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 1.0}}, {"image_id": 150703, "caption": " According to the picture, there is a close-up image of a delicious Chicago-style hot dog, wrapped in a bun and served in a dish. The hot dog is placed next to each other on a table, creating a combination that those who enjoy both hot dogs and beer may find appealing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "cup", "hot dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 381615, "caption": "According to the picture, a large window in someone's home is filled with various lush green plants. There are several plants in different spots, creating a vibrant display. The plants are placed on a table, and a chair is also present. The window provides an attractive view the plants, allowing the person to enjoy the scenery.", "mscoco_hallucinated_words": [["table", "dining table"], ["person", "person"]], "mscoco_gt_words": ["cup", "potted plant", "vase", "book", "chair"], "hallucination_idxs": [39, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.2}}, {"image_id": 352703, "caption": " In the image, a woman is standing in front of a dining table preparing a dish. She is wearing an apron and holding a bowl in her left hand while stirring the contents in the bowl with a spoon. There are several chairs surrounding the dining table, one of which has a bottle resting on it.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "handbag", "oven", "dining table", "bowl", "spoon", "bottle", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.625}}, {"image_id": 111109, "caption": "The image shows a young boy dressed in a white shirt, standing in a room with a chair nearby. He appears to be posing for the camera, looking confident and smiling. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "tie", "motorcycle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 226802, "caption": "The scene takes place on a rainy city street with a large red \"Xing\" sign in the center. There is a bus, or a red double-decker bus, parked on the right side, partially blocking the view. Several people can be seen walking and standing along the street in different areas. They seem to be trying to navigate through the city, likely during a wet and gray day. The atmosphere is gloomy, and many pedestrians are wearing umbrellas, indicating a rainy weather.", "mscoco_hallucinated_words": [["umbrella", "umbrella"]], "mscoco_gt_words": ["motorcycle", "bus", "person", "car"], "hallucination_idxs": [87], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 49005, "caption": "According to the picture, the image shows a baseball game underway at a field with several players on the field. The main focus of the scene is a pitcher standing on the mound, pitching a ball. There is a catcher positioned behind home plate, watching the pitcher and waiting to catch the incoming baseball. ", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": " In the image, there is a large group of cows walking down the side of the road. They are spread out over a wide area and appear to be in motion, with some animals closer to the road's edge and others in the middle. A couple of people are walking along the road, perhaps watching the cows or enjoying the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 100274, "caption": "According to the picture, the scene is an empty train station or rail yard, with railroad tracks running through the center and several train cars scattered around the area. There are several railcars parked on the ground, with some placed close to the tracks and others at a bit of a distance. The image depicts an empty, somewhat quiet and unused environment, with numerous trains present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 317254, "caption": "The image captures a person riding a black motorcycle down a city street, wearing a mask. There are two cars in the scene, one white and the other one is smaller, also white. The person is positioned behind the motorcycle, while the cars are located at the left and right sides of the scene. The street appears to be empty, and the scene is a blur due to the person's motion while riding the motorcycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": "According to the image, there is an empty, large, shiny kitchen with various pieces of equipment. Several silver refrigerators are visible in the scene, located along the edges of the kitchen. There are also silver stoves and several sinks in the kitchen, dispersed throughout the room.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 28842, "caption": "The image features a park area where a skateboarder is attempting a difficult ramp and a second person is nearby, possibly waiting or watching the skateboarder. The skateboarder is using the ramp located in the center of the scene, while the person next to them is standing further away to the right. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 319607, "caption": "The image features a busy street with multiple traffic lights, several tall buildings, and a person on a bike. The road intersects with other streets in the urban area, and several cars can be seen on the streets and near the buildings. People are walking on the sidewalk, contributing to the busy and active atmosphere in the area. The overall scene appears to be a typical city setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 210372, "caption": "According to the picture, it depicts a small red kitchen with white cabinets and a sink, a small window towards the upper part of the image, and a refrigerator or similar appliance at the side of the sink. Various dishes can be seen in the sink, creating a cluttered and dirty appearance in the kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 156915, "caption": "According to the picture, a single red rose is sitting in a glass vase. The rose is located near the middle of the arrangement, accompanied by white and green flowers, creating a visually appealing bouquet. The arrangement is displayed on a wooden table, adding a touch of elegance to the room. The vase is placed slightly off-center, and the flowers can be seen both in the foreground and the background of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 349642, "caption": "According to the image, a man is sitting on a bench next to his dog, who is sitting next to him, in a park-like setting. The man appears to be wearing a dark brown hat, while the dog is a small brown one. They seem to be enjoying some time together, possibly taking a stroll outdoors. There are also several other benches in the scene, though they are not occupied.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": "According to the picture, there is a vibrant gathering of people and boats at the water's edge. The boats are spread across the scene, some larger and some smaller, with an array of umbrellas set up among them. The people in the scene appear to be engaged in conversation, enjoying the atmosphere at the water's edge. There are also several tables present at the location, likely for the group to gather around and enjoy their time together.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["suitcase", "handbag", "umbrella", "boat", "person"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6}}, {"image_id": 45435, "caption": "According to the picture, there is a person holding a cell phone in their hand. The phone displays a white and black screen, which can be seen from the image. The person appears to be holding the cell phone in one hand and possibly checking it or working with it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": "According to the image, there is a man sitting at a desk surrounded by various electronic devices such as a computer, two laptops, and a telephone. He appears to be typing on a keyboard while wearing a white shirt. The man is in the process of working, likely sitting at a desk working on his computer. There is another laptop positioned further away from the first, which seems to be connected to a TV.", "mscoco_hallucinated_words": [["telephone", "cell phone"], ["keyboard", "keyboard"]], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [28, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.5714285714285714}}, {"image_id": 72656, "caption": "According to the picture, a small outdoor gathering is taking place with two children sitting at a small wooden table with chairs, enjoying their lunch or snacks together. The wooden table has colorful chairs, and they are sharing a meal or playing together, smiling and having fun. On the table, there is a plate with food on it, and another plate nearby, indicating the children are enjoying food. A person is also present in the scene, perhaps a parent or another adult companion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "suitcase", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 372087, "caption": "According to the picture, there is a green train on the tracks traveling towards a station. The train appears to be parked at the station, waiting for passengers to board or disembark. There are several people visible in the scene, some standing near the train and others walking along the platform.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "suitcase", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 336209, "caption": " In the image, a young man is performing a trick on a skateboard, flying through the air while jumping over a bench. The man appears to be enjoying the action and is the main focus of the scene. There are two benches present in the image, one in front of the skateboarder and another further in the background. Additionally, a handbag can be seen in the left side of the image.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [67], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 46031, "caption": "According to the picture, there is a laptop sitting on a desk, with a mouse and a keyboard in front of it. The desk is also surrounded by several plants, including a couple of different species. In addition to the laptop and peripherals, there are a couple of monitors set up with screens, and another computer positioned atop the desk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 283118, "caption": "According to the picture, a family of three is sitting at a table in a restaurant, enjoying their meal together. The little girl is smiling, and the father looking at her, possibly taking a photo of her. They are sharing a meal with hamburgers, creating a happy and memorable dining experience. The father and the child also have a bottle on the table, suggesting they are enjoying their time together.", "mscoco_hallucinated_words": [["hamburger", "sandwich"]], "mscoco_gt_words": ["pizza", "handbag", "dining table", "bottle", "cell phone", "car", "backpack", "chair", "person"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.3333333333333333}}, {"image_id": 429598, "caption": " According to the image, there is a kitchen with vintage-style wooden cabinets, a stove, a refrigerator, and a sink. The sink is in the middle of the kitchen area, while the stove is on the left side. A refrigerator is placed to the right of the sink, and the wooden cabinets are above the stove and to the right side. An old-fashioned wooden table is also present in the kitchen, with its top area near the center of the room.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sink", "oven", "refrigerator", "bottle"], "hallucination_idxs": [72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 333365, "caption": "According to the picture, there are two airplanes parked next to each other at an airport. One of the airplanes is red, and the other is white. They are positioned on a runway, which is likely to be a taxiway. The airplanes appear to be large and occupy a significant portion of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 424174, "caption": "The image depicts a large plate filled with a variety of fruit, including bananas, apples, oranges, and tangerines, all arranged together in a visually appealing manner. The table is wooden, and the plate containing the fruit is on top of it. The arrangement of the different fruits creates a natural and appetizing scene, highlighting the contrasting colors and shapes of the different fruit types.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "apple", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 3035, "caption": "The image depicts an airport tarmac with two large airplanes parked on the ground. One airplane is closer to the viewer, while the other one is further away in the scene. Various airport equipment can be observed in the background, including at least four cars (three passenger cars and one truck). Additionally, a couple of trucks are scattered around the scene, providing services to the parked airplanes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 177213, "caption": "According to the picture, there is a white plate with a cheese pizza on it sitting on a table. It appears to be the main focus of the image. The table is covered with a tablecloth, giving it a pleasant, elegant ambience. A man is sitting next to the table, on the left side, and he is holding a glass of wine, possibly enjoying the pizza and his drink.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 438535, "caption": "The image depicts a large city street with a large clock displaying time in the middle of the street, sitting on a black pole. Surrounding the clock, there are many cars parked on the street and a few are in the near vicinity. The clock is facing a busy traffic intersection with tall buildings visible in the distance. It appears that this particular street has a few pedestrians moving around, with some closer to the clock and others scattered further away. The scene is a busy city street, highlighted by the clock and the presence of vehicles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 490711, "caption": "The image displays a kitchen with white appliances, including a refrigerator on the left side and a sink on the right side. In the middle, there is a stove surrounded by white cabinets. There are two bottles visible, one near the refrigerator and another closer to the middle of the kitchen.", "mscoco_hallucinated_words": [["stove", "oven"], ["bottle", "bottle"]], "mscoco_gt_words": ["sink", "refrigerator", "banana"], "hallucination_idxs": [31, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "According to the picture, there is a small black and white kitten sleeping on top of a table. It seems to be a domestic scene where the kitten is napping while also close to a mouse. There is another object, possibly a backpack, lying nearby. This scene suggests that the kitten is enjoying its nap while also having the mouse close by.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["cup", "dining table", "laptop", "mouse", "cat"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6}}, {"image_id": 456756, "caption": " The image features a large giraffe standing in a park or enclosed space, likely a zoo or a wildlife exhibit. The giraffe appears to be enjoying its surroundings and taking a break. In the background, there are other animals visible, including a few zebras and several birds. These animals seem to be in a separate enclosure or area, but their presence makes the scene more diverse and engaging. Overall, it's a beautiful and majestic scene with a mix of giraffe and other animals.", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": "According to the picture, there is an image of a train interior featuring a pair of empty blue chairs placed against the wall and a pair of larger chairs arranged near the windows. The seats are designed to provide a comfortable space for passengers. The image suggests that the train could be a commuter vehicle or a modern train, as it features modern interior design and the chairs are situated in a way that provides ample view of the outside.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "chair", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 343937, "caption": " According to the picture, it features a young woman wearing a black and white snowboard standing on a snowy slope, preparing to ride down the slope. She is wearing a hat, indicating her readiness to navigate down the hill. There are also a few additional people in the image, spread out at various places.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": " The image features two small birds, one on the left side and the other on the right side, standing on a table in the center of the scene. They appear to be eating a piece of food on a plate placed on the table, which could be pizza. The scene also shows several bottles of wine or other liquids positioned around the table. One bottle is located at the upper left corner, while another is positioned in the middle of the table, and a third bottle can be seen at the right side of the table.", "mscoco_hallucinated_words": [["pizza", "pizza"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "bird", "chair"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.6}}, {"image_id": 145666, "caption": "According to the picture, the image depicts a large bowl filled with fresh vegetables on a stainless steel countertop. There is a mixture of various vegetables including broccoli and carrots. The broccoli and carrots are arranged in a neat, organized manner, making it easy to identify them. The vegetables are likely to be cooked in a pan or on a stove top. The bowl is quite full with the various vegetables, taking up a significant portion of the image.", "mscoco_hallucinated_words": [["stove", "oven"]], "mscoco_gt_words": ["bowl", "carrot", "broccoli", "dining table"], "hallucination_idxs": [66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 304984, "caption": "The image showcases a meal consisting of various food items placed on a table. A sandwich and a serving of pickled onions are the main dishes on a white plate, accompanied by a bowl filled with orange juice. There are also some carrots placed on the table, along with a spoon and a fork. ", "mscoco_hallucinated_words": [["table", "dining table"], ["carrot", "carrot"], ["table", "dining table"], ["spoon", "spoon"]], "mscoco_gt_words": ["bowl", "sandwich", "cake", "fork", "orange", "wine glass"], "hallucination_idxs": [13, 45, 49, 54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 122467, "caption": "The image depicts a person, possibly a child, holding a video game controller in one hand, with the other hand placed on top of the controller. There is a white background surrounding the person, making it easier to focus on the controller and the hand. The controller appears to be a remote for a video game system, such as a Nintendo Wii.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 518287, "caption": "The image features a white toothbrush with a big smile, as if it is enjoying a funny moment. The toothbrush is placed inside a cup or container, likely made to store it when not in use. The smile is quite prominent, and it's surrounded by a blue color that is not too dissimilar from the color of the toothbrush's handle.", "mscoco_hallucinated_words": [["container", "bowl"]], "mscoco_gt_words": ["toothbrush", "cup", "orange", "dining table"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 145448, "caption": "According to the picture, a black cat is sitting on a motorcycle, taking a rest. There are other items present, such as a mug in the background, a bottle near the left edge, and a backpack slightly right of center. There are also two people visible in the scene, likely enjoying their time around the motorcycle. The cat seems content as it lounges on the motorcycle, providing an interesting scene for all those present.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["backpack", "backpack"]], "mscoco_gt_words": ["dining table", "motorcycle", "cat", "car", "chair", "person"], "hallucination_idxs": [33, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 352900, "caption": " The image features a plate of pasta topped with a variety of ingredients. A generous amount of broccoli is spread across the plate and mixed with various pasta noodles. The noodles are visible throughout the dish, creating an aesthetically appealing dish. The plate is placed on a dining table, indicating it is being served or enjoyed as a meal.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "fork", "knife", "broccoli"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 153607, "caption": " The image features a blue glass filled with water, and inside the vase, there is a flower. The flower is placed on top of the water and appears to be floating. There is also an additional glass with an orange inside it, which could be considered a flower or an ornament. The arrangement of the two glasses and the orange one is symmetrical, adding an interesting and creative touch to the overall image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 157581, "caption": "According to the picture, there is an array of wine bottles lined up on a dining table, showcasing various wine types. The bottles are arranged horizontally on the table, with one bottle in the center of the display and several other bottles surrounding it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 227438, "caption": "The image showcases a large, luxurious bedroom with a comfortable bed. The bed is situated at the center of the room, positioned against a wall. A chair sits close to the bed, creating a cozy atmosphere for rest and relaxation. The bedroom features a large window, drawing attention to its natural lighting, and a nightstand can be seen on the side of the bed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "couch", "bed", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 116037, "caption": "According to the picture, a young woman is participating in a softball game. She is playing at the plate and has a bat in hand. The game is being played in a field, with a large crowd watching from the stands. Many people, both young and old, are in various positions around the field, focused on the action taking place at the plate.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 202658, "caption": "According to the picture, a white toilet is sitting next to an unknown object in an open area. The white toilet has a pink character on it. It is not certain if this character is associated with a particular brand or product.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "chair", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 18150, "caption": " According to the picture, it appears to be a family setting with a young boy and a man sitting on a bed, possibly in a living room. The boy is eating a piece of pizza, while the man is sitting next to him and eating some as well. The image captures a relaxed, intimate moment between the two individuals.", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["pizza", "bottle", "backpack", "person", "couch"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 418711, "caption": "According to the picture, the scene depicts a man riding a bicycle on the street, carrying a large basket of bananas on the back of the bike. The bananas appear to be a significant amount, potentially going to market or to sell. The scene captures a man with the basket of bananas riding his bicycle while surrounded by various objects in the surrounding area, including a bottle in the center of the scene and a traffic light, contributing to the image. There is also a traffic sign nearby, potentially indicating the location of the street and its purpose. In addition to the main man with the bananas, there is another person visible in the background of the image, possibly a bystander watching the event unfold.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["traffic light", "traffic light"]], "mscoco_gt_words": ["truck", "bicycle", "car", "person", "banana"], "hallucination_idxs": [72, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image shows a bowl of soup with corn and a spoon next to it on a table or surface. The soup has a light, creamy appearance, and there is corn visible on top of it. A slice of bread is visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 491000, "caption": " A group of people are walking down a street in a city, possibly in Latin America. There are three adult men walking on the sidewalk, and another person can be seen walking nearby. The street is lined with old buildings, some of which have the same color as the clothing of the people, likely a brown color.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 401163, "caption": "According to the picture, two men are sitting on a bench in a park, surrounded by trees and greenery. One of the men is elderly and is enjoying the park's scenery while sitting on a bench with his dog by his side. The other man is also enjoying the park, likely resting and taking in the peaceful atmosphere. The presence of the dog creates a comfortable and welcoming environment for the two men.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": "According to the image, there is an arrangement of various items on a floor, including a backpack, notebooks, pens, pencils, and a cell phone. These items appear to be neatly placed and organized. There are three notebooks arranged in a row, with the middle one located slightly closer to the cell phone. Additionally, there are two pens, one on the right side and the other positioned near the cell phone. A pair of scissors is also situated towards the left side, along with a bottle further back. The cell phone is situated towards the center of the image. Overall, the image suggests a well-organized and tidy layout, indicating that the items are not scattered around the room, but rather kept in an orderly manner.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["laptop", "book", "cell phone", "backpack", "scissors"], "hallucination_idxs": [98], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.8}}, {"image_id": 268293, "caption": "The image shows a young man playing tennis on a court, with a tennis racket in his hand. He is swinging a tennis racket and has a ball in front of him, possibly getting ready to hit it or after a successful serve. The scene captures the movement of the player as he focuses on the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": "According to the picture, there is a kitchen with a white refrigerator, a dishwasher, and a stove. Various items are scattered throughout the kitchen, including a bottle of water, a glass bottle, and a jar filled with ketchup. There is also a wine bottle on the counter and multiple bottles in the oven. Several wine glasses are also present, one located near the refrigerator, and another near the wine bottle. A few other bottles are positioned around the kitchen, with one closer to the stove and another near the dishwasher. The overall scene appears to be a well-stocked kitchen with a variety of food and drink items.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["wine glass", "wine glass"], ["refrigerator", "refrigerator"]], "mscoco_gt_words": ["clock", "oven", "spoon", "bottle", "vase", "apple", "sink"], "hallucination_idxs": [12, 63, 73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2857142857142857}}, {"image_id": 358086, "caption": "The image features a group of people, likely playing soccer together, with several men on the field and a few more spectators watching the game. The players are spread out across the field, with some in the foreground and others in the background. The focus of the image is a man jumping up to catch a frisbee that is being tossed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "frisbee", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 89541, "caption": "In the picture, there is a yellow container filled with various types of food, including tofu, mushrooms, and a variety of fruits such as bananas, oranges, and strawberries. The fruit is spread out across the dish, creating a colorful mix. The food is arranged in different areas of the container, with some items closer to the center and others along the edges. The arrangement creates a visually appealing and diverse selection of food.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["bowl", "dining table", "cake", "banana"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 239445, "caption": " In the picture, there is a young man with a frisbee in his hands, posing for a photo. He is smiling and appears to be happy while holding the frisbee. There is another person in the background, but the focus is mainly on the smiling man. The scene seems to take place at an outdoor setting with benches in the background, one of which is visible on the right side of the image.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["frisbee", "sports ball", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": "According to the picture, there is a young kangaroo standing on a grassy field. The kangaroo appears to be enjoying its time in the grassy area. In the background, there is a car parked on the left side of the field and a few more vehicles can be seen in the distance, along with a person walking on the right side of the image.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 487990, "caption": "The image depicts a group of people seated at a picnic table, surrounded by various food items. There are several individuals, including a young woman, who is holding a guitar, playing a song. The guitar is placed at the center of the table. There are a few bowls and cups scattered around the table, along with some bottles and a few other food items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cake", "fork", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 138834, "caption": "A person is walking through the snowy terrain while holding ski equipment. He is wearing skis and a backpack, and there is a car parked in the background. The man appears to be on his way to or from skiing. The image captures a person walking with skis while enjoying the outdoors.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 492057, "caption": " According to the picture, a man is taking a selfie in a public restroom, wearing a tie and smiling. The reflection of the man is visible in a mirror behind him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388215, "caption": "According to the picture, a young man in a white shirt is playing a tennis match, holding his tennis racket as he serves the ball. The man is on a tennis court, possibly practicing or playing a game with his friends. In addition to the main player, there are a few other people in the background, possibly observing the game or engaging in other activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 111409, "caption": "According to the image, a man is riding a motorcycle on a cement surface, possibly participating in a motorcycle race or enjoying a leisurely ride. He is wearing a helmet and riding a white and orange motorcycle, and he appears to be enjoying riding the motorcycle. A handbag is placed on the ground, and there are a couple of bottles visible on the ground as well, possibly belonging to the man or other individuals in the scene.", "mscoco_hallucinated_words": [["orange", "orange"], ["handbag", "handbag"], ["bottle", "bottle"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [38, 52, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": "The image features a person holding a large bouquet of green broccoli in front of them. The broccoli has numerous florets in different stages of growth and is quite substantial. It seems to be grown in a greenhouse, as it is a vibrant green color and is surrounded by various fruits and vegetables.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92462, "caption": "According to the picture, a group of cows is walking down a dirt road, likely on their way to graze or get some exercise. The cows are of various sizes, with some closer to the camera than others. There are five cows visible in the scene, four of them in the middle and one near the right side of the image. Some of the cows have a white and black pattern on their fur.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cow", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 397268, "caption": "According to the picture, there is a storefront with bicycles and chairs displayed outside. The bicycles appear to be on display, with one in the foreground and another in the background. The store is located in a busy city, where people passing by can see the bikes.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["bicycle", "chair", "potted plant", "bench"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 127659, "caption": "The scene depicts a train, which appears to be a commuter or regional train, on the tracks as it travels on the left side of the image. Two people are present in the vicinity, with one person standing near the center and the other walking closer to the right side. Both individuals are focused on a single bench near the train, possibly waiting for the train to stop. A second bench is placed further in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 251857, "caption": "According to the picture, there is a group of young men in a skate park, enjoying a sunny day. There are three men standing in the park, with two of them wearing striped outfits. Among them, one man is on a skateboard, enjoying a skate session.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 243212, "caption": "According to the picture, a young boy is standing in a grassy field holding a blue frisbee. There are several vehicles parked outside the field, likely in a parking lot or driveway.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "frisbee", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 444304, "caption": "According to the picture, there is a man sitting on a couch, watching a television with a remote control in his hand. Near the man, there is a chair, and to the right of the TV, there is another chair. The bed seems to be in the background. The man is wearing a baseball cap, and there is another person in the background, sitting on a chair to the right of the man on the couch.", "mscoco_hallucinated_words": [["couch", "couch"], ["chair", "chair"], ["chair", "chair"], ["chair", "chair"], ["couch", "couch"]], "mscoco_gt_words": ["remote", "person", "bed", "tv"], "hallucination_idxs": [12, 32, 45, 76, 85], "metrics": {"CHAIRs": 1, "CHAIRi": 0.35714285714285715, "Recall": 1.0}}, {"image_id": 305000, "caption": "According to the picture, there are various fruits and vegetables on display at a produce market. A woman stands at the market, surrounded by an array of produce. She is wearing a black and white striped shirt and black pants. Several fruits, such as apples, oranges, and grapes, are scattered throughout the market, with some placed on tables and others scattered across the ground. A bowl can be seen among the fruits, as well as multiple carrots arranged around the market. The woman is engrossed in what she's looking at in the market, possibly examining the variety of produce available.", "mscoco_hallucinated_words": [["orange", "orange"], ["table", "dining table"], ["bowl", "bowl"], ["carrot", "carrot"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [52, 67, 76, 88], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.6666666666666666}}, {"image_id": 44136, "caption": " The image shows two men dressed in tuxedos seated at a table while attending a formal event. Both of them are smiling and seem to be enjoying themselves. One of the men is holding a beer bottle, and the other man has a wine glass with him. They are surrounded by several wine glasses on the table, as well as dishes containing food. The setting appears to be a dining or conference area, and there are several chairs in the background.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.8}}, {"image_id": 154846, "caption": "The image features a metal pan filled with several types of pizzas. The pizzas appear to be sliced and arranged across the pan in a variety of sizes. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 567254, "caption": " According to the picture, the scene features a young girl sitting in front of a TV, watching a show or movie. She has a stuffed animal in her hand, possibly holding it while watching TV. The TV is placed further into the room, suggesting that it is a large or flat-screen model.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "vase", "tv", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 79261, "caption": "According to the picture, there is a man standing on a street in the middle of a large fountain filled with water. The man appears to be cleaning the water using a mop. There is a sign nearby that says \u201cplease walk,\u201d indicating the man may be promoting the importance of clean water. The man is wearing a blue shirt, and there are several people nearby looking at him as he goes about his task. The street area surrounding the fountain seems to be occupied by these observers, and there are at least seven individuals in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": "In the image, a man is sitting in a chair, using his laptop in the dining room. The scene shows two tables: one located closer to the man and the other placed towards the back of the room. On the nearer table, a cup is situated in the left corner and a bowl is placed in the middle of the table. The man is holding a cup and a bowl. On the further table, there is a bottle in the left corner.", "mscoco_hallucinated_words": [["cup", "cup"], ["bowl", "bowl"], ["cup", "cup"], ["bowl", "bowl"]], "mscoco_gt_words": ["dining table", "laptop", "potted plant", "bottle", "chair", "person"], "hallucination_idxs": [49, 58, 73, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.8333333333333334}}, {"image_id": 415828, "caption": "According to the picture, there are several people gathered on a city street, likely enjoying a conversation or just spending time together. Some people are wearing backpacks, suggesting they might have just come from work or school. They stand in front of a house or building, which has a red roof and white walls. A person near the left side is holding an umbrella, and others in the group are holding a book and a cup, possibly engaged in an outdoor activity or just passing the time while on the street.", "mscoco_hallucinated_words": [["book", "book"], ["cup", "cup"]], "mscoco_gt_words": ["handbag", "umbrella", "traffic light", "stop sign", "car", "backpack", "person"], "hallucination_idxs": [80, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.42857142857142855}}, {"image_id": 543231, "caption": "According to the picture, there is a woman playing a video game using a remote, with several people in the background. She is seated and surrounded by others who are likely watching her play. The remote she is using is located right in the middle of the image. A couch is also visible in the scene, situated close to the woman, possibly for her companions to sit and watch the game.", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["remote", "bowl", "book", "bed", "person"], "hallucination_idxs": [54], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4}}, {"image_id": 495357, "caption": "The scene depicts a man riding a motorcycle down a city street, surrounded by several pedestrians on the sidewalk. Some of the pedestrians are wearing ties, possibly indicating they are dressed professionally or attending a formal event. The atmosphere appears to be lively, as the group is observing the man on the motorcycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "person", "handbag", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 523175, "caption": "According to the picture, it appears to be a bowl filled with soup. The soup is made with a variety of ingredients, including carrots and a variety of vegetables. A spoon can be seen in the picture, which is likely used for scooping up the soup or serving it. There are two oranges and one carrot present in the image as well. The bowl is placed on a dining table or tray, indicating that it may be part of a meal or a dining setting.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["bowl", "carrot", "spoon", "dining table"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 184945, "caption": " The image features a line of motorcycles parked next to each other in a parking lot, creating a busy scene. There are three motorcycles visible in the middle of the image, while two more motorcycles are located closer to the right side of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 49942, "caption": "The image shows a young man wearing a tie and holding a black umbrella up to his face. He is standing upright, with his head positioned below the umbrella. The umbrella covers part of his face, hiding his left eye. The man's face is centered in the scene, capturing his attentive and thoughtful expression.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "According to the image, a blue train has arrived at a train station, where multiple people are standing on a platform. They seem to be waiting for the train to depart or simply observing the train while it stops at the station. The train appears to be an old-fashioned train with a white color scheme, and it has a prominent \"T\" shape in the middle of its front.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 42889, "caption": "The image features a teddy bear sitting on a table, with glasses and a wireless keyboard beside it. The teddy bear appears to be wearing a pair of glasses, while the keyboard is placed on the table. In the background, there is another electronic device, possibly a computer or a laptop, covered in various items like books, pens, and a cup, likely representing someone working with technology or using it for personal interests.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["computer", "laptop"], ["laptop", "laptop"], ["book", "book"], ["cup", "cup"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear"], "hallucination_idxs": [9, 39, 53, 56, 63, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": "According to the picture, there is a wooden table full of various objects including a couple of potted plants. One of the potted plants is a blue vase with green plants, and another vase features a small plant inside it. There is also a skull-shaped item on the table. The plants appear to be placed strategically, creating a visually pleasing arrangement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 247587, "caption": " The image features a man standing on a surfboard on a sandy beach. The surfboard is spread out in front of the man, and he appears to be waving to someone in the distance. Several people can be seen in the background, walking or standing in various locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "There is a large city bus parked at the end of a street. The bus has several passengers, including a woman standing at the bus's front and back of the bus. The bus has several front and rear doors open, giving access to the passengers. The person closest to the bus has a bicycle, possibly a biking or transportation method. There are several other bikes in various locations, one of which is positioned behind the person with the bike and another near the front of the bus.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 545056, "caption": "The image shows a vast, green field with a herd of grazing cows. Many of them are black, while others are brown or possibly a mixture of the two colors. The cows are spread out throughout the field, with some of them lying down in various positions, while others are standing up and grazing. The herd is large, with multiple cows occupying the field. The cows appear to be enjoying their time on the open grassy land.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "sheep", "cow", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": "According to the image, there is a man standing in a kitchen surrounded by a variety of food items. He appears to be preparing food, possibly a meal, as he has multiple foods in front of him, and he is attentive to his task.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "oven", "dining table", "bowl", "book", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": "According to the picture, a group of snowboarders are gathered around a sign, getting ready to ski or snowboard down the slope. There are five people in total in the scene, and they all appear to be wearing backpacks and snow gear. Some of the snowboards can be seen on the ground, likely belonging to the people in the group. There are two bottles in the scene, likely for drinking or hydration during their excursion.", "mscoco_hallucinated_words": [["snowboard", "snowboard"], ["snowboard", "snowboard"], ["bottle", "bottle"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [20, 50, 70], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 1.0}}, {"image_id": 246963, "caption": "According to the picture, a group of motorcyclists is riding down a road, with some of them wearing helmets. There are several motorcycles visible, including some near the stop sign, which is located on the side of the road. A few of the motorcyclists appear to be holding hands as they ride, forming a social bond. The scene depicts a casual ride with friends.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "stop sign", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498225, "caption": "A group of people is standing in front of a display of desserts in a store. They appear to be admiring and enjoying the selection of desserts, with some looking more closely at the cakes and others just taking in the scene. There are several cakes on display, varying in size and shape. The people are spread out throughout the scene, and a handbag can be seen near one of the individuals. Overall, it appears to be a busy store with customers appreciating the dessert selection.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cake", "cell phone", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124601, "caption": "According to the picture, it is an image of a small kitchen with a white, stainless countertop filled with various items. A laptop is placed on the countertop, while a couple of bottles are situated on different areas. There are also two mugs, a microwave, a cell phone, a cell phone accessory, and multiple cups in the kitchen. ", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [53, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 581749, "caption": " The image displays a variety of vegetables, including broccoli, carrots, green beans, and oranges. The broccoli is placed in the upper right corner of the image, followed by the carrots and green beans on the left and right sides. The oranges are arranged towards the bottom of the image, occupying the right side and extending slightly towards the left. The arrangement of these vegetables creates a colorful and appealing display.", "mscoco_hallucinated_words": [["carrot", "carrot"], ["carrot", "carrot"]], "mscoco_gt_words": ["broccoli", "orange", "dining table"], "hallucination_idxs": [11, 35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 227960, "caption": " The image captures a dining table that is filled with various items. At the table, there are multiple pieces of silverware, including two forks and two spoons, as well as two carrots.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["fork", "fork"]], "mscoco_gt_words": ["cup", "bowl", "spoon", "wine glass", "carrot"], "hallucination_idxs": [5, 15, 26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.4}}, {"image_id": 286503, "caption": "According to the picture, there is a young elephant standing on the ground at a petting zoo or other outdoor environment. The elephant is close to a tree and appears to be looking at something. Surrounding the elephant are several people, who might be visitors observing the animal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": "The image is of a modern kitchen featuring two stainless steel sinks and granite countertops. The kitchen is filled with various cooking utensils and is well-appointed with various cabinets. There are three chairs present in the kitchen area, as well as a large couch in the dining area. The couch is the only piece of furniture in the room, creating an inviting and relaxing living space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "tv", "bowl", "potted plant", "sink", "microwave", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 142108, "caption": "According to the picture, there is a group of young people, including adults, standing around a dining table with several bananas on it. They appear to be engaged in a fun and educational activity, possibly blindfolded, where they are learning about various foods and working on [IDK]. Some of the bananas are placed closer to the edge of the table while others are situated more towards the middle of the dining area. The people in the scene seem to be in different positions, and some are closer to each other while others are more spread out.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": "A baseball game is in progress with a batter, ready to swing, in the batter's box. The batter is holding a baseball bat and is prepared for the pitch. A catcher and an umpire are also present in the scene. In total, there are 12 people in the picture, which is likely a group of players and spectators. A bench is visible at the side of the field, potentially serving as a place for the players to sit during the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 475043, "caption": "According to the picture, there are a few people in a room, sitting on couches or chairs, possibly having a relaxed gathering or social event. Some of the individuals are sitting on the floor, and one person is drinking from a cup. A laptop is also present in the scene, either being used or brought as an additional element to the event. A couple of dining chairs are placed in the room as well, suggesting a space for the participants to enjoy food or drinks together.", "mscoco_hallucinated_words": [["chair", "chair"], ["cup", "cup"], ["chair", "chair"]], "mscoco_gt_words": ["bowl", "spoon", "laptop", "bottle", "person", "couch"], "hallucination_idxs": [18, 46, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.5}}, {"image_id": 456784, "caption": "In the image, a man is holding a knife and slicing a cake, cutting it into smaller portions. The man's position and the knife's movement suggest that he is in the process of serving the cake to others. There are two other knives, one located closer to the center of the table and another one near the left side of the cake.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 33753, "caption": "According to the picture, it depicts a grey bench in a park setting. A remote control is sitting on the bench, appearing to be a key component in the design. The bench is a wooden structure with no visible dirt or scratches on it. A chair is nearby, but not in direct proximity to the bench.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "bench", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 448694, "caption": " The scene shows a group of people actively playing in a park with a frisbee. The frisbee is being thrown by a man in the foreground, and two more people are in close pursuit. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": " The image displays a bowl of food filled with noodles and vegetables. A spoon can be seen within the bowl, suggesting the dish is ready to be enjoyed. The vegetables appear to be including broccoli and there are multiple pieces in the bowl. Additionally, there is a cup placed at the top left of the image, which may be used for drinks or storing utensils.", "mscoco_hallucinated_words": [["spoon", "spoon"], ["cup", "cup"]], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [14, 52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 503210, "caption": "According to the picture, a spacious and bright living room with a wooden floor and yellow walls is furnished with several items. There are two couches in the room, one closer to the middle and the other in the right corner. A piano can be seen in the left corner, and a television is positioned near the couch in the middle of the room. A number of chairs are also scattered throughout the space, some close to the couches and others further away. The room features several decorative pots in various positions, providing a touch of style and personality. The overall atmosphere appears to be warm and inviting.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["clock", "potted plant", "vase", "orange", "book", "chair", "couch"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.2857142857142857}}, {"image_id": 142257, "caption": "According to the picture, the scene is a busy city street with multiple cars driving down the road. There are four cars present, with a red car, a blue car, and two other cars, one in the center of the street and the other towards the left side. People are present as well, moving around the city. Some people can be seen in the vicinity of the cars, traveling to various destinations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}], "overall_metrics": {"CHAIRs": 0.338, "CHAIRi": 0.11631944444444445, "Recall": 0.5480637813211845, "num_hallucinated_caps": 169.0, "num_caps": 500.0, "hallucinated_word_count": 335.0, "coco_word_count": 2880.0, "length_response": 72.166, "hallucinated_caps_ls": [144305, 164091, 54870, 324008, 52700, 39053, 493862, 128119, 401027, 507171, 231534, 227031, 209347, 12375, 13550, 2154, 104326, 300784, 409346, 344942, 275929, 465601, 490491, 133146, 185327, 348654, 482432, 454161, 129707, 203110, 451943, 529256, 37907, 368367, 295656, 401010, 313928, 490284, 346940, 278627, 400851, 122602, 24104, 59710, 527497, 411564, 285302, 61959, 5934, 131138, 258209, 285820, 245664, 453731, 87052, 418692, 259345, 119709, 378831, 449798, 375769, 426172, 152499, 460866, 333998, 443259, 301837, 309071, 240681, 47781, 14773, 261318, 134768, 526414, 441363, 188136, 89378, 99874, 357, 561967, 151081, 134223, 572081, 301266, 475813, 558784, 314016, 564629, 53431, 381031, 9628, 330050, 163518, 423984, 499486, 118124, 288729, 252625, 328110, 367872, 88269, 98979, 536154, 416384, 531327, 1268, 575406, 277073, 377879, 436814, 45550, 444913, 303099, 271565, 205193, 272566, 170099, 14940, 140702, 354540, 191301, 124236, 82488, 348251, 381615, 352703, 226802, 49005, 425569, 423008, 35827, 336209, 283118, 429598, 490711, 163728, 456756, 277051, 145666, 304984, 518287, 145448, 352900, 18150, 418711, 39663, 474499, 89541, 239445, 105782, 138834, 111409, 397268, 444304, 305000, 44136, 301402, 415828, 543231, 523175, 42889, 2867, 124601, 581749, 227960, 475043, 456784, 99053, 503210]}}}