{"overall_metrics": {"CHAIRs": 0.33867735470941884, "CHAIRi": 0.10801393728222997, "Recall": 0.5586490187129165, "num_hallucinated_caps": 169.0, "num_caps": 499.0, "hallucinated_word_count": 310.0, "coco_word_count": 2870.0, "length_response": 69.64529058116233, "hallucinated_caps_ls": [144305, 308003, 54870, 324008, 234433, 524656, 401027, 253630, 326021, 161227, 343534, 254368, 231534, 209347, 523292, 209925, 2154, 104326, 233888, 277479, 300784, 169169, 344942, 465601, 300124, 490491, 133146, 498747, 227204, 185327, 348654, 482432, 37907, 506401, 406611, 278365, 368367, 21779, 405183, 148977, 399058, 325836, 437910, 401010, 567562, 313928, 346940, 122602, 183709, 59710, 527497, 411564, 285302, 258209, 285820, 245664, 453731, 253262, 418692, 259345, 443299, 545594, 437720, 288576, 535403, 426172, 180764, 147762, 425447, 443259, 301837, 240681, 278890, 47781, 14773, 521863, 250540, 298067, 563938, 261318, 272728, 188136, 99874, 557447, 357, 134223, 301266, 475813, 539951, 143479, 230983, 564629, 53465, 381031, 530520, 9628, 118124, 288729, 252625, 493610, 30667, 366111, 532481, 70815, 88269, 463211, 396269, 98979, 536154, 560018, 256628, 364783, 531327, 451449, 51054, 575406, 436814, 63934, 275611, 478380, 422326, 45550, 444913, 82709, 303099, 205193, 170099, 354540, 191301, 124236, 344930, 14285, 82488, 150703, 226802, 49005, 319607, 210372, 349642, 45435, 336209, 283118, 429598, 424174, 490711, 163728, 281028, 304984, 145448, 352900, 202658, 39663, 474499, 105782, 487990, 111409, 127659, 305000, 44136, 495357, 42889, 401411, 581749, 227960, 286503, 315740, 142108, 475043, 503210]}, "results": {"sentences": [{"image_id": 144305, "caption": "There are a few computer devices displayed in the image. A laptop, placed on top of a table, is sitting in the center of the scene, connected to various cables and a monitor. On the right side of the laptop, there is a television set, likely on and displaying content, and a mouse nearby. On the left side, a keyboard is positioned, making it a multi-functional electronic setup.", "mscoco_hallucinated_words": [["monitor", "tv"], ["television", "tv"], ["mouse", "mouse"]], "mscoco_gt_words": ["keyboard", "laptop", "dining table"], "hallucination_idxs": [36, 49, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 1.0}}, {"image_id": 164091, "caption": "In the image, there are two dolls placed side by side in a baby carriage, a white one and a teddy bear, along with a bed in the background. The white doll, dressed in a blue outfit, seems to be sleeping in the carriage while the teddy bear is sitting next to it. The bed is also located in the background, but its position seems to be at the far left. The image showcases the assortment of dolls and a bed in a bedroom setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 308003, "caption": "The image shows a wooden bench with a splintered, worn surface. It appears to be in a state of disrepair, possibly indicating a public park or a picnic area. The bench takes up most of the scene, occupying the majority of the visual space. There is a chair to the left of the bench, also with a worn appearance. Additionally, a few people can be seen in the scene, likely enjoying the bench in the outdoors.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 54870, "caption": "There is a plate with a sandwich and salad on it, sitting on a table. The sandwich appears to be made with ham, and the salad is covered in multiple types of greens and a few tomatoes. There is a knife positioned near the plate on the left side. It seems like a meal, perhaps served in a restaurant. The atmosphere is neat and well organized.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "fork", "knife"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 95908, "caption": " The image features a desk with a mouse pad placed in front of it, and a computer keyboard resting on the desk's surface. A computer mouse is sitting on top of the keyboard, while a small metal cup with a handle is positioned nearby. A cup, likely for drinking, is placed in the left side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 329614, "caption": "This is a large image, featuring a cobblestone street with multiple lanes and a few buildings surrounding it. In the foreground is a cute, yellow school bus travelling down the street, possibly passing by a group of people near a church or other building. There are multiple cars in the street, some farther away and a few closer to the side of the image. The street and buildings show a city scene with an urban atmosphere. The clouds above create a stormy sky, further enhancing the atmosphere of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 324008, "caption": "The image displays a tray full of food with various items. There are three bananas neatly arranged on the tray, as well as a cup filled with yogurt. The food also includes a bread roll placed on the left side of the tray, and a bottle containing juice is situated on the right. A seat can be seen on the left side of the image, indicating this scene might take place in a plane or a train.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["seat", "chair"]], "mscoco_gt_words": ["cup", "dining table", "train", "orange", "airplane", "person", "banana"], "hallucination_idxs": [49, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5714285714285714}}, {"image_id": 52700, "caption": "A group of people is standing around a dining table in a large, open room. The dining table is covered with a variety of desserts, including various cakes and cakes arranged neatly in rows along the center of the table. There are 14 slices of cake in total, with some cakes having multiple slices and others having just one slice.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 501311, "caption": "The image captures a baseball game in progress, with multiple people playing on the field. The main focus is a pitcher throwing a baseball while wearing baseball cleats, which can be seen in the center of the field. In total, there are at least 12 individuals spread throughout the scene, some playing in different positions and others watching the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": "The scene is a city street at night where a group of people dressed in white is gathered around two elephants walking down the street. The elephants are decorated with garlands or lanterns on their backs. One of the elephants is being held by several people, while the other is walking alongside the crowd. The people in the scene are likely celebrating some kind of festivity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "clock", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": " It is a well-lit bathroom scene with a toilet and a sink positioned along the walls. The toilet is closer to the left side of the image and the sink is closer to the right side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toothbrush", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 196341, "caption": " The image shows a large clock with two circles on its face, surrounded by a group of people standing close to it. The clock appears to be the primary focus of the scene, with many people gathered around it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "According to the image, it depicts a street scene with a couple of cars parked on the street. There are at least six vehicles in the scene, including a truck parked further back, several smaller cars in the foreground, and a few other cars visible further down the road. ", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 455610, "caption": "The scene depicts a group of people gathered around a red and green motorcycle parked on the street. Several individuals are standing near the motorcycle, engaged in conversation or observing the scene. There are a few cars in the background, one on the left and another one in the right corner. Some traffic cones are placed around the area, likely for parking purposes or to control pedestrian flow.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 39053, "caption": "The image presents a table filled with a variety of food items. There are several dishes on the table, including bread and a bowl filled with a yellow substance. There are also several cups and a cell phone placed around the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "cell phone", "apple"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 493862, "caption": "According to the picture, it features a person standing in a field with a fly swatter, seemingly enjoying the outdoors. The scene is calm, and the person is standing still, surrounded by green grass in the field. There are no other notable objects or people in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tennis racket", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 269873, "caption": "The image is a black and white photo of a group of people standing in the street near the intersection of Avenue [IDK]. There are several people in the scene, with some of them wearing ties, as well as some others standing nearby. There is a trolley car in the background, which is likely a streetcar or bus.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "bus", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 524656, "caption": " The image depicts a small kitchen filled with various items and appliances, such as a sink, an oven, and a microwave. In the center of the kitchen, there is a counter with two bowls, multiple cups and wine glasses placed on top, creating a display of bottles and vessels. A spoon can be found in the kitchen, potentially being used for cooking or serving food. The kitchen also features a window, allowing natural light to enter and providing an opening for ventilation.", "mscoco_hallucinated_words": [["vessel", "boat"]], "mscoco_gt_words": ["toaster", "cup", "knife", "oven", "refrigerator", "bowl", "spoon", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.7272727272727273}}, {"image_id": 369547, "caption": "The image shows a beach scene where a dog and a cat are sitting on the beach together. The dog is near the left side of the image, while the cat is more centered in the shot. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "dog", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": " In this image, a man is leaning on the open refrigerator door while standing in front of it. The scene appears to take place in a kitchen setting. The man looks tired or exhausted as he leans against the refrigerator. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bowl", "bottle", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 128119, "caption": " The image captures a large bus on a city street with a bike and a number of bicycles parked on the sidewalk in front of it. Some of the bikes have been locked to the posts while others are unlocked. The bus is positioned with one of its sides closer to the roadside and another bike on the sidewalk just next to the bus. There is a second bicycle in the foreground, closer to the front of the bus.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "handbag", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 401027, "caption": " The image features a table with a plate of food on it, including a sandwich and a cup of dressing. There is also a knife on the table. The table is set with cups and dishes, indicating that it may be a dining setting. There are three chairs, two situated near the left side of the table, and the third one located on the opposite side. The scene is completed by a vase, placed in the center of the table, adding to the overall aesthetic.", "mscoco_hallucinated_words": [["chair", "chair"], ["vase", "vase"]], "mscoco_gt_words": ["cup", "knife", "dining table", "sandwich", "fork"], "hallucination_idxs": [52, 80], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 0.8}}, {"image_id": 253630, "caption": "The scene features a small square table sitting in the middle of the room, surrounded by various items including a television, a remote control, and a bowl. The table is positioned next to a window on the left side of the room. A couch is situated close to the window on the right side, while a few chairs are placed at the back of the room near the wall. ", "mscoco_hallucinated_words": [["remote", "remote"], ["bowl", "bowl"], ["couch", "couch"]], "mscoco_gt_words": ["potted plant", "chair", "tv", "dining table"], "hallucination_idxs": [24, 29, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.75}}, {"image_id": 326021, "caption": " A woman sits at a table wearing a blue denim jacket with a teddy bear beside her. The table is draped with a red tablecloth and is surrounded by chairs on all sides. The woman is holding a second teddy bear, which she places on the table in front of her. There is another teddy bear placed on a different table, further back in the scene. The setting appears to be outdoors as there are trees visible in the background.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "person", "dining table"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 317320, "caption": " The scene depicts a man riding a bicycle down a street with several other people, some of whom are walking. The streets have been marked with yellow crosswalks. The man is surrounded by pedestrians, including another person riding a bicycle, and a few more people nearby, likely taking part in the group activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 507171, "caption": "In the image, a man is riding a surfboard on a wave in a large body of water. The waves are near a bridge, which can be seen in the background of the scene. There are also other smaller waves in the background, creating a dynamic and exciting scene. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 161227, "caption": " According to the picture, there is a chair sitting in the room with two remotes and a cell phone on it. One of the remotes is located on the left side of the chair while the other two remotes are placed on the right side. The cell phone is also on the right side, towards the top. There are other items in the room as well, including a bottle, cups, a laptop, and a remote on the desk nearby.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["desk", "dining table"]], "mscoco_gt_words": ["remote", "keyboard", "laptop", "cell phone", "chair", "couch"], "hallucination_idxs": [74, 76, 86], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 25812, "caption": " The scene features a person snowboarding down a ski slope, skiing with a snowboard rather than skis. There is a ski lift nearby, possibly at a ski resort.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 267537, "caption": "The scene depicts a group of women sitting on bleachers, with one wearing a coat. The women are sitting next to each other and covering themselves with rain umbrellas. There is also a man in the picture, likely sitting near or behind them. There are other people in the background, but they appear to be less prominent in the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 195645, "caption": " There is a group of people sitting outside at a table, dining and having a conversation. They are gathered around a dining table, which is placed in the middle of the outdoor space. Some of the people are dressed in black, indicating that they might be in mourning or attending a memorial. Among the group, a baby sits on the floor, observing the proceedings. There is also a wine glass sitting at the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "dining table", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 155671, "caption": " A sunny day finds a traffic light hanging over the middle of the street, near the intersection with several roads. Cars, trucks, and a semi-truck are driving on various parts of the city street. There are four traffic lights in view, positioned throughout the surrounding roads. A couple of cars are parked at different spots around the street, and a few pedestrians can be seen walking around, including two in the distance. The street is quiet, with no significant construction or activity taking place.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412894, "caption": "A busy street scene is taking place with a large group of people walking along the sidewalk near a subway entrance, creating a bustling atmosphere. Some people are carrying bags and purses, while others are just walking around in the vicinity. There are two traffic lights, one near the front of the scene and the other towards the end of the street, possibly controlling traffic in the area. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "handbag", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 32625, "caption": "There is an image in front of a table, displaying several plates with food. There are various pieces of food on the plates, including two pieces of food in the upper left corner, a bowl and another plate of food. On the lower right side of the image, there is a bowl of food. In addition, there are some cups placed on and around the table. The overall scene features different food items and dining items arranged on and around a table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bowl", "fork", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 18688, "caption": "The image shows a baseball game taking place in a field. There are several people standing around the area, some near the batter and others scattered throughout the field. There is a man holding a baseball bat and wearing a baseball glove, while others in the group wear various clothing items. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 343534, "caption": "The image depicts a small doll sitting next to two stuffed animals. The doll is wearing a pink outfit, possibly a top and skirt, and is holding onto one of the stuffed animals. The doll appears to be a small person or a doll, with pink being the predominant color in its appearance. There is a small chair nearby, and the doll is positioned on the left side of the image, while the stuffed animals are on the right side of the image.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["teddy bear", "potted plant", "dog", "book", "chair"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 370657, "caption": "According to the image, there are several people riding motorcycles, one of which has a young child sitting on it. Some of the people are wearing masks.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": " In a crowded sports stadium, a man is standing in the bleachers while eating a hot dog. There are several people in the background, spread throughout the stadium, suggesting a busy atmosphere. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460347, "caption": "The scene depicted is a large bus driving down a road with many cars following it. There are a total of eight cars, two of them are in front of the bus and the others are positioned further behind it. The bus appears to be making its way through the city, and multiple drivers are visible behind the wheel of their respective vehicles.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 398753, "caption": "The scene is an outdoor baseball field with a large crowd watching a game. There is a baseball player on the pitcher's mound, while the batter is holding a baseball bat ready to hit the ball. The crowd is dispersed around the field, with some people sitting on bleachers in various positions. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 67833, "caption": " The image features a large pizza placed on a tabletop, likely in a restaurant or home setting. There is another pizza sitting on top of a plate placed near the first pizza. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.1111111111111111}}, {"image_id": 272880, "caption": " The image shows a piece of pizza sitting on a plate that is placed on a table or dining table. The pizza appears to be mystic pizza, an artistic and hand-painted pizza that is famous for its unique and intricate designs. The pizza appears to be cut up, with a piece being held up in front of the other pieces, perhaps indicating that it is the last piece on the plate.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 254368, "caption": " The image features a man sitting in a bed and eating a dish, which could be either a piece of cake or a sandwich. He is dressed in a robe, which adds a comforting and warm atmosphere to the scene. There is a dining set close to the bed, possibly including a dining table and a few chairs. The man seems to be enjoying his meal and relaxing in his bed.", "mscoco_hallucinated_words": [["cake", "cake"], ["sandwich", "sandwich"], ["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["pizza", "bed", "person"], "hallucination_idxs": [21, 24, 58, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": " The scene takes place at a baseball field where a couple of players are standing on the field. There are two main players, one of them holding a baseball bat in his hand, swinging at an object near him, likely an incoming ball. Another player is positioned further back on the field, also holding a baseball bat. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 459141, "caption": "According to the picture, there is a woman wearing a conical hat, riding a bicycle, and carrying a large bag on the bike's rack. The scene takes place in a residential area with other homes and buildings in the background. The woman is likely the main focus of the scene as she is riding the bicycle. The other objects in the scene may be part of the surrounding environment, such as the bicycle, or they might be in the background, creating a sense of space and depth in the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image features an airport runway with a few vehicles, including a large truck and a few cars. The truck is positioned near the runway, loading luggage. The cars are located further down the runway and appear to be part of an airport transport system.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"]], "mscoco_gt_words": ["truck", "suitcase", "handbag", "airplane", "person"], "hallucination_idxs": [18, 32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.4}}, {"image_id": 227031, "caption": "The image depicts a man and a woman cutting a three-tiered wedding cake together, possibly in front of an audience. The man is wearing a black suit, and the woman is wearing a gray suit. They are in the process of cutting the cake while standing at a table dining in a public venue, such as a reception hall. A few other people can be seen in the scene, but they seem to be further away from the cake-cutting action.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "tie", "potted plant", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 209347, "caption": "The image depicts a woman in a black leather jacket sitting on a bench in a subway car, holding her cellphone and looking down at it while wearing earrings. The woman is surrounded by various people on the subway car, including two people to her left and one person in front of her.", "mscoco_hallucinated_words": [["bench", "bench"], ["car", "car"], ["car", "car"]], "mscoco_gt_words": ["handbag", "tie", "train", "cell phone", "person"], "hallucination_idxs": [13, 17, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 523292, "caption": "The scene features a cat sitting on a couch, which is located at the center of the image. It is leaning its head onto a large open laptop that is on top of a table in the living room. The table is situated in the lower part of the picture. A cup is placed on the table, positioned near the laptop.", "mscoco_hallucinated_words": [["couch", "couch"], ["cup", "cup"]], "mscoco_gt_words": ["tv", "dining table", "laptop", "cat", "chair"], "hallucination_idxs": [8, 55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6}}, {"image_id": 12375, "caption": "According to the picture, a plate with food on it is a meal featuring multiple items including meat, potatoes, and vegetables. The meal includes a variety of items like potatoes, carrots, and broccoli. The plate is on a table, possibly a dining table, and appears to be a meal or snack served to someone. The food arrangement can be considered as a hearty, home-style meal with a variety of ingredients.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 13550, "caption": "In the image, a woman is standing on top of a large rock, surrounded by two elephants. The woman appears to be kneeling down and leaning over the rock, close to the water. It seems as if she is interacting with the elephants, perhaps giving them food or simply getting close to them. The two elephants are both different sizes, with the larger one closer to the center of the image and the smaller one towards the left edge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": " The image features a young man riding a skateboard on a rail, which is a horizontal surface that extends from the sidewalk, possibly a concrete edge or metal rail. The skateboarder is positioned on the rail, focusing on his ride.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": " In the image, a man is standing in a corridor or walkway and checking his cell phone. The man is wearing a white shirt and tie. There are a couple of other people in the area, including another man standing to the right of the cell phone user.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209925, "caption": "A man is standing on a baseball field, wearing an orange uniform. He is holding a baseball and appears to be getting ready to throw it during an inning. There are several other people in the background, dispersed throughout the field, possibly watching the game or preparing to participate.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [11], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 2154, "caption": "The image features a shopping cart sitting on the sidewalk near a curb, next to a park bench. The bench appears to be a common area where people gather or rest. A few dining tables can be seen in the area as well, one towards the center of the image and the other two near the right side. There are also two people visible in the scene, both standing close to the left side of the image. The overall atmosphere of the scene appears to be in a public area, perhaps a shopping district or a city street.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["car", "umbrella", "bench", "person"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 274931, "caption": "The image is a black and white photo of a sidewalk with parked cars along the side of the street. There are several cars parked next to each other on the sidewalk, creating a row of vehicles. The street is a narrow and seems to be in the middle of a neighborhood. Some of the cars are old and vintage models. The black and white nature of the image gives it a vintage look.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "In a scene filled with green, a small but cozy kitchen has been renovated with wood floors. A dining table with six chairs is situated near the kitchen, occupying most of the space. A refrigerator is present in the kitchen, as well as a clock on a wall. Two potted plants are visible, one in the background and the other on the left side of the image.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["clock", "knife", "oven", "dining table", "potted plant", "sink", "microwave", "chair"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 233888, "caption": " The image features a young boy, likely in his home, playing baseball in front of a large TV. He is holding a bat and has just swung it at a television screen that displays a baseball game. In the room, there is a clock situated near the television and a series of bottles lined up on a shelf.", "mscoco_hallucinated_words": [["clock", "clock"], ["bottle", "bottle"]], "mscoco_gt_words": ["tv", "potted plant", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [48, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 213905, "caption": "The scene features two female tennis players, both dressed in white, standing on a tennis court together. They are holding tennis rackets and appear to be friends. The court has several chairs around it, possibly for spectators to sit on. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "tennis racket", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 277479, "caption": " The image shows a young girl standing near the refrigerator, looking down into the refrigerator and about to open the door to inspect the contents. The fridge is positioned in the left-hand corner of the scene, surrounded by various bottles and a bowl placed on the shelf.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 300784, "caption": "According to the picture, there is a group of four children and a pet dog sitting on a couch. The children are smiling, indicating a friendly atmosphere among them. There is also a bottle sitting on the couch, likely to be for the pet dog.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["person", "bench", "dog", "couch"], "hallucination_idxs": [37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image features a collection of small cakes with fruit on display. They appear to be decorated with various types of berries, including cherries, as well as ornaments. There are three cakes in total, with one larger cake situated in the center and two smaller cakes on either side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 169169, "caption": "A busy sidewalk is the setting of the image with people walking near different businesses. The street is lined with green banners, with one banner saying \"spring kkt.\" Among the sidewalk's features are several traffic lights and trees, including one located at the sidewalk's edge. Two large vases are prominently placed along the sidewalk, adding visual interest. A bench and a trash can can also be seen in the scene.", "mscoco_hallucinated_words": [["vase", "vase"], ["bench", "bench"]], "mscoco_gt_words": ["traffic light", "person", "potted plant", "car"], "hallucination_idxs": [56, 69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.5}}, {"image_id": 409346, "caption": "The image depicts two women standing in a room, attentively serving food from a buffet table. The table is filled with various desserts and sweet treats, like cupcakes, cake slices, and cookies. One woman is on the left side of the table, while the other is on the right side. There are several dishes displayed around the table, and the food appears to be arranged in a festive manner. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 522262, "caption": "In the image, there are several bicycles parked against a fence, with some located closer to the viewer and others farther away. A handful of these bicycles are colorful, and most of them appear to be the same size. The scene is set near a city street, likely a side street, and the bicycles are lined up along the fence. There is also a tall building visible in the distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "According to the picture, there is a man, wearing glasses, standing behind a mirror and holding an orange in front of his face, creating an interesting optical illusion. The orange creates the illusion that the man is holding a real lemon in front of him, while the reflection shows a different face or another object.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "The image is of a small robot toy sitting on top of a laptop computer. The robot is positioned prominently in the scene, appearing as if it is attached to the keyboard. There is another robot toy situated near the screen, giving the impression that they are communicating with each other. ", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": " A series of white toilets can be seen on the stairs in this image, with several of them broken and placed against the steps in an unusual arrangement. The toilets appear to be scattered up the stairs, with some situated closer to the bottom, while others are closer to the top.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "bowl", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 465601, "caption": " The scene depicts a group of people standing outside under umbrellas, which provides shelter from the rain. Most of them are wearing backpacks and walking around. There are at least three individuals holding umbrellas, while another person is wearing an umbrella-like handbag, which can be seen as an alternative to carrying an umbrella.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["handbag", "umbrella", "bench", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 300124, "caption": "The image shows a microwave sitting on a couch, which is placed on the sidewalk outside a building. The couch is empty except for the microwave, which appears to be in the process of being removed or repaired. The TV is positioned on top of the couch, with the microwave underneath it. The scene is situated close to a trash can, possibly on a street corner or outside a store. The TV and microwave occupy a significant portion of the scene, drawing attention to the unconventional use of furniture for cooking purposes.", "mscoco_hallucinated_words": [["tv", "tv"], ["tv", "tv"]], "mscoco_gt_words": ["microwave", "couch", "chair", "dining table"], "hallucination_idxs": [43, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.5}}, {"image_id": 46919, "caption": "According to the image, a group of people is hiking up a snow-covered hill. The group consists of at least three individuals, with one of them standing at the front of the group while the others are trailing behind. Each person is wearing a backpack, indicating that they may be on a hiking or camping trip. The hill is covered with snow, creating a wintery scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 434022, "caption": " In the picture, a train is moving along the train tracks. The train is quite long and has multiple railroad tracks beneath it, extending off in different directions. Some of the tracks are visible near the front of the train, while others are further back. There is a clear steam emanating from the engine, indicating a train blowing steam. Additionally, there are no other vehicles on the road in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 490491, "caption": "There is an image with various signs on a red brick wall. The image includes several street signs in various positions, and a sign with a cross on it is visible. The scene appears to be a street corner or roadside view, with a single person walking in front of the wall and another individual closer to the sign with the cross on it. The signs are a mix of different types, including a traffic light, a stop sign, and a number sign. The scene conveys a sense of an urban city environment, and the various signs provide context to the surroundings.", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [80], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 175878, "caption": " In the image, a group of four people, two men and two women, are standing together at a porch, enjoying the outdoors. One of the men is looking at a cell phone, possibly using it to take a picture of the group. They appear to be socializing and having fun.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 12669, "caption": "A woman in a white top and white pants is standing in front of a large artwork, possibly a painting or a statue, holding a wine glass. She is surrounded by various other objects and sculptures, including a vase on the right side of the scene. Additionally, there are a couple of people in the scene. One person, a man, is standing in the background, while another person, a woman, is standing next to the wine glass-holding woman.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "handbag", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 192496, "caption": " There is a train station in the image, with two trains parked at a platform next to a building. The first train is closer to the building, occupying a central position along the platform. It extends from the lower left to the middle of the scene, with a longer portion visible near the center. The second train is slightly further from the building, positioned on the right side of the platform. This train is shorter in comparison to the first train and appears towards the upper right part of the image. Both trains have some cars attached to them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 133146, "caption": "The image displays a large, comfortable living room with several pieces of furniture. A couch and several chairs, which are arranged around the room, provide seating options. There are also various accessories, including vases, a bottle, a book, and a remote control. A fireplace and a television are located in different parts of the room, with the fireplace situated near the center of the living space. The room is well-lit, giving a warm, inviting atmosphere to the space. This living room serves as a comfortable and cozy area for entertaining or relaxing.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["remote", "remote"]], "mscoco_gt_words": ["tv", "vase", "book", "chair", "couch"], "hallucination_idxs": [42, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 53744, "caption": "The image displays a cutting board with pieces of tomato, lettuce, and a knife on it. The tomato is cut into several pieces, with some pieces lying close together, and others scattered on the cutting board. The lettuce is also cut into smaller pieces, with some positioned closer to the knife and others near the edges of the board. A close-up view of the knife reveals its blade, and a sandwich is visible in the background, creating a kitchen-like atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 375902, "caption": "The image features a winter scene with skiers gathered around an opening in the mountainside. There is a yellow jacket being worn by one of the skiers, who is positioned at the front of the group with their skis and poles on display. The rest of the skiers are scattered throughout the scene, each with their own set of skis. Some are carrying backpacks while others seem to be holding their poles close by. There are multiple people visible in the image, with some individuals more towards the center of the scene and others at the edges. The overall atmosphere is lively and filled with skiers celebrating their accomplishment at the top of the hill.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 498747, "caption": " The image features a young man in the center of the picture, talking on a cell phone. He is surrounded by several people, some of whom are looking directly at the man while others are in the background. One person is holding a bottle in the left side of the picture, while another person has a wine glass on the right side. There is a bottle on the right side as well. A few other people are scattered throughout the scene, including one in the middle right, and two more towards the top right and bottom left corners. ", "mscoco_hallucinated_words": [["wine glass", "wine glass"]], "mscoco_gt_words": ["cell phone", "bird", "person", "bottle"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.75}}, {"image_id": 52435, "caption": "There are numerous people enjoying a beach day, with several people in the water, some surfing, and others watching from the shore. The beach appears to be popular with several groups of people scattered across the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "umbrella", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": "The scene features a brown cow, standing next to a store selling food and drinks, likely a grocery store or a food stand. The cow appears to be positioned close to the front of the store, with some of its body on the sidewalk and the remainder standing in the store's area. The cow likely has been drawn to the store due to the presence of many bottles containing various beverages, such as soda and/or sports drinks. There are several bottles visible in the scene, distributed throughout the store. One is positioned towards the left side, while a few others are on the right side, with a larger bottle located closer to the back side of the store. The bottles are arranged in different positions, and some are closer to the bottom, giving the image an organized look.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 261097, "caption": " A man is standing on a tennis court holding a tennis racket, possibly waiting for a match to begin. The court is crowded, with multiple people spread throughout the space. Several chairs can be seen placed around the court, which are likely for spectators or other participants to sit on during the match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 227204, "caption": " The image depicts a group of three women, possibly of Asian origin, sitting around a dining table and talking on their cell phones. They are in a relaxed and casual setting. On the table in front of them, there is a selection of food items, such as bananas, apples, and bread.", "mscoco_hallucinated_words": [["apple", "apple"]], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cell phone", "person", "banana"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4444444444444444}}, {"image_id": 353981, "caption": "The image is of an old airplane sitting on the ground. It appears to be a military aircraft, with a white and black coloring. There are three people sitting on the ground next to the airplane, one closer to the front, and the others further back. One person is closer to the rear of the airplane, while the other two are closer to the middle of the image. They seem to be at a military airfield, as indicated by the presence of an airplane and the presence of people dressed in military attire.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 185327, "caption": "The scene depicts a man walking in the middle of a street with a herd of domestic animals, including a few cows and an overwhelming number of zebras. They appear to be walking on the street, perhaps to be led by the man. The cows are positioned closer to the man and further down the street, while the zebras are more scattered and numerous. In the background, there are several people on the sidewalks, one close to a bus, and another on the right side. There is also a car parked nearby, likely waiting for the man and the herd to pass by. The street appears to be busy with various activities.", "mscoco_hallucinated_words": [["zebra", "zebra"], ["zebra", "zebra"]], "mscoco_gt_words": ["horse", "bus", "cow", "car", "person"], "hallucination_idxs": [28, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 348654, "caption": "There is a cell phone lying on a stand in front of a computer. The cell phone appears to be turned on and connected to a keyboard and a mouse, which are also placed in close proximity to the phone. The keyboard can be found close to the cell phone, while the mouse is a bit further away, possibly indicating the presence of a computer or a peripheral device nearby.", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["mouse", "mouse"], ["keyboard", "keyboard"], ["mouse", "mouse"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [27, 30, 44, 56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 482432, "caption": "The image shows two bottles placed close to each other on a bathroom sink, creating an interesting contrast. One bottle appears to be containing toothpaste, while the other is a dental product such as toothbrush. The items are situated near the bottom of the picture. Additionally, a dental brush can be found on the right side of the image, and there are a few toothbrushes visible in the picture as well.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["toothbrush", "sink", "bowl"], "hallucination_idxs": [4, 21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": "The image features a piece of pizza cut in half on a plate, accompanied by a fork and a wine glass. The pizza sits on a dining table on a dining room rug. The fork is placed near the pizza, while the wine glass is close by on the table. The dining table is empty except for the pizza, the fork, and the wine glass.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["wine glass", "pizza", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 144723, "caption": "In the image, a dog is in the middle of a field, leaping forward to catch a frisbee that is being held by another dog close to its side. The frisbee is about to be caught in the dog's mouth.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "frisbee", "car", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "A person is holding a large cordless phone with a button-type keypad. The phone is placed in front of the person's face, taking up a significant portion of the image. The person appears to be the focus of the image, with the phone being prominently displayed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "The image displays two Wii video games, one on top of the other, with the Wii controller partially inserted into the top one. The games are arranged side-by-side, creating an interesting scene. The bottom game has some of its pages opened to showcase the game's cover, while the top game appears to be closed. The two Wii games are placed on a table or surface, possibly showcasing them as part of a collection or being prepared for use.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "remote", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 203110, "caption": " The image captures a yellow trolley moving down a city street on a sunny day. The trolley has a sleek, modern appearance, featuring a large wheel on one side and an opening door on the other side. It is surrounded by trees on both sides of the street, which creates a more natural and vibrant urban setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bus", "train", "orange", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 556478, "caption": "In the image, there are two pizzas in metal pans, placed side by side on a kitchen stove. The tops of the pizzas appear crusted and browned, and they likely have been cooked and are ready to be served. The arrangement of the pizzas creates a visually appealing and appetizing display, teasing the taste buds.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "oven", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 103815, "caption": "The image captures a baseball player on the mound, actively pitching the ball. The batter is positioned on the opposite side of the field, holding a baseball bat and swinging it towards the ball during a game. In the scene, there are three other people, possibly teammates, who are also engaged in the game and visible in various locations on the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": "The scene depicts a spacious bedroom featuring a large bed, likely a king-size or queen-size bed, with matching bed sheets and white bedding. The bed is situated towards the center of the room and is accompanied by two bedside lamps, one positioned on each side of the bed. The lamps can be used for reading, providing light, or creating an ambiance in the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "bed", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 268854, "caption": " The scene takes place near a beach or lake where a woman is carrying a large basket, walking on the sand. She is near the water and there are a few small boats parked or moored near her, which have caught her attention. One of the boats is closer to the shore and another one is further away, near the middle of the body of water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "person", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 34464, "caption": " The image features a group of people and a surfboard at the beach. The people are scattered across the sandy shoreline, some closer to the water and others further back on the beach. There are three people visible in the middle of the scene, with one individual standing close to the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 495513, "caption": " The scene in the picture features two people, a man and a woman, standing next to each other holding up hot dogs on sticks for a photo. They appear to be posing for a picture, perhaps as a fun way to capture a moment or for a social event. There are a few other people visible in the background, some standing in the foreground and others further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": " The scene features a red bedspread with two dogs on it, one on the left side and another on the right. The left dog is positioned near the edge of the bed, while the right dog is closer to the middle. Both dogs seem relaxed and content in their respective positions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": "According to the picture, there is a cat sitting on various electronic devices near a television. The cat appears to be sitting on top of a TV, as well as other electronics, such as two TV remotes, a remote control, and a CD player.", "mscoco_hallucinated_words": [["remote", "remote"], ["remote", "remote"], ["player", "person"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [40, 43, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The image features a traffic light in a small town or village, situated among some buildings and near a street. There are two dogs nearby, with one dog close to the traffic light and another further away. There is also a car parked to the left of the image, and two more cars can be seen in the background, one near the center of the scene and the other on the right side. Some traffic lights can be spotted among the trees, which are placed near the building closest to the sidewalk on the left side of the scene. A few pedestrians are scattered throughout the area, including one close to the dogs and another near the middle of the scene.", "mscoco_hallucinated_words": [["dog", "dog"], ["dog", "dog"], ["dog", "dog"]], "mscoco_gt_words": ["truck", "traffic light", "cat", "car", "person"], "hallucination_idxs": [25, 30, 122], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.4}}, {"image_id": 406611, "caption": " The image features a snow-covered hill where a large group of people is gathered. Some people are sitting in the snow, while others are skiing. Among the people, there are two snowboards and at least six pairs of skis, indicating that the hill is likely a ski resort. There are several people on skis, skiing across the hill, and some skis can be seen scattered around the area. Overall, the image portrays a bustling ski resort filled with excited winter sports enthusiasts.", "mscoco_hallucinated_words": [["snowboard", "snowboard"]], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": "The scene shows several trucks and vehicles parked outside, including a truck with a white body and satellite dishes mounted on top of the roof. There are at least five people walking around in the area, some closer to the truck and others further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "remote", "tv", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "In the image, there are two people seated at a dining table enjoying pizza. They are positioned side by side, with one person closer to the edge of the table. A bowl of pizza is in the center of the table, as they eat and chat together. A cup, probably containing a drink or beverage, is also on the table, closer to the person on the left. There are two bottles on the table, one to the right of the pizza and another to the left, and a car parked outside. The scene appears to be an enjoyable evening spent together sharing pizza and good company.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["bottle", "bottle"], ["car", "car"]], "mscoco_gt_words": ["pizza", "cup", "dining table", "cell phone", "chair", "person"], "hallucination_idxs": [35, 80, 100], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1875, "Recall": 0.6666666666666666}}, {"image_id": 69757, "caption": "The image portrays a family gathering around a dining table, where an elderly woman is cutting a piece of chocolate cake. The family members appear to be interested in the cake-cutting process, with one man looking closely at the knife. Around the table, there are several chairs positioned for the people sitting down to dine.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "cake", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 432213, "caption": "The image features a plate with a sandwich and fries as the main food items on the menu. The sandwich appears to be served on a bun, with two slices and several toppings, giving it an appealing appearance. There are three people sitting and dining in the restaurant, one closer to the plate, while the other two are further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "sandwich", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 368367, "caption": " The image depicts a smiling woman holding a cell phone, talking on the phone while in an outdoor setting. She is standing close to a table, likely the dining table, and has her arms close to her body. There are two chairs near the woman, one at the far left and the other a bit to the right.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [27, 32, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "There is a dirt road right next to a sandy area where construction is occurring. A few traffic lights are scattered throughout the area, and multiple road signs are present, including a \"Royal National Boating Championships\" sign near the construction site. There is also a handheld device on the left side of the image. The scene features a variety of cars, with some parked near the traffic lights and others on the road, moving past the construction site.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 92131, "caption": "The image captures a public transit scene, with a green and white city bus parked at the edge of a parking lot. The bus is quite large and occupies a significant amount of space. Around 12 people are gathered nearby, some of them wearing winter coats, and others standing in different positions. Some of the passengers are lined up and ready to board the bus, while others have their hands in their pockets or appear to be engaging in conversation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "handbag"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 21779, "caption": "In the image, there's a woman who appears to be lying down on the floor. She is holding a large red and white suitcase and is positioned near the left side of the scene. In the background, a handbag can be seen placed further away from the woman, on the right side.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "The scene takes place in a baseball stadium where a baseball player is holding a baseball bat on the baseball field. The player is getting ready to hit a baseball and take a swing during a baseball game. Several other players are also visible in the scene, including an umpire and a catcher. There are also people in the background, likely teammates and supporters. The atmosphere is filled with excitement and anticipation for the upcoming game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 160183, "caption": "The scene features a fire truck parked in a closed parking lot. It is positioned close to the building, near the front. The truck has a white roof with a long ladder extending from the back of it, providing an essential function to firefighters.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 555456, "caption": " The image depicts a person skiing on a snowy hill. They are dressed in a blue parka and black pants, and their snow skis are visible on the ground. Several other people can be seen scattered around the scene, all seemingly enjoying their time skiing on the snow-covered slope.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 235595, "caption": "The image shows several motorcycles parked next to a building along a street. One of the motorcycles appears to be red, while the others are black. They are parked in a row along the side of the street, creating a uniform appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "motorcycle", "potted plant", "car", "parking meter", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 489909, "caption": "The image displays a basket filled with various fruits, specifically oranges, bananas, and cucumbers. These three fruits are spread out within the basket, each occupying a different section. The oranges are arranged towards the front, bananas are situated in the middle, and cucumbers are located towards the back of the basket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": "The image shows a large, white vase filled with a beautiful, pink flower arrangement. The pink flowers are arranged in a way that creates a striking, colorful composition. The flowers appear to be arranged in various places, both in the center and towards the edges of the vase, making it a visually appealing bouquet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": "A group of people, including a woman, are gathered in a living room playing a video game. The woman is holding a Wii remote, presumably controlling the game, while others watch her. A couch is located in the middle of the room, with some people sitting on it and others nearby. There are several bottles of varying sizes, likely containing beverages, placed throughout the room. One bottle can be found on a lower left corner, another bottle is in the middle left section, and a third one is located near the top left part of the scene. ", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["remote", "handbag", "tv", "potted plant", "book", "chair", "person", "couch"], "hallucination_idxs": [62, 77, 88], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.375}}, {"image_id": 409138, "caption": "The scene is set in front of a park or a parking lot, with a large white and purple bus positioned in the center of the image. The bus has various colored stripes, giving it a distinct appearance. The bus appears to be on the side of the road, facing towards the left side of the image. A number of cars can be seen in the vicinity, some parked behind the bus and others scattered in various locations around it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 411840, "caption": "The scene is set on a tennis court, where two women in tennis outfits are walking across the court. Both players are holding tennis rackets, indicating that they are playing a match. The first woman is closer to the camera, while the second one is farther away and walking towards the right side of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "backpack", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": "The image is a pizza topped with cheese and onions, with a sliced slice taken out of it. The pizza appears to be crispy and well-cooked. There are multiple pieces of bacon on the pizza, adding a savory and crispy texture. The onions contribute to the flavor and add a crunch to the dish. The pizza is taking up a considerable portion of the dish, showcasing the popular cheese and bacon combination.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": "In the image, there is a teddy bear with headphones on. It is sitting on a table or surface, possibly indicating that the environment is someone's home or workplace. There are multiple books and CDs lying around the table, with a total of six of each. There is also a cell phone sitting closer to the edge of the table, likely belonging to the person or the teddy bear.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["person", "person"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear", "dining table", "bottle", "book"], "hallucination_idxs": [57, 72], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 399058, "caption": " The image shows a man riding on the back of an elephant while walking down a road. The elephant is positioned near the center of the image, and the man appears to be sitting on its back as they travel together. The scene has a few vehicles parked along the street, including a couple of trucks and a car. The elephant is walking close to the edge of the road, while the man is sitting on its back, enjoying the unique experience.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["truck", "bicycle", "elephant", "orange", "person"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6}}, {"image_id": 183571, "caption": " The scene features a boat with a red and black design, likely a houseboat, on which a man is standing, looking down at the water on the other side of the boat. A dog is situated on the other side of the boat, near the man. There are a few additional people around the scene, but they are on the other side of the boat or on the boat next to it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "The scene shows a motorcycle prominently displayed at the front of a room, along with other items such as a wine bottle, cups, and pots placed around it. There is a crowd of people in the room, some of them closer to the motorcycle while others are more spread out.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [22, 24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 437910, "caption": "The image features a small, narrow bathroom with a toilet and sink occupying a significant portion of the room. The toilet is positioned towards the right side, and the sink is near the left side, both visible from the bathroom door. The scene appears quite dirty, with a small puddle of liquid under the toilet. No chairs or any other furniture are visible in the space.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": "The image is an old black and white photo of a small harbor with several boats docked in it. There are many seagulls flying around the boats and swimming in the water. Some of the birds are close to the boats, while others are further away. The scene is rather peaceful with no people or boats visible in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 467887, "caption": "The image depicts a man sitting in a chair, wearing glasses, and using a laptop computer. He is positioned at the edge of a couch, in front of a table with various items on it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "mouse", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 174402, "caption": " The image features a young man standing on a tennis court, holding a tennis racket in his hand as he is ready to play. He is dressed in white clothing and appears to be waiting for the ball to come to him or for the game to begin. The background of the image reveals other people in the distance, possibly watching or participating in the match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 352734, "caption": " In the image, a young boy is eating some kind of sweet dessert, likely cake, with a spoon. The boy appears to be sitting at a dining table eating the food while holding the spoon. There is a piece of cake on the table in front of him, indicating the type of dessert he's consuming. In the background, there are two other individuals, one sitting further away and the other closer to the boy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 346716, "caption": "The image features a busy city street with two red double-decker buses moving down the street, following a route close to one another. The first bus is travelling from left to right near the center of the scene, while the second bus is located towards the right side of the street. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": "The image features two snowboarders on a snow-covered slope, with their skis on their backs. The scene is a beautiful backdrop for the mountains, with some clouds visible in the sky. The two snowboarders are wearing red backpacks and stand close together. One of them has ski poles, and both are attentively observing the mountain surroundings. They appear to be enjoying their time on the snowboarding trail.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 295656, "caption": "According to the image, there are two people in the scene who seem to be in the process of eating slices of pizza. One person appears to be holding two slices, while the other person is holding only one slice. There are a couple of people standing in the background, and a car can be seen on the right side of the image. The people seem to be posing for the camera while enjoying their pizza.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "clock", "tv", "tie", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 129886, "caption": " The image shows a group of people dressed in snow gear and standing near ski equipment on a ski slope. One person is wearing skis and two others are standing nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, it features a young boy sitting at a table with a bottle placed next to him, possibly alcohol or a drink. He is holding his hand close to his face, seeming to be taking a picture or having a conversation. Additionally, there are two chairs nearby, one in the foreground and one in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 401010, "caption": "The scene is a bedroom with a single bed and a television, giving the appearance of a hotel room or a room for a disabled person. The bed is positioned in the center of the room, while the TV is placed against a wall near the bed. The bed measures about 30 inches wide and is covered with a white sheet. ", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["sink", "chair", "bed", "tv"], "hallucination_idxs": [26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 567562, "caption": "According to the picture, there is a woman sitting at a table with two children, enjoying a meal together. On the table, there is a bowl of food and a knife. The table also has a vase in the middle of the arrangement. ", "mscoco_hallucinated_words": [["knife", "knife"], ["vase", "vase"]], "mscoco_gt_words": ["pizza", "cup", "dining table", "bowl", "spoon", "bottle", "fork", "broccoli", "book", "chair", "person"], "hallucination_idxs": [34, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2727272727272727}}, {"image_id": 313928, "caption": "The image depicts a balcony or patio area overlooking a beach with a surfboard placed on the beach. There are three chairs surrounding the table, with one chair located closer to the surfboard, another chair near the edge of the balcony, and a third chair further back. The table is surrounded by potted plants, giving a touch of greenery to the scene.", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["surfboard", "dining table", "bottle", "wine glass", "chair"], "hallucination_idxs": [57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.6}}, {"image_id": 490284, "caption": "In the image, a black dog is laying across a couch, possibly resting or sleeping. The dog is positioned in the foreground and extends towards the left side of the frame. Several people can be seen in the background, some seated on the couches, one at the left edge and others at the top of the frame. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346940, "caption": " The image depicts a kitchen scene with a stovetop featuring a steaming kettle and a frying pan, along with a few other kitchen utensils. There are several knives scattered around the kitchen, with a bottle and a spoon nearby. ", "mscoco_hallucinated_words": [["bottle", "bottle"], ["spoon", "spoon"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [37, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 222782, "caption": "The image features a street scene with a person standing next to an old fashioned street sign that says \" amoco\" and another person walking on the sidewalk. The people are not doing anything special or significant in the picture, just standing there on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 255934, "caption": " The image features an elephant and a smaller elephant walking next to each other. They are standing in a dirt area, possibly an enclosure, surrounded by various people and other elephants. A couple of other elephants can be seen, one on the left side of the picture and another on the right. Some of the people are also walking around the enclosure, possibly observing the elephants.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "elephant", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 278627, "caption": "The image is a collage of various dishes and people eating and posing. The scene has two men standing close to each other, a woman in a red blouse, a man standing in front of a plate of food, a person holding a plate, and a person holding two cups of food. In the same scene, there are two more people standing near each other. All of the people appear to be enjoying their dining experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "spoon", "potted plant", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 400851, "caption": " The image features a large group of people enjoying a day at the beach, with a handful of tents and some chairs providing shade. The beach area is surrounded by various tents and umbrellas, likely set up for those attending the event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "umbrella", "chair", "backpack", "boat", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 472610, "caption": " In the image, a woman stands next to a white dog while a group of cows surrounds them. The cows are spread across the field, some closer to the woman and others further away. There are various cow silhouettes, some smaller and some larger, with the largest ones appearing closer to the left side of the field. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cow", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": "The image portrays a group of people standing on a sandy beach. There are several small sailboats floating around in the water nearby, giving the impression of a windy day. The people are positioned in various areas around the beach, with some standing near the water's edge and others located a little further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388237, "caption": " The image shows a man riding his skateboard down the street, with the surrounding area filled with people in various poses. There are two other skateboards visible in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 289640, "caption": "The image features a man standing in a public setting, holding a frisbee over his face. In the background, several people can be seen sitting at dining tables arranged around the area, some close to the foreground, and others a little further away. A frisbee is placed on top of a table near the man holding it, and another one is situated on a counter close to the edge of the scene. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "frisbee", "orange", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 122602, "caption": "A person is sitting and working on a large table covered with several different items such as books, scissors, glue, and a glue bottle. The table is filled with various objects related to crafts or paper projects. The table is also covered in a white sheet, which could be used for protection or as a work surface. Surrounding the table, there are a few scissors and a few spools placed throughout the scene, further emphasizing the table as a work space.", "mscoco_hallucinated_words": [["person", "person"], ["book", "book"]], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [1, 17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 24104, "caption": "The image features a desk with three computer monitors neatly arranged on it. On the left side of the desk, there is a laptop with a screen occupying the central position, while another laptop sits towards the top right corner. The third monitor is placed on the right side of the central laptop. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 183709, "caption": "According to the picture, there is a group of people standing next to each other near a crosswalk. They appear to be walking along the sidewalk, likely crossing the street. The scene shows the city atmosphere, with cars parked near the street, traffic lights, and numerous pedestrians around. Some individuals are wearing backpacks, and a couple is walking hand in hand, which suggests they might be traveling together.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 59710, "caption": " The image shows a refrigerator with its door open, revealing its contents. The fridge sits on top of a wooden floor in a bedroom. There is a sink located next to the refrigerator, which is quite small in size. The refrigerator is positioned close to the wall on the left side of the room, and a bed can be seen in the corner near the refrigerator. The room appears to have a mirror positioned on the right side of the image.", "mscoco_hallucinated_words": [["sink", "sink"], ["bed", "bed"]], "mscoco_gt_words": ["chair", "refrigerator", "dining table"], "hallucination_idxs": [30, 62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "According to the picture, the scene features a young boy in a baseball uniform standing on a field and holding a baseball ball. Another child, also in a baseball uniform, is standing in the background. Additionally, there are multiple people in the background, likely teammates or fellow players.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 314729, "caption": "The image is a vintage baseball card featuring a baseball player posing for the photograph. The player is holding a baseball bat, swinging it and waiting for the ball to be thrown. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 527497, "caption": "A computer is sitting on a desk, with two computer monitors displaying different screens. There is a keyboard sitting next to the computer, possibly indicating an office or work setting. A cordless phone is placed nearby, most likely for use in an office environment. Another phone can be seen near the left monitor, but it is not plugged in. The overall setting appears to be an office or work area, with computer equipment and a telephone at the ready.", "mscoco_hallucinated_words": [["phone", "cell phone"], ["phone", "cell phone"], ["telephone", "cell phone"]], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse"], "hallucination_idxs": [36, 51, 84], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.8}}, {"image_id": 411564, "caption": "According to the picture, there is a man sitting by a window, likely looking outside. The man is sitting on a couch or chair placed underneath the window. The kitchen is filled with a variety of items, such as a knife and two bottles, displayed on the countertop. A cell phone is also visible, likely being used by the man.", "mscoco_hallucinated_words": [["couch", "couch"], ["knife", "knife"]], "mscoco_gt_words": ["refrigerator", "dining table", "bottle", "cell phone", "chair", "person"], "hallucination_idxs": [24, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 320350, "caption": "The image displays a snowy scene where two people, presumably a man and a woman, are skiing with their two dogs. They are standing next to each other and their skis are positioned on the snowy ground, indicating their presence. The dogs, both wearing skis, are also standing alongside the couple.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": " A baseball game is taking place on a grassy field. There are several people on the field, including players, coaches, and umpires. The main focus is a baseball player swinging a bat at an oncoming ball, held in the hands of a catcher in the background. Various other participants can be seen throughout the scene, some standing or sitting nearby, and others actively involved in the game. The atmosphere is lively and energetic as the team members work together to achieve their goals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bench", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 285302, "caption": " The image features a young boy sitting at a dining table, possibly playing or engaging in an art project. He is cutting up a paper sheet using scissors, located near the table's center. Various items are present on the table, including some cups and a couple of bowls.", "mscoco_hallucinated_words": [["cup", "cup"], ["bowl", "bowl"]], "mscoco_gt_words": ["person", "chair", "scissors", "dining table"], "hallucination_idxs": [48, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.75}}, {"image_id": 61959, "caption": "The image features a bus traveling down the street, driving alongside several cars parked near the sidewalk. The bus appears white, and some of the cars are parked on both sides of the street. There are a total of six cars present, with four on one side of the street and two on the other side. Two of these cars are parked more closely to the sidewalk, while the other cars are further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 229659, "caption": "This is an image of a kitchen and restaurant. The kitchen contains a table and several chairs, placed throughout the space. A laptop is sitting on top of the table, likely being used by someone in the area. There are people dotted around the kitchen and restaurant. Some of them are sitting at tables while others are standing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 473118, "caption": "In the image, there is a person on a skateboard performing a trick, likely riding a skate ramp, as they are in the process of riding up the ramp. The person is in the center of the scene, with their back towards the right side of the image. There are also two bicycles in the background. One bicycle is positioned closer to the left side of the scene, and the other is further towards the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 391222, "caption": "The image features two people in winter clothing, possibly skiing or wearing cross-country skiing equipment, standing in the snow and looking at a sign. There are two sets of skis visible, one closer to the center of the scene and the other slightly farther away. The people seem to be looking at the sign as it is located nearby. The overall atmosphere suggests a scenic, wintry setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416104, "caption": "According to the picture, there is a large group of people seated outside at various tables under blue umbrellas, possibly to shield them from the sun. Some of the people are eating, likely with food in front of them. There is a sense of relaxation and camaraderie among the people, who are sitting and talking, enjoying the outdoor atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 511852, "caption": " Two people are skiing down a snowy hill in the image, with two sets of skis on the snow-covered slope. The first person appears on the left side of the hill, while the second person is closer to the right side. They seem to be skiing together, with both individuals wearing backpacks as they follow the slope. The scene is set against a snowy background, showcasing the winter sport of skiing.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 59386, "caption": "The image features a group of giraffes standing together in a grassy field. There are five giraffes visible, some standing close to each other, while others are more spread out across the field. A total of nine giraffes can be seen in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "According to the picture, there is a traffic light with a green smiley face in the middle of a road located in a city street. The light is positioned on the side of the road next to a building, and is easily visible in the scene. There are several cars parked along the street, with some closer to the traffic light and others further away. The cars are located both behind the traffic light and to its right, providing a good view of the street scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 153692, "caption": " The image depicts a man on a bicycle with the dog, passing through an intersection. They are in the middle of a busy street, surrounded by traffic lights, while crossing the street. The man on the bike is wearing a backpack, and the dog is also wearing a backpack. There is a second dog on the left side of the scene. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "dog", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 265628, "caption": "According to the picture, there is a toilet visible in a small room under a circular mirror. The toilet takes up a significant portion of the space, occupying roughly half of the room. In the scene, there is a person standing over the toilet, which appears to be looking down at the toilet. The person has his legs positioned near the bottom left corner of the image, while the toilet is positioned in the center, just above the person's feet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 499594, "caption": " The image features a dinner meal with meat, fish, and vegetables on a white plate. The meal includes meat, broccoli, and cheese. Various utensils, such as a knife and fork, are placed on the tray, possibly being used in the meal preparation or consumption. The table appears to be on a dining table, indicating that the meal is ready to be consumed. Overall, the meal appears to be a well-planned and tasty assortment of food items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "bottle", "fork", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 5934, "caption": "The image features a baseball and a baseball glove sitting on a bench in the dugout. The baseball is positioned close to the glove, suggesting that it might have just been caught or thrown in the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "In the image, there are several people walking along the side of a water body. The water body, likely a river or canal, is surrounded by a variety of buildings and bridges, and some people are walking near the water. A blue bridge is prominent, spanning across the water and connecting the sides. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "boat", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 47916, "caption": "The image depicts a grassy area filled with several dogs of different sizes, all enjoying the playtime together. One dog is in front of the group, playing with a red frisbee, while a second dog is farther behind and further to the right. Additionally, a third dog can be seen close to the right side of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bench", "dog", "frisbee", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 255483, "caption": "The image is of a person sitting at a table with various pieces of paper laid out in front of them. There are several books and sheets on the table. Some of the books have their covers open, while others are closed. The person appears to be writing or editing something on the table. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "person", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 131138, "caption": "The image displays a workspace with various computer and other electronic devices placed on a desk. There are two monitors \u2013 a computer monitor and a television \u2013 both placed on the desk. The computer monitor is connected to a keyboard and mouse, while the television is positioned towards the right side of the desk. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "spoon", "laptop", "potted plant", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5555555555555556}}, {"image_id": 258209, "caption": "A man is sitting on a bench in front of a building. The bench is located in the middle of a sidewalk, while the man appears to be sitting on the edge of the sidewalk. The surrounding area appears to be an urban street, with cars parked on the street. There are two cars visible in the scene, one closer to the building and the other further away from it. There is also a traffic light, with its arrow pointing towards the man.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"], ["traffic light", "traffic light"]], "mscoco_gt_words": ["bicycle", "book", "bench", "person"], "hallucination_idxs": [49, 58, 81], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.5}}, {"image_id": 347453, "caption": "The scene captures a busy city street filled with multiple red buses and a taxi. The buses are parked at various points on the street, while the taxi is positioned near the middle. There are traffic lights visible at the top of the image, as well as one car in the middle left of the scene. The street is relatively empty of people or other vehicles, except for the buses and taxi.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 285820, "caption": "In a large room with windows, a couch is positioned near one of the windows. The couch is brown and leather-covered. A black chair is also in the room, placed near the couch. In the room, there are also several potted plants arranged throughout the space.", "mscoco_hallucinated_words": [["chair", "chair"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["bench", "tv", "dining table", "bowl", "cell phone", "book", "couch"], "hallucination_idxs": [26, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.14285714285714285}}, {"image_id": 245664, "caption": "In the image, there is a desk with a laptop and a computer set-up on it. The laptop is placed towards the right side of the desk, while the computer is situated near the left side. There is a keyboard on the left side of the desk and a mouse on the right side. A glass or a bottle is also present on the desk, positioned near the laptop and keyboard. A number of books are scattered on the desk, along with several papers, suggesting various activities and interests.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["keyboard", "cup", "tv", "dining table", "laptop", "mouse", "book", "wine glass"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.06666666666666667, "Recall": 0.625}}, {"image_id": 453731, "caption": "According to the image, there is a sleeping dog on the floor in front of a large wooden barrels, possibly wine barrels. The dog is in a relaxed position, taking a nap. There are two other wooden barrels in the same room as the dog. The barrels seem to be placed near a table, which could be a dining table or a display table. There are also a couple of bottles or glasses on the table.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["cup", "dog", "bottle"], "hallucination_idxs": [59, 66, 70, 83], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 524601, "caption": "The image features a group of young people gathered in a large room, actively playing a video game. They are spread out across the room, some standing closer to the camera and others a bit further away. There are several remotes in the scene, indicating the participants are holding them as they play. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "refrigerator", "book", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 253262, "caption": "The image features a small white plane parked on the tarmac near a group of people. There are three people, two men and one woman, standing near the aircraft. They appear to be entering the plane, as they are in the process of unloading their luggage. There are two large backpacks near the people, likely containing their belongings.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["truck", "suitcase", "tie", "airplane", "person"], "hallucination_idxs": [56], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.6}}, {"image_id": 87052, "caption": "In the image, a man is looking at a refrigerator in a kitchen. He is wearing a gray sweatshirt. Next to him, a woman is also in the kitchen, possibly cooking or preparing food.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "oven", "refrigerator", "bowl", "bottle", "vase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 357578, "caption": "In this image, a red fire truck is driving down a street past the car and truck. There are several cars in the scene, including one near the front of the car, one partially visible to the left of the car, another one closer to the right side of the car, and two more towards the back of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " The scene depicts a living room with a couch, a chair, and a lamp. There are two couches in the room, one positioned closer to the left side and the other farther to the right. A chair is placed in the center of the room. There is a lamp sitting on a table near the center of the room, illuminating the space.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["remote", "cell phone", "book", "chair", "person", "couch"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 259345, "caption": " A man in a colorful dress is sitting under a pink umbrella, holding it open to provide shade. He is sitting on a bench and there is a second bench in the background, possibly indicating an outdoor setting. ", "mscoco_hallucinated_words": [["bench", "bench"], ["bench", "bench"]], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [25, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 119709, "caption": "According to the picture, there is a woman standing in a kitchen, actively involved in the baking process. She appears to be preparing or working on some donuts, arranging them on a dining table nearby. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "donut", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, there is a man sitting on a bench with a cell phone in his hand, and a woman sitting next to him. Both individuals are wearing caps, which suggests that they might be from an Asian culture. There is another person in the background, but they are farther away from the man and the woman. In addition to the bench, there are some bottles and cups nearby, which could be for public use.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["handbag", "bench", "cell phone", "backpack", "person"], "hallucination_idxs": [75, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 120776, "caption": "The image captures an outdoor scene with a large group of people gathered near a modern building and a couple of tall umbrellas. The people are standing in various locations, some closer to the street and others further away. The umbrellas are spread out, with one umbrella located near the middle of the scene, and another closer to the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": " The scene is set on a tennis court, where a man is playing tennis. He is holding a tennis racket, swinging to hit a ball that is in the air. Two other people are visible in the background. One person is standing further away from the court, and the other is closer to the court, likely watching the match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": " A person in a blue shirt is standing on the sidewalk in front of a row of shops, including a large shopping mall and a smaller shop nearby. The street is a bit dark and there are some streetlights on it, providing light to the area. There are also a few cars and a vintage Volkswagen bus parked along the sidewalk, adding to the urban atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "handbag", "traffic light", "cell phone", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 276719, "caption": " The scene takes place on a street with a dog, surrounded by various cars. The dog is walking across the street, leading the group of people who appear to be following the dog. There are a total of 11 people in the picture, with some standing closer to the cars and others in the background. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "handbag", "motorcycle", "dog", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": " The image features a horse pulling a large sack or bag filled with snow while wearing a blanket. The horse appears to be outside near a building or a house, possibly on a path or a road. There are also other vehicles nearby, possibly parked close to the horse or nearby the building it's near.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "snowboard", "car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 354559, "caption": "The image depicts a woman standing in front of a white wall, possibly taking a selfie or a photo. She is wearing a black and white striped shirt and has her long brown hair held back with braces, giving her a stuffed animal a closer look. The stuffed animal appears to be quite large, taking up much of the woman's arms and hands.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 438788, "caption": "The image features a cake placed on a table, with two pieces of it missing. There are two knives, one placed near a piece of the cake, and the other knife close to the other piece of cake.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 468005, "caption": "The image features a large elephant, possibly an Indian elephant, with its trunk open as if it's about to spray water. It's walking down some steps that are located near a body of water. The steps are situated on the side of a bridge-like structure, leading down towards the beach or water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 545594, "caption": "In the image, a group of people is seen riding a public bus, with a woman wearing a backpack in the front row and multiple people sitting behind her. Some of them have handbags and backpacks, and there are a few people sitting or standing on the right side of the bus. The bus appears to have multiple rows of seats, providing ample space for all the riders. Overall, it appears to be a crowded bus with multiple individuals on board.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["backpack", "chair", "orange", "person"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.75}}, {"image_id": 219546, "caption": " The image showcases a table covered with various types of food, including a bowl with meat, another bowl with different types of vegetables, and a bottle of red wine. The table is spread out across a dining room and appears to be set for a festive event or a special occasion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bowl", "spoon", "bottle", "fork", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 386645, "caption": "The image features a view of a road with a building on the side, as well as a few vehicles. A truck is positioned in front of the building, and a few cars can be seen driving along the street. A group of people is also present in the scene, with one person standing near the building and the others spread throughout the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 155131, "caption": "The scene takes place on a sports court where a woman in a white outfit is making a save while playing tennis, using a tennis racket in her hand. She is in the center of the court, while various people are scattered around the area, watching and possibly waiting for their turn to play. Some of these people are standing near the edges of the court, while others are closer to the woman in the middle. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tie", "tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 188752, "caption": " The scene features a banana stand with numerous bananas hanging from the ceiling and a person standing nearby. There are numerous bananas displayed throughout the market, with some arranged in bunches and others hanging individually. The person appears to be a seller at the stand, possibly providing the customer with various options to choose from. The bananas range from being light and slightly brown to fully ripened, creating an attractive display for the customer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": " A group of people are sitting at tables outside under a series of red umbrellas. Some are near the edge of the patio, while others are in the center. The umbrellas are placed over several tables, creating a shaded area for those sitting and dining. There are at least 13 chairs at the tables.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "umbrella", "dining table", "bowl", "tie", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437720, "caption": "According to the image, the painting shows a nude person sitting at a table in a room with a red chair. There is a painting on the wall, and a vase and some books are displayed on a shelf in the background. The table is also adorned with a flower vase, two wine glasses, and a wine bottle. In the foreground, there are two red chairs, one of which the person is sitting on.", "mscoco_hallucinated_words": [["book", "book"], ["wine glass", "wine glass"], ["bottle", "bottle"]], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [36, 57, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2727272727272727, "Recall": 0.8}}, {"image_id": 378831, "caption": "According to the picture, there is a woman walking down a street. She is carrying a shopping bag and is walking towards the right side of the scene. The street appears to be empty, except for the woman and the shopping bag she is holding. Additionally, there are two fire hydrants in the scene, one of them more prominent towards the left side of the picture. A handbag can be seen on the ground near the woman, but it does not belong to her.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 375503, "caption": "According to the image, the scene is an upstairs view of a house, possibly a home or a residence, with a wooden staircase leading to the upper level. There is a room on the left side of the image and another on the right side, both featuring ornate wooden staircases and ornate wallpaper. The room on the left features an ornate wooden clock on a table, and the room on the right features a couple of chairs, possibly arranged around a table or other piece of furniture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "tv", "dining table", "laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 330369, "caption": "The image features a red and white boat docked in the water near the shore. The boat appears to be in the foreground of the scene, with the ocean stretching out further into the distance. There is another boat visible at the edge of the image, potentially located in the harbor or in the water nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, there is a man standing near a tree stump, with a frisbee thrown in front of him. Another person is also visible in the background, possibly watching the frisbee being thrown or catching it. The scene appears to be set in a grassy area with various objects located throughout the scene. A bottle is placed close to the tree stump, and two bicycles are located farther away from the main action.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bicycle", "frisbee", "person"], "hallucination_idxs": [62], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 32190, "caption": "A toy horse is on the sidewalk in front of a street corner, near a curb. There are several cars parked or driving on the street, some close to the horse, and a few further away. The scene seems to be taking place in front of a store, as there are multiple cars with drivers visible in the parking lot. The parking lot has a few cars parked, both close and far from the store entrance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 535403, "caption": "A person has taken a photo in a bathroom, capturing the sink, mirror, and several bottles arranged on a counter. The bathroom features a combination of clean white walls and tiled surfaces, creating a unique look. There are two sinks, one closer to the left side of the image and another towards the right, surrounded by a couple of bottles on the counter.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["clock", "potted plant", "bottle", "toilet", "sink", "hair drier"], "hallucination_idxs": [1], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3333333333333333}}, {"image_id": 449798, "caption": "According to the picture, there is a white computer placed in the center of a room. Near the computer, there are several toy figures arranged in various positions. These toy figures are on display, and they are placed on the table and other surfaces. They appear to be a mix of different types of toys, including some stuffed animals and figurines. The arrangement creates a unique setup in the home.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "dining table", "laptop", "mouse", "cell phone", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 375769, "caption": " There is a young boy eating at a restaurant, possibly on a street. He is sitting at a table with a drink near him, surrounded by numerous chairs. The table is covered with a tablecloth, and there are two bicycles outside the restaurant, with one on the left side and another one on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "cup", "umbrella", "dining table", "hot dog", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 426172, "caption": " The image depicts a dining room setting with a table, various dishes, and a countertop display of wine bottles. The dining room is furnished with chairs, and a TV can be seen in the background. The table holds several items, including a pair of books, a book on a stand, a wine glass, a cup, and a pair of shoes. A few bottles of wine are also displayed on the countertop.", "mscoco_hallucinated_words": [["tv", "tv"], ["wine glass", "wine glass"], ["cup", "cup"]], "mscoco_gt_words": ["knife", "dining table", "bottle", "book", "chair"], "hallucination_idxs": [32, 59, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.8}}, {"image_id": 20650, "caption": "The image showcases a meal consisting of a sandwich placed on a colorful plate, surrounded by a mix of vegetables, including broccoli, which is prominently featured in the dish. There is also a fork on the plate, which might be used to eat the sandwich and its accompaniments.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "broccoli", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 138078, "caption": "The image features a young man in red and white baseball uniform standing on a grass field. The boy is wearing a baseball glove and appears to be focused, likely waiting for the ball to be thrown.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 152499, "caption": "In the image, there is a family gathering with several people, including children, around a pool filled with water. Some of the people have drinks and a party hat, suggesting they are enjoying a fun occasion together. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "umbrella", "dining table", "potted plant", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 180764, "caption": "The image depicts a man sitting in a kitchen, taking a bite of a pink glazed donut. The donut is positioned in front of the man, highlighting the scene. The kitchen features a few items, such as a chair, a cereal bowl, and a sink, suggesting a domestic setting for the man's meal. The background also reveals a couple of bottles, possibly containing drinks or condiments.", "mscoco_hallucinated_words": [["chair", "chair"], ["bowl", "bowl"], ["sink", "sink"]], "mscoco_gt_words": ["microwave", "donut", "person", "bottle"], "hallucination_idxs": [43, 47, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.75}}, {"image_id": 147762, "caption": "This image is a close-up view of a plate, likely to be found on a table, with a piece of food or dessert topped with a yellow substance. The cake or dessert appears to have a slice of orange on top, and it is decorated with a yellow glaze. The plate is positioned on a dining table.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["fork", "cake", "dining table"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 113860, "caption": "According to the picture, there is a boat floating down a river. The boat is very visible in the center of the image, occupying the main area of the scene. It appears to be a pontoon boat, as several pontoons can be seen extending along its length. The river water extends beyond the boat, filling most of the image. There are several people visible in the scene, some near the boat and others farther away. One person is situated on the right side of the image, while another can be seen on the left side, further away from the boat. Additionally, a person is standing at the bottom center, near the water's edge.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460866, "caption": " The image depicts various toothbrushes in a cup, which is filled with them. In total, there are five different toothbrushes, one on the left, two in the middle, and two on the right side of the cup. They occupy different positions within the cup, creating a unique and somewhat random arrangement. Some of the toothbrushes are taller and placed towards the back, while others are more spread out and towards the front of the cup. The image captures the different colors of the toothbrushes, showcasing the popular orange-colored one towards the middle of the group.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "dining table", "bowl", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 177015, "caption": "The image depicts a man sitting on a couch, with a bearded man sitting on his lap holding a small cat. They are both looking at a laptop, with the cat resting on the man's lap while he uses the laptop. There are two chairs in the background, one on the left and one on the right, while another chair is placed in front of the couch where the man is sitting. A couple is sitting on the other couch on the right side of the scene. They appear to be enjoying some downtime together, possibly watching a movie or browsing the internet on the laptop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "laptop", "cat", "apple", "chair", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "The image portrays a small and cozy kitchen with white appliances, including a refrigerator to the left and a microwave oven located above it. There is also a sink located on the opposite side of the room, just behind the refrigerator. In the middle of the kitchen, a wine bottle can be found on a shelf, perhaps indicating a favorite wine. Another person is also present in the room, possibly a family member or friend enjoying the company. The entire kitchen occupies the majority of the image's space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "suitcase", "oven", "refrigerator", "bowl", "bottle", "sink", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47131, "caption": "The image depicts a black and white photo of an overturned train, likely a passenger train or freight train. Many people, some of whom appear to be in their 20s or 30s, are gathered around the overturned train, which is located on a grassy area. Some of the men are on top of the train, while others are standing nearby. There are multiple groups of people in the image, each with at least one person on the train. The image captures a dramatic moment, likely the aftermath of a derailment or collision involving the train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425447, "caption": "A large white truck is driving down the street, spraying water near a tree. The truck is parked in the middle of the street, blocking traffic as it performs the cleaning task. A police car is nearby, parked further from the truck, but still visible in the scene. There are also two other cars parked closer to the side of the street. The trees in the image have been sprayed with water, as indicated by the misty and shiny appearance.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"]], "mscoco_gt_words": ["truck", "fire hydrant", "person"], "hallucination_idxs": [38, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.3333333333333333}}, {"image_id": 295105, "caption": " The image captures a scene of a man standing in a kitchen. He appears to be standing near an open refrigerator door in the left side of the scene. Another person can be seen behind him, walking away from the scene towards the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "refrigerator", "bowl", "bottle", "apple", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 443259, "caption": "The image shows a large kitchen with a stove and refrigerator situated at the back of the room. There is a dining area, consisting of several chairs placed around a wooden table with a vase on top. The kitchen has a vent hood above the stove, which is part of the overall kitchen design.", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["oven", "refrigerator", "dining table", "bowl", "potted plant", "bottle", "orange", "sink", "chair"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.4444444444444444}}, {"image_id": 537861, "caption": "The image features a train station, with a large train parked on the tracks near the station. Two other trains can be seen in the scene; one parked closer to the end of the train track, and the other partially visible further down the track. The train station has two train tracks on the side of the train, as the trains are stopped and not currently in motion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "The image shows a toy donald duck wearing a christmas sweater next to a large group of people, possibly at a disneyland attraction. There are several large trees behind donald duck, and many small toy objects, including dolls, can be seen in the foreground.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bird", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 249969, "caption": "The image features a plate filled with a variety of food items, including green beans, carrots, turkey, and potatoes. There is also some meat, likely chicken, visible on the plate. In addition, there is a combination of different utensils, such as a knife, fork, and spoon, which are placed around the dish.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 146504, "caption": "In the image, a person is sitting in front of a table or desk. They are holding their hand on top of a computer keyboard and using a computer mouse, positioning themselves near the keyboard. A coffee mug can also be seen on the desk, further emphasizing the casual nature of this setting. The person's hand is prominent in the scene, indicating their involvement with the computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 40037, "caption": "This is an image of a clock tower, which is a distinctive building with a clock on top. It's tall and has a steeple on top, which is a common feature of churches and some other buildings. The clock tower is positioned in the middle of a street or town square, surrounded by buildings on either side. Several cars are parked near the tower, including one in front of the tower and several more along the road. There are also a few people nearby, with one person on the left side of the image, another in the center, and one more on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 343218, "caption": " The scene is set on a tennis court with a man standing on the court holding a tennis racquet. He appears to be getting ready to hit the tennis ball, as it is near him in the air. In the background, several other people are also on the court, possibly playing other matches or warming up.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301837, "caption": "The image displays a long narrow kitchen with a skylight above. There are two refrigerators, one placed to the left of the kitchen and the other closer to the middle of the room. On the countertops are several bottles, cups, and a bowl. A sink is located at the center of the room, with a single bowl on the left side and an oven on the right side.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["bowl", "bowl"], ["bowl", "bowl"]], "mscoco_gt_words": ["clock", "oven", "refrigerator", "apple", "sink"], "hallucination_idxs": [41, 43, 47, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.6}}, {"image_id": 309071, "caption": " The image captures a man, possibly a boy, leaning down to touch the pedals of a bicycle. The bike is positioned closer to the right side of the image, while the man is on the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "potted plant", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 240681, "caption": "According to the picture, a man with a backpack is standing next to a red motorcycle. The motorcycle is located on the side of a road and appears to be parked. There are also several other vehicles parked along the road, such as a car and a couple of bicycles. Some traffic signs can be seen further down the street, and one can be spotted to the left of the main focus.", "mscoco_hallucinated_words": [["bicycle", "bicycle"]], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 178753, "caption": " The scene depicts a young girl, possibly a child, sitting in a red chair in an amusement park or stadium, holding her cell phone. She appears to be using it while sitting down in a seating area. There are a few more chairs in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278890, "caption": " The image depicts a person holding a cellphone open in their hand. The cellphone appears to be placed on a table or surface, as the person's hand is not entirely visible. The person is surrounded by other objects in the scene, including two bottles, one placed on the left and the other on the right, and a cup near the left bottle. There is also a handbag located close to the left side of the image, further reinforcing the presence of multiple people in the area.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"], ["bottle", "bottle"], ["handbag", "handbag"]], "mscoco_gt_words": ["cell phone", "person", "dining table"], "hallucination_idxs": [48, 64, 68, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.36363636363636365, "Recall": 1.0}}, {"image_id": 47781, "caption": "There is an image of a bedroom featuring a modern bed with a white bedspread and red blanket. The room is also adorned with a couple of nightstands. One of them contains a bowl and a couple of cups, while the other nightstand has a bottle and a vase on top. The image conveys a clean and tidy atmosphere with the various pieces of furniture in the room.", "mscoco_hallucinated_words": [["bowl", "bowl"], ["cup", "cup"], ["bottle", "bottle"]], "mscoco_gt_words": ["tv", "vase", "book", "bed", "person"], "hallucination_idxs": [35, 40, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.4}}, {"image_id": 542792, "caption": " The image features a large brown stick, which is prominently displayed in the middle of a street, surrounded by buildings and other city elements. The brown stick resembles a baseball bat and is located near the edge of the road, partially blocking traffic.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 293703, "caption": " There are three people sitting around a dining table at a restaurant. One man is sitting on the left, another man is seated in the center, and a third person is on the right. They are all smiling and laughing together. A delicious pizza is placed on the table in front of them, with several slices ready to be enjoyed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "bowl", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 318777, "caption": " The image features a baseball game being played in a large stadium filled with spectators. The action is taking place on a grassy field, with several baseball players scattered across the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 14773, "caption": "There is a unique hot dog dish served on a plate, which features a replica car filled with various toppings. The plate is placed on a table in the center of the image. There are a couple of chairs around the table, including one on the right side and another one near the back of the image. The table appears to be a dining table or a kitchen island with a counter top. In addition to the hot dog dish, there are cups of different sizes on the table, indicating that people have recently enjoyed the meal or drink.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "fork", "hot dog", "car", "person"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.625}}, {"image_id": 521863, "caption": "The image captures a skateboarder riding his skateboard in a busy area. There are several people in the vicinity, walking or standing around, and multiple bicycles can be seen in the scene. Some people appear to be carrying books, suggesting that the area might be near a library or bookstore. ", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 250540, "caption": " The image depicts a group of people sitting on motorcycles, occupying both sides of a sidewalk during a rainy day. There are three motorcycles in total, and several people are riding them, including a man and woman closer to the front and another group further back. The people are dressed in raincoats, and some of them have umbrellas.", "mscoco_hallucinated_words": [["umbrella", "umbrella"]], "mscoco_gt_words": ["handbag", "motorcycle", "cell phone", "car", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.4}}, {"image_id": 211604, "caption": " In the image, there is a woman laying on the floor, holding a tennis racket in her hand. She is wearing a hat and has two additional hats positioned behind her, suggesting a group setting or a gathering. The tennis racket she holds is positioned closer to her, while the other hats are situated further away from her.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "This is a picturesque image featuring a combination of food and drink. The main image is a plate with food on it, including meat pieces and a beverage. The beverage is a bottle of beer. There is also a slice of pizza on the plate, positioned next to the meat pieces. The plate is accompanied by a bottle of beer.", "mscoco_hallucinated_words": [["pizza", "pizza"]], "mscoco_gt_words": ["bowl", "cup", "dining table", "bottle"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 563938, "caption": "The image portrays a scene with a man and a woman sitting on a bench or couch. Both of them are wearing sunglasses, and there are some cups and bottles visible in the background. One cup is placed closer to the man, while a couple of bottles are situated further to the left.", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"]], "mscoco_gt_words": ["bench", "tie", "bottle", "person", "couch"], "hallucination_idxs": [29, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.8}}, {"image_id": 461311, "caption": " There is a large crowd of people standing in a field, with many people flying kites of various sizes. The kites fill the entire sky, with some kites being higher and others lower in the air. Among the people in the field, there are several individuals standing in groups or by themselves, enjoying the outdoor activity. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": " The image is of a person sitting on a bed with their legs and feet resting on it. This person has their laptop and books placed beside them. The laptop and books are positioned on the right side of the bed, while the person's feet are located on the left side. The books are spread across the bed and are placed in various positions, one near the lower left corner, and two others towards the center. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "laptop", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261318, "caption": " In the image, a man is sitting on a red chair, using a cell phone while leaning against a bench. Near him, two large suitcases are positioned in close proximity to one another, one on the left and another on the right. There are also additional suitcases farther in the background, giving the scene an air of travel. The bench where he is sitting has two chairs positioned near the bench's back, one on the left and the other on the right. Some potted plants can be seen at the edge of the scene.", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["suitcase", "handbag", "bench", "cell phone", "chair", "person"], "hallucination_idxs": [94], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.8333333333333334}}, {"image_id": 134768, "caption": "The image features a keyboard with speakers placed near the left side. The speakers occupy the top right and bottom right corners of the picture, while the keyboard is located towards the top left. The combination of keyboard and speakers suggests that this is an electronic setup for enjoying music or watching multimedia content. The keyboard appears to be a regular computer keyboard, with keys visible along the top half of the device. Additionally, there's a mouse placed on the right side of the keyboard, possibly for use with the computer. Overall, this setup appears to be a well-equipped computer setup with speakers and keyboard for multimedia entertainment or work purposes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "laptop", "mouse", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 526414, "caption": "The image depicts a small room with a dining table situated in the middle of the room. A sewing machine, a television, a couch, and a computer are present in the room, contributing to the converted living space turned workspace atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "cup", "tv", "dining table", "bowl", "laptop", "cat", "book", "scissors", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 273132, "caption": " In the image, a woman is sitting on a park bench, resting and relaxing. She appears to be wearing a hat, and there are several bags placed around her. The scene seems to be an old black-and-white photograph. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": "The image shows a man holding a hot dog with several toppings, such as lettuce, ketchup, and mustard, as he stands in front of a brick building. The man seems to be enjoying his meal, making for a fun and entertaining scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": " The image depicts a baseball game, with multiple people visible on the field. A baseball player is seen standing near the center of the field, getting ready to throw the ball to a specific spot. Several other people, including teammates, are also spread throughout the field, some closer to the player, while others are in more distant positions. The players are wearing baseball uniforms and gloves, and their actions indicate they are practicing or playing the game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 272728, "caption": " The image features a large, potted flower arrangement in a beautifully decorated, multi-tiered glass vase. The arrangement is filled with a variety of different flowers and pink, purple, and orange hues dominate the scene. There are several potted plants and flowers within the arrangement, adding more colors to the overall visual appeal. The vase itself is prominently displayed on a table, giving it a more prominent position in the scene. Overall, the image captures a lively and colorful arrangement of flowers.", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["vase", "potted plant", "chair", "dining table"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 509811, "caption": " The image shows a school room or library filled with various furniture, including desks and chairs arranged in a couple of rows. The tables are adorned with numerous laptops placed in different spots, creating an environment of learning.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "apple", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 412407, "caption": "According to the picture, several people are present at the event or gathering, with one person holding a cell phone close to their hand, which has a black screen with white pixel squares on it. There are others in the background, but their focus seems to be on the person holding the phone. The event or gathering appears to be a social occasion.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "The image shows a woman standing in the rain under an umbrella. There are several other people in the background, likely walking or passing by on the street. The umbrella is a polka dot design, adding a fun and festive touch to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 441363, "caption": "The image captures a tennis player swinging his racket on a red tennis court, possibly practicing or engaged in a game. There are two other people on the court, likely his teammates or opponents, who are also watching the action. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 331084, "caption": "The image displays a dining room table filled with a variety of pizzas, vegetables, and other toppings. There are two large, tray-style pizzas on the table, with the first being cut in half. There are also several small pizzas, one of which is sliced.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "knife", "dining table", "bowl", "spoon", "carrot", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 282553, "caption": "According to the image, there are two women walking down a dirt road in a field. The women are walking in opposite directions, with one closer to the viewer and the other further away from the viewer. The dirt road seems to be the main path through the field. The scene is captured at sunset, with the light coming from the right side of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, the scene shows a woman in pink shirt and blue jeans standing by a park bench. She is leaning over the bench, tying her shoelaces as she prepares to get dressed. There is a bike nearby, either a parked one or one that the woman rode to her destination.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 3103, "caption": "A young man wearing a yellow helmet is performing a trick on a skateboard, possibly a ramp. There are several other people in the background, some of them seem to be watching the skateboarder while others might be waiting or standing around. Overall, it appears to be a fun environment with people engaging in various activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "The image features a very empty kitchen with a few appliances and a table. There are a few pots on the counter, a sink and an oven in the middle of the kitchen space. In the foreground, there's a table with a few food items on top of it, including a bowl and an apple. The kitchen has a very clean and organized look.", "mscoco_hallucinated_words": [["oven", "oven"], ["apple", "apple"]], "mscoco_gt_words": ["dining table", "bowl", "orange", "sink", "carrot", "person"], "hallucination_idxs": [28, 60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 449675, "caption": " The image features a dining table with a piece of food, specifically a sandwich, and a bottle on it. There's a fork nearby, with one end protruding from the plate, indicating that the sandwich has been partially consumed. There's a knife as well, placed closer to the bottle. A salad is also present on the plate, served along with the sandwich.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "knife", "dining table", "spoon", "bottle", "sandwich", "fork", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.625}}, {"image_id": 191691, "caption": "According to the picture, it features a man in a bathroom standing in front of a mirror, getting ready for his day. The man is shaving his face in the bathroom, shaving his right side as shown in the image. He is also using a razor to accomplish this task.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 89378, "caption": "In the image, there are two motorcycles \u2013 one in front of the other. The front motorcycle is mostly black and white with a small red section in the middle, while the second motorcycle is smaller and primarily red. A man, wearing black and white clothing, is positioned between the motorcycles, close to the front one. The background consists of a series of garages. The man appears to be standing near the motorcycles as they are parked next to each other.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 99874, "caption": " The image features a woman sitting at a table in a cafe, using her cell phone. She has sunglasses on and is talking on her phone, possibly taking a call or using it to communicate with someone else. In the background, there is another person in the cafe, possibly enjoying their meal or waiting for someone. The table is occupied with a cup on the left side and an empty cup nearby. There is also a bottle on the right side of the table, possibly containing a beverage.", "mscoco_hallucinated_words": [["cup", "cup"], ["cup", "cup"]], "mscoco_gt_words": ["handbag", "dining table", "bottle", "cell phone", "cat", "person"], "hallucination_idxs": [69, 77], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 437895, "caption": "According to the picture, the image depicts a large bed with a canopy above it, making it a four-poster bed. The bed is covered with a colorful blanket, giving it a warm and cozy feel. It appears to be in a room or a bedroom setting, possibly a guest room or a bedroom with a canopy bed. The bed is occupied by someone or a couple, or it could be simply a decorative piece.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": " The image depicts a man wearing a suit and tie sitting at a table with his hand pointing towards the left side. There is another person sitting behind him in the background, and another person is partially visible further away. A book and a knife can be seen on the table, as well as a cell phone placed on the surface to the right side of the image. A door is located in the background, to the left side of the scene, possibly leading to another room or a hallway.", "mscoco_hallucinated_words": [["book", "book"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["knife", "dining table", "tie", "spoon", "fork", "person"], "hallucination_idxs": [44, 59], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.6666666666666666}}, {"image_id": 159979, "caption": " The image depicts a busy harbor scene with various boats docked on the water. Multiple umbrellas, both big and small, are visible overhead, providing shade for the people sitting on the boats. A number of people are standing or sitting around the boats, some with umbrellas. They appear to be selling or buying goods such as fish, as they are accompanied by a lot of fish.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "boat", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 357, "caption": "The image depicts a baseball game in progress, with several players on the field. A batter is up to bat and a batter has a ball held in his hand, getting ready to hit a baseball. A pitcher is on the mound, ready to make a throw, and an umpire is positioned behind the catcher, watching the play.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 538891, "caption": " The image features a woman in a white shirt standing on a tennis court. She is holding a tennis racket in her hand, getting ready to hit the ball. There are several other people in the background, some of them standing closer to the woman, while others are further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "Scene description", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 565778, "caption": "According to the picture, there is a train passing by on the train tracks. The train appears to be blue and long, traveling from right to left through the scene. There is also a tunnel in the background, which seems to be the destination of the train. The train appears to be moving fast, with the speed and motion evident from the perspective of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "train", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 7125, "caption": "According to the image, there is a bike path along the road by the water. The bike path extends for some distance and a group of people are riding their bicycles on this path, enjoying a leisurely ride. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "bottle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": " The image depicts a man wearing a red tie in front of a large window. The tie has hearts drawn all over it, possibly written by the person wearing it. The man seems to be reaching for his neck to adjust his tie.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "dining table", "tie", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 280930, "caption": "In the image, there is a young girl standing in a kitchen, laughing while holding a pan filled with baked dough. She appears to be enjoying her experience of cooking or baking.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "oven", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 151081, "caption": "This image showcases an airplane flying above a field as it drives slowly near the edge of the road. There is a tractor working on the ground in the same area, near the middle of the image. The airplane appears to be on a gray, overcast day as it makes its way over the field. The field is also brown in color, and there are a few more airplanes in the sky, though they are smaller and further away from the main airplane in the picture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": "A person is standing on a green tennis court wearing a yellow raincoat and a hat, holding a tennis racket in each hand. There are eight tennis balls around them scattered across the court. The background is a tennis net, with four main posts placed across the court. The person is standing in the middle of the court, holding the two tennis rackets in different positions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": "The image features a white platter with a meal consisting of a hot dog and fries. The hot dog is served with ketchup and two bite-sized fries are placed alongside it, creating a satisfying and tasty meal. In the background, there's a bottle containing a condiment, likely a sauce or dressing to complement the meal. This image suggests a fun and casual dining experience, ideal for a quick meal or snack.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "hot dog", "broccoli", "dining table"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 198163, "caption": "The image features a man in a gray-blue jacket sitting inside a damaged building, surrounded by numerous small plates filled with different colors of paint. The man is focused on the plates, examining them closely, as if he is working on a project or painting. There are a few bowls placed on a shelf nearby, adding to the visual assortment of colors and items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 3742, "caption": "The scene features a vase filled with different kinds of apples arranged in a beautiful pattern. The vase is placed against the wall, and some of the apples can be found both inside and outside the vase.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "apple", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 217593, "caption": "The image shows a young boy in a baseball uniform, wearing a baseball glove and a baseball cap. The boy appears to be ready to throw a baseball or play the position of the catcher. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "chair", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 279078, "caption": "The image depicts a young person standing in front of an open refrigerator filled with bottles and beverages. There are various bottles of various sizes on the refrigerator's shelves, suggesting a well-stocked fridge. A few bottles are located towards the left side, with another group located towards the right side of the refrigerator. The young person appears to be curious, pointing to one of the bottles on the refrigerator's shelf.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "refrigerator", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": " The image features a tennis match being played on a court with multiple people on both sides. Two tennis players are on the same side of the net, with one of them holding a tennis racket while standing close to the right side of the court and the other one holding a tennis ball. A man and a woman can be seen on the opposite side of the net, with the woman holding a tennis racket and standing near the right side of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "chair", "sports ball", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 477016, "caption": " The image depicts a group of people in a small kitchen area, standing in different positions around a toy oven. A mother and her child are the main subjects in the scene, while the other people in the area are also attentively looking at the oven. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "oven", "backpack", "potted plant", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 301266, "caption": "In the image, there are four people standing at the table, actively engaged in working together. There are three baskets at the table filled with various items, such as apples and plastic bottles. One person is reaching for an object within the basket. Additionally, there are four spoons lying on the table, being utilized in the work they are doing.", "mscoco_hallucinated_words": [["apple", "apple"], ["bottle", "bottle"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "book", "fork", "sink", "person"], "hallucination_idxs": [33, 36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.42857142857142855}}, {"image_id": 475813, "caption": "A young child is standing in a room, possibly in their bedroom or another area, wearing a nightgown and holding a bottle. There are two cell phones nearby, with one held by the child and the other on the left side of the scene. In the scene, there are also three backpacks, with one positioned closer to the right side, another slightly in the middle, and the last one on the left side of the room. ", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cell phone", "cell phone"], ["backpack", "backpack"]], "mscoco_gt_words": ["remote", "handbag", "person"], "hallucination_idxs": [23, 28, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6, "Recall": 0.3333333333333333}}, {"image_id": 558784, "caption": "According to the picture, there is a man walking along a path next to a pond, with a long stick in his hand. The man is standing in the grass, enjoying the outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": "In the image, a person's hand is holding a pair of silver spoons, with one spoon in front of the other. The spoons appear to be large and in close proximity, suggesting they are being handled or manipulated. The hand is holding the spoons from the top and the spoons are positioned near the center of the image. The background of the scene appears to be a stainless steel sink, which is likely the location where the spoons are being handled.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "spoon", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 359164, "caption": "In the image, there are two dogs, one brown and one white, enjoying a sunny day playing outside on the grass. The brown dog is the bigger of the two, and the white dog is smaller. They are both in the foreground of the scene, with the white dog closer to the right side and the brown dog closer to the left side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "chair", "dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 539951, "caption": "The image features a toy kitchen that has a checkerboard floor and a fridge to the left. There are various kitchen appliances in the scene, including an oven to the right and a microwave in the middle. A few refrigerators are positioned throughout the kitchen, adding to the sense of variety and modernity.", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["sink", "clock", "oven", "refrigerator"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.5}}, {"image_id": 143479, "caption": "In the image, an elderly man is holding a tray of hot dogs, with some on the left side and others scattered around the tray. He is wearing a tie and a white shirt. There are two more people in the image \u2013 one person is standing at the right side of the elderly man, and another person is sitting behind him to the right. ", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["hot dog", "chair", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": "The image features a vase filled with different colored and arranged flowers sitting on a table. The arrangement is positioned near a mirror, which reflects the scene from a different angle. The vase has a decorative mirror placed on top of it, adding a unique touch to the flower display.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 238065, "caption": "The image captures a young man on a skateboard, riding on top of a skateboard ramp, possibly performing a trick or simply enjoying the ride. He is dressed in a blue shirt and blue jeans, and the skateboard is situated in the middle of the image. The skateboard ramp has a shape that mimics a cement ramp. The ramp is empty except for the skateboarder, adding a sense of focus to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, a small bathroom contains a toilet with a sign \"Magic Eye\" next to it. The bathroom also has a sink and a book titled \"Magic Eye.\" The bathroom appears to be clean and well-maintained.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "book", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 322163, "caption": " The image depicts a group of people in a kitchen area, with one person focused on making a pizza and others standing nearby. The individual making the pizza is in the center of the image, carefully spreading the toppings onto the dough. The pizza is being prepared in the middle of a table in the kitchen area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "pizza", "oven", "dining table", "bowl", "potted plant", "bottle", "microwave", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 230983, "caption": " The image captures a person on a skateboard performing a trick, in mid-air while jumping over a green bottle. The skateboard appears to be the main focus of the scene, and the person riding it is captured in a dynamic, exciting pose. The bottle appears to be small and strategically placed for the trick to be performed.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [19, 48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "According to the image, there is a kitchen located in a mobile home. The kitchen features a wooden table with two chairs on the left side and a black counter on the right side. There is a sink, stove, and oven in the kitchen area.", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["sink", "microwave", "oven", "refrigerator"], "hallucination_idxs": [20, 23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 53431, "caption": " In the image, a man is sitting on a chair, using a laptop. He has a t-shirt and a pair of shoes, and a pair of trousers with the pants legs folded up and tied around his legs. There are two columns next to the man, one on the left side and one on the right side of the image. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["laptop", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 441840, "caption": "The image captures a young child swinging a baseball bat and holding it with confidence on a field. There are several people present in the scene, including another child standing near the little boy, as well as an adult in the background, possibly watching over them or enjoying the action. The little boy, who has a baseball bat in hand, seems to be enjoying the fun and excitement of playing baseball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53465, "caption": "In the image, there is a festive cake with orange writing on it, sitting on a plate on a table. The cake appears to have a message written on it, which suggests a welcome or personalized theme. The table is set for a gathering or event, as several cups, a knife, a bottle, and a fork can be seen around the cake. There are a few chairs nearby, likely for guests to sit on while enjoying the food. Overall, the image creates a celebratory atmosphere, likely for a special occasion.", "mscoco_hallucinated_words": [["orange", "orange"], ["knife", "knife"], ["bottle", "bottle"], ["chair", "chair"]], "mscoco_gt_words": ["cup", "dining table", "bowl", "spoon", "cake", "fork", "person"], "hallucination_idxs": [10, 57, 60, 76], "metrics": {"CHAIRs": 1, "CHAIRi": 0.36363636363636365, "Recall": 0.5714285714285714}}, {"image_id": 381031, "caption": "There is a table that's been set up with different food items. The table features a bowl filled with multiple apples, a plate with two slices of bread on top of it, a cup, and a bottle. The food items include two donuts on a plate, a knife, a fork, a spoon, and some juice. There is also a carrot positioned near the table. ", "mscoco_hallucinated_words": [["bowl", "bowl"], ["knife", "knife"], ["fork", "fork"], ["carrot", "carrot"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "bottle", "apple", "microwave", "donut"], "hallucination_idxs": [18, 54, 57, 70], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.8571428571428571}}, {"image_id": 530520, "caption": "In the image, a group of men is sitting on the lawn using laptops, with one man looking at his laptop on the grass and two others around him. There are a few potted plants located in the scene, one at the top-right corner, another at the center-left, and a third one near the top.", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["clock", "motorcycle", "laptop", "backpack", "person"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 539555, "caption": "The image features a man in an action pose, trying to catch a frisbee in mid-air. He is the focus of the scene as he leaps towards the disc with both hands extended. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": " The image features a red train traveling along a railroad track. Three people are standing close to the train, with one person in the foreground and two others further back in the scene. A man is in front of the middle train car, a woman is in front of the left train car, and another man is further back. All of the individuals appear to be focused on the train, possibly waiting for it to pass by or boarding it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 560439, "caption": " The image depicts a group of three people snowboarding down a snowy hill. They are wearing snowshoes and are all close to each other while enjoying their activity. A snowboarder can be seen further back, partially visible from the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "snowboard", "backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": "The image portrays a snowy ski slope where several people are skiing or snowboarding. There is a group of at least 13 people gathered together on the slope, with various ski and snowboard brands scattered around the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 9628, "caption": "The image captures a thrilling water skiing scene, with a man on water skis and the skis attached to a power boat in the background. The man appears to be flying through the air, likely doing a trick or landing on the water after a high jump.", "mscoco_hallucinated_words": [["ski", "skis"], ["ski", "skis"]], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [14, 17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 330050, "caption": "According to the picture, the man and woman are standing on a tennis court, both holding tennis rackets. They appear to be enjoying each other's company while posing for the camera. The woman appears to be wearing a white blouse and a blue skirt, while the man is wearing a white and blue shirt. They are both smiling, likely from the social interaction on the tennis court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " The image depicts a man standing on a street or plaza and flying a kite, with several buildings visible in the background. The man is wearing a red and white shirt and appears to be enjoying the outdoors, possibly in the middle of a park or public space. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "A group of people are gathered around a white table in a room, sitting and standing in various positions, possibly chatting and catching up with each other. There are four individuals, with three standing and one sitting at the table. The sitting person is closer to the center of the table, while the standing people are positioned around it. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "mouse", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 163518, "caption": "In the image, a woman is working in a kitchen environment. There are several bowls full of salad or other food items on a table, covering most of the dining area. The salad bowls are spread across the table, with some on the left side and others on the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 156696, "caption": "The image is a black and white photograph of a train station, likely located at the platform. There are two trains present in the image, one at the left end of the station and the other on the right side. There are also several people standing near the trains, including one at the front left side of the train on the left and another one in front of the second train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "train", "handbag", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 525202, "caption": " The image showcases a large pizza placed on a pizza pan, sitting on a table. There are a few people in the background, either preparing food or simply enjoying a meal together. The pizza appears to be ready to be served or consumed, as it is nicely cooked and sitting on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "bottle", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 423984, "caption": "The image depicts a plate of food that contains pasta and vegetables. The plate features a variety of pasta, carrots, and carrot pieces, with some bread placed around it. Carrots are present in different sizes, some close together while others are more spread out. There are multiple carrot pieces, which are likely smaller portions of the carrots on the plate. In addition, the plate also includes some bread towards the sides. The food appears to be prepared and ready for consumption.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 499486, "caption": " The image portrays two cats sitting on a bed, looking tired, worn out and possibly exhausted after their day on the bed. The cats are positioned next to each other, almost touching. One cat can be found on the left side of the bed, while the other cat is on the right. The scene is a bit messy, with clothes scattered around the bed, suggesting that they had some trouble getting a good night's sleep.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "cat", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": "The image captures a man wearing glasses and a tie in a vehicle. He is smiling and appears to be enjoying himself while traveling in the car. The vehicle appears to be a car, and the man is seated in the back seat.", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 288729, "caption": " The image depicts a cake that is shaped like a small circus tent or a big top. The cake is covered with a yellow frosting and is sitting on a table. ", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["teddy bear", "bear", "elephant", "cake", "person"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 177721, "caption": " The image captures a man on a skateboard, performing a trick in the air, likely at a skate park. He is positioned in the center of the picture, balancing himself while maintaining the position of his skateboard, making it a captivating display of skill and coordination.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "The scene depicts a large group of people standing together in an outdoor setting, with several men and women present. One of the men, the central figure, is a tall man with a long beard and gray hair, wearing a priest costume. He is holding a cell phone in his hand, likely capturing the event or using it for other purposes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "horse", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 252625, "caption": "The image captures a quiet city street lined with tall buildings on both sides, creating a narrow alleyway with a sidewalk on the side. Several bicycles are parked near the sidewalk, as well as a couple of motorcycles, all likely to be ridden in the area. ", "mscoco_hallucinated_words": [["motorcycle", "motorcycle"]], "mscoco_gt_words": ["traffic light", "potted plant", "person", "bicycle"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 493610, "caption": "According to the picture, there is a large group of people gathered on a dirt road in front of a dirt-covered blue truck. The truck appears to be quite old and has a large back area with bench seats, creating a space for several people to sit. Many of the people are visible both on the front and back of the truck, some occupying the lower and upper seating areas. The back of the truck has a large, spacious area where several people can sit together, enjoying the ride. There is also a bus driving in the opposite direction, likely on a different road or in a separate location altogether.", "mscoco_hallucinated_words": [["bench", "bench"], ["seat", "chair"]], "mscoco_gt_words": ["truck", "person", "bus", "car"], "hallucination_idxs": [39, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 302789, "caption": " The image depicts a small room with a bed and suitcase placed in the corner. The bed is made up with blankets and pillows, suggesting it is being used as a living space. The suitcase is packed, indicating the occupant has traveled and might be preparing to depart. The room also has a sink and a fridge, likely for personal and culinary purposes. There is another item on the bed, possibly a water bottle or a glass. Overall, the room appears to be compact, but functional for the occupant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "refrigerator", "backpack", "bottle", "sink", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 30667, "caption": " A person sits with their legs crossed, enjoying a peaceful moment on a bench. Sitting next to them is a brown dog that seems to be relaxing as well. Several feet can be seen in the scene, indicating that there are others in the vicinity. This serene setting can be the result of a leisurely outing or a cozy get-together.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["person", "boat", "dog"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 114108, "caption": "The image features a table with a plate on it, which holds a pizza with two slices of mozzarella on it. There are several people surrounding the table, some of them close to the pizza and others further away. There are also several chairs placed around the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "spoon", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 328110, "caption": "The image features a woman holding a baby on a wooden bench. Another person is present in the scene, but it's unclear what they are doing. There are several other people in the picture, likely enjoying the time outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 366111, "caption": " In the picture, several people are riding bicycles down a street. They are wearing hats, including a couple of hats on one person, and one person wears a tie. There are ties and a handbag on the bicycles as well. ", "mscoco_hallucinated_words": [["tie", "tie"], ["handbag", "handbag"]], "mscoco_gt_words": ["traffic light", "car", "person", "bicycle"], "hallucination_idxs": [32, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.5}}, {"image_id": 532481, "caption": " The image shows a person airborne in the ocean with their parasail out, next to a pair of boats. There is also another parasailer further in the distance, to the right of the main person. The image captures an exciting moment while the person enjoys the thrill of parasailing over the water.", "mscoco_hallucinated_words": [["boat", "boat"]], "mscoco_gt_words": ["person", "surfboard", "kite", "car"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.25}}, {"image_id": 291634, "caption": "The image shows a row of motorcycles lined up along a street or sidewalk, near the sidewalk with several people standing around them. The motorcycles appear to be parked as they line the street, creating an interesting sight for passersby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463678, "caption": " The image shows a young male tennis player on a tennis court, actively engaged in the game. He is holding a tennis racket and is preparing to hit a tennis ball, which is in his vicinity. There is another person in the scene, possibly a fellow player or a spectator, standing nearby on the right side of the court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "In a black and white image, a small dog is walking on a leash along the sidewalk, next to a bicycle and a couple of people. The bicycle can be seen in the background, while the people are further away from the scene. The dog is likely the main focus of the image, with two distinct individuals visible at the sides of the image. There are a few other small details in the scene, such as a cell phone, and a couple of potted plants can be seen further down the sidewalk.", "mscoco_hallucinated_words": [["cell phone", "cell phone"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["bicycle", "person", "truck", "dog"], "hallucination_idxs": [85, 92], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 367872, "caption": "The image captures a baseball game in progress, with multiple players on the field. There are several players scattered around the field, with one of them swinging at the ball with a baseball bat. The batter has just finished a hit and stands in the middle of the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 353898, "caption": "In the image, a man is sitting on a public bench surrounded by a large group of birds, primarily white doves. The birds are spread out in various positions across the scene, with some located in the background, others on the ground, and a few close to the man sitting down. It seems that the birds are drawn to the person, potentially due to their presence or the area they are in.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "cell phone", "backpack", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": "There's a delicious dish of grilled cheese sandwiches and tomato soup on the menu. On the plate is a large bowl of soup and multiple slices of grilled cheese sandwiches. Two knives are positioned on the dining table, suggesting the presence of a utensil for eating the sandwiches. The sandwiches and soup appear to be the highlight of the meal, making for a satisfying and filling meal.", "mscoco_hallucinated_words": [["knife", "knife"], ["table", "dining table"]], "mscoco_gt_words": ["sandwich", "spoon", "bowl"], "hallucination_idxs": [34, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 463211, "caption": "The image features a young girl, possibly in a backpack, walking towards the left side of the picture and another girl further to the right. The yellow school bus can be seen in the background, parked or traveling on the road. Both children seem to be in the process of leaving or getting ready to board the school bus. There are also two handbags in the scene, located further to the right side of the image.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [69], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": "The image features a beautiful arrangement of flowers in a vase. The vase sits on a table in front of a sofa, creating a luxurious and ornate setting. The arrangement includes a mix of different flowers, including roses, which are placed in various parts of the vase. There are also some other types of flowers, like tulips, carnations, and daisies. ", "mscoco_hallucinated_words": [["sofa", "couch"]], "mscoco_gt_words": ["vase", "chair", "dining table"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 98979, "caption": "The image features a microwave in the kitchen, sitting on a countertop. The microwave is silver in color and has a front-facing window. The appliance appears to be the only item in the kitchen, with no other objects or people around it.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["microwave", "knife", "oven"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 244455, "caption": "In the image, there are two men riding motorcycles down a road. They are positioned side by side, with one rider on the left and the other on the right. They appear to be traveling in the same direction. A group of people is gathered behind them, with some standing further away than others. The road is somewhat busy, as there are a number of cars driving and parked nearby. One car is slightly in front of the motorcycles, while two more are parked to the left and right of the men riding the bikes.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 379443, "caption": "The image features a large black holey piece of fabric laid out on a table, with a pair of scissors situated near the center of the table, creating a fun and playful atmosphere. The pair of scissors is closed and positioned close to the black fabric, giving the illusion of a scissor-style hand gesture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "scissors", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 536154, "caption": "According to the image, a young boy is sitting on a bed surrounded by a large number of stuffed animals. The toys cover the entire surface of the bed and are piled up all around the child. There are at least 10 stuffed animals, some in various sizes and others in different positions, creating an impressive collection.", "mscoco_hallucinated_words": [["bed", "bed"], ["bed", "bed"]], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [12, 30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 374061, "caption": "According to the picture, there is a black cat sleeping on a bed. The cat is holding a stuffed animal tightly and cuddling with it. The scene captures the cute and cozy moment between the cat and the stuffed animal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "cat", "bed", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 560018, "caption": " The scene features a large group of people surfing and having fun in the waves at an outdoor location. There are at least eight surfers, some with boards and others without, scattered throughout the ocean. Some surfers can be seen closer to the shore, while others are further out in the water. A few people are paddling kayaks in the vicinity of the surfers. Overall, the ocean offers an exciting and fun-filled atmosphere for the water sports enthusiasts.", "mscoco_hallucinated_words": [["kayak", "boat"]], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 353320, "caption": " The image shows a guitar case placed on a bench, surrounded by a few pieces of luggage. There are several bags in various sizes and positions around the bench, including some near the guitar case and others slightly further away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256628, "caption": "According to the picture, a golden retriever dog is laying down on a wooden table top. It is positioned in the center of the table and its head is on the left side. In the background, there are a couple of chairs, one placed towards the right side of the image and the other closer to the left. The chairs are dotted around the perimeter of the wooden table, with one located in the center, and another near the right side.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["bowl", "chair", "dog"], "hallucination_idxs": [15, 26, 74], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 0.6666666666666666}}, {"image_id": 158806, "caption": " In the image, there is a big brown dog standing and eating from a blue plate. The plate contains two slices of toasted sandwiches placed side by side. There are several other people in the background, indicating that the dog is surrounded by people in the setting. The scene captures the fun moment of the dog eating the sandwiches, possibly with a little help from its human companions.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 424270, "caption": "A young child, possibly a toddler, is sitting in a bed in the middle of the image. The bed seems to be a queen-sized bed and appears to be on the left side of the picture. There are a few other beds visible in the background, further towards the right. The child is positioned closer to the front side of the bed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": "In this image, a horse is walking towards a car in the middle of a parking lot. The horse is positioned on the left side of the image, crossing from the top left corner to the bottom center of the scene. The car is parked in the top center of the scene, and there are several other cars surrounding it on all sides.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 483227, "caption": "The scene is set on a tennis court with several young people standing on the court, each holding a tennis racquet. They are positioned close to each other, possibly in a group setting for a sports activity or team bonding.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 228306, "caption": "The scene features a large red bus traveling down a city street. It is a city bus with passengers on board, possibly on their way to reach their destination. The bus is quite noticeable in the image, taking up a significant portion of the frame.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 416384, "caption": "The image features a young woman sitting in a white plastic chair and holding a cell phone, possibly texting or browsing the internet. She is wearing pink shorts and appears to be enjoying the outdoors in this hot summer day. The chair is located near the edge of a sidewalk, close to a large body of water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 364783, "caption": " The image depicts a small bathroom painted all red, from the walls to the toilet and sink. The bathroom features a toilet in one corner of the room, and a sink positioned next to it on the right side. The toilet and sink are accompanied by a few other bathroom fixtures, including two bowls and a pair of cups. A mirror is positioned above the sink, adding some brightness to the space.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["toothbrush", "cup", "potted plant", "bottle", "toilet", "sink", "chair"], "hallucination_idxs": [58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.42857142857142855}}, {"image_id": 335041, "caption": " In the image, a person in a red jacket is skiing down a snow-covered street. The individual is wearing snow skis and has his arms spread out as he moves down the street. In the scene, there are two cars parked close to the street, one on the left side and the other on the right side. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 531327, "caption": "The image features a plate of food on a dining table, filled with various types of vegetables. There are multiple pieces of broccoli scattered throughout the dish, and carrots can be found in a mixture with other vegetables. The bowl is placed on top of a dining table and appears to be the centerpiece of the table.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["knife", "dining table", "spoon", "fork", "broccoli", "wine glass", "carrot"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.42857142857142855}}, {"image_id": 451228, "caption": " The image features a busy street with double-decker buses and people on bicycles. Some of the buses are in motion while others are parked. There is a traffic light at the end of the street, indicating the end of the lane, and a number of street lamps are visible along the street. The street is full of people, including some walking, some riding bicycles, and some on buses. There is a sense of motion, with people walking, biking, and riding public transportation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 256365, "caption": "The image features a large plate filled with numerous cupcakes with white frosting on them, decorated with blue whipped cream swirls. There are at least seven cupcakes on the plate, which covers most of the image's area. The cupcakes vary in their size and are arranged in an intricate pattern, showcasing the artistic designs on them. This visual display of sweet treats is likely a decorative centerpiece or part of a cake-making competition.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "cup", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": "The image captures an outdoor setting in a backyard, featuring a man playing tennis on a court. The man is holding a tennis racket, standing on the left side of the court, as he is about to swing and hit the ball. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, a small dog is lying next to a bookshelf in a bedroom or similar setting. The dog is situated at the center of the scene and appears to be resting in a dog bed, possibly taking a nap or sleeping comfortably. The bookshelf contains a variety of books stacked on top of it, with a few closer to the dog, and some further away. The books provide a background for the dog, creating a cozy and welcoming atmosphere for both the dog and the people around it.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["book", "bed", "dog"], "hallucination_idxs": [96], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 51054, "caption": " In a bedroom, a woman is sitting on a bed and talking on her cellphone. She is wearing a plaid shirt and her hair appears to be greying. There is another person in the room, likely a relative or friend. A chair can be seen in the background.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cup", "bottle", "cell phone", "bed", "person"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 1268, "caption": "According to the picture, there is a group of people, consisting of at least five people, standing near a bridge. They are positioned near a body of water that flows under the bridge. Some of the individuals appear to be walking, while others are standing still. The scene is capturing a moment of leisure and relaxation, with a couple and a woman standing in different parts of the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "backpack", "bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 62692, "caption": " The image features a woman playing tennis on a tennis court. She is holding a tennis racket and appears to be in the middle of a match. Her outfit consists of a black top and white shorts. The tennis ball is close to her as she swings her racket. The entire scene captures the action of a woman playing a tennis match.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": " The image depicts a large, spacious room with several chairs and tables arranged around various objects. The focal point of the room is a tall vase on the right side of the image, sitting on a table, and another vase in the foreground of the photograph. Several vases are placed throughout the room, one towards the left side and two more towards the middle of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "book", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 575406, "caption": "The image shows a blue and white bus traveling down a street with a red car parked beside it. The bus is driving on the road while the car is parked next to it. There are several people visible on the bus, with some standing in the middle of the vehicle and others sitting closer to the front and back.", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"]], "mscoco_gt_words": ["motorcycle", "bus", "potted plant", "person"], "hallucination_idxs": [15, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.25}}, {"image_id": 244834, "caption": "According to the picture, a couple of people are walking down a rain-soaked street holding umbrellas. Two umbrellas are visible, with one being large and white, while the other is smaller and red. The group includes the main subjects and other individuals walking with them. Some of the people have handbags to protect their belongings from the rain.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "clock", "handbag", "umbrella", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 151159, "caption": " Scene Description", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 277073, "caption": "In the image, a man and woman are riding a motorcycle together, possibly on a street in a city. The man is in the front, while the woman is riding on the back of the motorcycle. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 377879, "caption": "In the image, there is a train traveling past a field, which features several military tanks of different sizes and styles on a flatbed train. Some of the tanks are positioned near the front of the train, while others are farther back or off to the sides. The scene has a sense of movement as the train continues to roll along the track.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": "The image features a conference room containing various objects. In the middle of the room, there is a table with a laptop and a keyboard on it, as well as a few papers placed at different spots around the room. ", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["laptop", "mouse", "chair", "dining table"], "hallucination_idxs": [26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 63934, "caption": "The scene is set in a city street, with several vehicles, such as a car, truck, and van, parked alongside the road. The street is empty except for the person on their skateboard who is riding the skateboard on the sidewalk. There are also two cats, one on the left side of the scene and the other on the right side.", "mscoco_hallucinated_words": [["cat", "cat"]], "mscoco_gt_words": ["truck", "skateboard", "car", "person"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 275611, "caption": "The image features a delicious Italian cheese pizza placed on a metal panning at a dinner table. It appears to be the center of attention, with a few people gathered around it, enjoying their meals. The pizza has a crispy crust, and various toppings such as mozzarella and basil are visible on its surface.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["pizza", "knife", "dining table"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "A person is sitting at a dining table or similar area, holding a computer mouse with their hand, suggesting they might be interacting with the computer or possibly taking a break from work or other activities. The mouse is placed on the table in front of the person, while a computer keyboard is also situated near the person on the left side of the table. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": "The image shows a person driving a vehicle down a road with a herd of sheep following behind. The person is seated in the driver's seat and there are at least a dozen sheep in the scene, spread across the scene. A truck is also present in the scene, traveling along the same road as the vehicle. The presence of both the truck and the person suggest the possibility of a traffic-related situation, possibly involving the sheep following closely behind the truck. Overall, the image captures the herd of sheep moving along the street.", "mscoco_hallucinated_words": [["seat", "chair"], ["truck", "truck"], ["truck", "truck"], ["truck", "truck"]], "mscoco_gt_words": ["person", "sheep", "car"], "hallucination_idxs": [27, 46, 68, 88], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 422326, "caption": " The scene features a parking lot filled with various vehicles, including a motorcycle, a car, and two motorcycles with unique design elements. The motorcycle with an unusual form, like a cart, is parked next to a car in the center of the parking lot. Near the car, a group of three potted plants are placed atop a bench.", "mscoco_hallucinated_words": [["potted plant", "potted plant"], ["bench", "bench"]], "mscoco_gt_words": ["truck", "bicycle", "motorcycle", "car", "person"], "hallucination_idxs": [59, 65], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 110449, "caption": " The image features two men, one on the right and the other on the left, sitting at a table with a remote control in front of them. Both of the men appear to be looking at the remote control, possibly trying to figure out how to operate it. The remote control is positioned in front of the two men.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "dining table", "laptop", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45550, "caption": "The image depicts a man, likely a chef or restaurant owner, standing in front of a table full of food. He is holding up a plate of deep-fried food, likely French fries, and he is smiling at the camera. In addition to the man and the food, there are other people in the background, one of whom is also smiling.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "clock", "person", "bowl"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.25}}, {"image_id": 444913, "caption": "According to the picture, a man is standing on a grassy area, looking up at a bird in flight while holding a red parasol. The surrounding environment is minimal, with no buildings or significant features visible in the area. The man appears to be the main focus of the image, capturing an outdoor moment and enjoying nature.", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["frisbee", "person", "car"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": " A black and white dog is captured in the middle of a lively scene, leaping into the air and catching a pink frisbee while it is in mid-air. The dog stands out among the people and benches, while a bottle is visible near the left edge of the image. The dog's action captures everyone's attention as they watch the exciting moment.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["cup", "dog", "bottle", "frisbee", "backpack", "person"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 132796, "caption": "The image captures a man and two women riding on the back of an elephant, possibly in a park or other outdoor setting. The elephant is large and takes up most of the scene, with the people seated in the back riding area. The three people seem to be enjoying their elephant ride, with one woman seated more towards the front, another towards the middle, and the third person towards the back.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "elephant", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "The image features two cats sitting on top of a toilet, appearing to be in a bathroom setting. One of the cats is on the left side, and the other is on the right side of the toilet. They seem to be engaged in a conversation, watching each other while in the bathroom. There is a bathroom sink located nearby the toilet, partially visible on the right side of the image. A cup is also placed in the bathroom, perhaps for bathroom use or for decoration. The background is mostly made up of various patterns and colors, creating a more visually interesting scene.", "mscoco_hallucinated_words": [["sink", "sink"], ["cup", "cup"]], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [63, 80], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": " There are several motorcycles on a road in the image, with several riders on the vehicles. The road appears to be a residential street, possibly near a neighborhood or in a suburban area. The road is lined with trees, and some of the motorcycles can be seen traveling in both directions. There are at least six motorcycles visible in the image, including some parked on the side of the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 205193, "caption": " In the image, there is a brown granite countertop in front of an old-style toaster oven. A bag of green tea sits next to the toaster oven, while a black tea is nearby. The beverage tea is in a brown bottle, while a yellow teapot sits on the counter.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["toaster", "microwave", "oven", "person"], "hallucination_idxs": [44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 272566, "caption": " The image features two men sitting on a sidewalk next to baskets filled with bananas and other fruits. The bananas are placed in front of the men and are also scattered around the area. Some of the bananas are ripe, and others appear to be not quite ripe yet. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 170099, "caption": "In the image, a man is seen wearing a red tie and standing in front of a couch. He appears to be talking as he stands near a wall. There are a couple of chairs visible in the background. The room has two chairs and three dining chairs arranged for different activities.", "mscoco_hallucinated_words": [["chair", "chair"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["tie", "orange", "person", "couch"], "hallucination_idxs": [37, 47, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.75}}, {"image_id": 14940, "caption": "According to the picture, it features a cat and a stuffed animal, likely a dog, cuddling together. The cat is resting on top of the other animal, with its face partially covered by the dog. The stuffed dog is positioned slightly to the right and above the cat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "cat", "orange", "chair", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 408449, "caption": " The image shows a man skateboarding down a city street. He is surrounded by several people in the area, with some standing near the street and others further back. The street appears to be empty, with no vehicles present in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "skateboard", "bus", "handbag", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 140702, "caption": " The scene features two baby elephants walking through the grass. One baby elephant is near the left side, standing close to a small bird, while the other baby elephant is closer to the right side of the image. Both elephants are surrounded by grass in the background. There are a few other birds present in the scene, one located near the left side and the other towards the middle of the picture. Overall, the image captures the two baby elephants walking and exploring their surroundings.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 354540, "caption": "The image depicts a person sitting at a kitchen counter, using a laptop. In the background, there is a toaster oven and a microwave. The kitchen is adorned with various items such as an air conditioner, a book, and a bowl. The counter also has a bowl and a cell phone placed on it. The person is sitting on a chair and appears to be working on a laptop, using it as a table top while eating a meal.", "mscoco_hallucinated_words": [["book", "book"], ["bowl", "bowl"], ["bowl", "bowl"], ["table", "dining table"]], "mscoco_gt_words": ["toaster", "oven", "laptop", "cell phone", "microwave", "chair", "person"], "hallucination_idxs": [42, 46, 53, 82], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3076923076923077, "Recall": 1.0}}, {"image_id": 19763, "caption": "The image features a close-up of a piece of deep-dish pizza on a plate. There are two pizza slices on the plate, with one slice located on the right and the other on the left. The pizza is placed on a dining table, surrounded by chairs. The dining table and the chairs can be seen in the background, adding a dining atmosphere to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 191301, "caption": "The image features a plate with two desserts \u2013 a chocolate cake and ice cream \u2013 on a table or countertop. The desserts are accompanied by two silver spoons, positioned on either side of the dish. There is a cup, likely holding a beverage, and a chair or stool at the table where the desserts are being served. The table is tipped towards the right side, providing an intimate view for the guests.", "mscoco_hallucinated_words": [["chair", "chair"], ["stool", "chair"]], "mscoco_gt_words": ["cup", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [51, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.8}}, {"image_id": 124236, "caption": "The image depicts a kitchen area with sunlight shining in through the window. Several bowls are arranged on a counter, and a table sits near the windows with some pots on it. A sink is located close to the table, and there is a black oven in the middle of the kitchen. Two cups can be seen sitting on the table, along with various kitchenware items scattered around the room. The scene showcases a typical kitchen setting with various utensils and kitchenware spread throughout.", "mscoco_hallucinated_words": [["sink", "sink"], ["cup", "cup"]], "mscoco_gt_words": ["toaster", "oven", "dining table", "bowl", "spoon"], "hallucination_idxs": [36, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6}}, {"image_id": 344930, "caption": "The scene is on a city street, where a red fire hydrant is positioned on the side of the road. There are cars parked nearby, with a large truck closer to the fire hydrant and a smaller car further back. The sidewalk area in front of the fire hydrant is empty except for the fire hydrant itself.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["fire hydrant", "orange", "car"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " The scene features a bedroom with a desk and some books, a cat laying on a bed next to a dog, and a few other items in the room. There are two televisions present, one located towards the top left and the other at the top right of the image. ", "mscoco_hallucinated_words": [["desk", "dining table"], ["dog", "dog"]], "mscoco_gt_words": ["remote", "tv", "cat", "chair", "book", "bed"], "hallucination_idxs": [7, 21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 82488, "caption": "A hotel room with two beds is featured in the image. Both beds are equipped with white comforters and yellow and orange pillows, creating a warm and welcoming atmosphere. The room has two windows and a door, with the left side of the room featuring two lamps and a phone. The right side of the room has two lamps and a television. Both beds are situated on opposite ends of the room. There are no other notable features in the image.", "mscoco_hallucinated_words": [["phone", "cell phone"], ["television", "tv"]], "mscoco_gt_words": ["remote", "clock", "dining table", "orange", "book", "bed"], "hallucination_idxs": [53, 66], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 348251, "caption": " The image features a person wearing skis standing in a snowy area. They appear to be smiling and enjoying their time in the snow. There are multiple ski poles on the ground, likely belonging to the person or others in the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 150703, "caption": " The image features a delicious-looking hot dog sitting on top of a table alongside various cups. A cup on the right contains a drink, possibly beer, while another cup and a bottle can be seen on the left side of the scene. The hot dog is served in a paper wrap or wrap and is the focal point of the scene. The dish appears to be a tasty and satisfying combination of a hot dog and beer.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["sandwich", "cup", "hot dog", "dining table"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.75}}, {"image_id": 381615, "caption": "This image features a window looking out onto a garden with a wide variety of plants. The scene displays multiple pots and several plants hanging in them, creating a lush atmosphere. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "potted plant", "vase", "book", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 352703, "caption": "In the image, a woman is cooking a meal outside under a tent, as several people watch her and enjoy the food. The woman is in the foreground, actively stirring a pot in a frying pan, likely adding seasoning or finishing touches to the meal. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "handbag", "oven", "dining table", "bowl", "spoon", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 111109, "caption": "The image depicts a young boy posing for a photo, likely during a birthday celebration or an important event. The boy is wearing a tie and shorts, and he is standing on a stool. In the background, there are a few chairs, and a TV can also be seen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "tie", "motorcycle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 226802, "caption": " The scene is of an outdoor street red from being red, situated in an Asian city. A large group of people can be seen in various locations throughout the street, with some standing, walking, and riding vehicles. Several cars and trucks are present in the middle of the street, occupying most of the space, while buses are parked towards the left side of the scene.", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["motorcycle", "bus", "person", "car"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 49005, "caption": "In a baseball game in progress with several players on the field, a baseball player is preparing to pitch the ball. He is standing on the mound with the ball in his hand, while several baseball players can be seen scattered around the field, including catcher, pitcher, and outfielders. ", "mscoco_hallucinated_words": [["ball", "sports ball"], ["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [21, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": "The image features a large herd of cows and people walking down the road. There are numerous cows scattered along the road and surrounding it, with some closer to the right side and others in the middle or left side. The herd of cows is walking down the road, with a group of people also walking alongside them. Some of the cows are close to the pavement, while others are further away. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 100274, "caption": " The image features a train on a railroad track traveling near a large parking lot, which is in the vicinity. There are several vehicles of various types parked in the parking lot, including a truck, a car, and another vehicle. The train is passing through the scene, moving from left to right. The railroad track continues beyond the parking lot, with another train visible further ahead.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "train", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317254, "caption": "The scene features a person wearing a motorcycle helmet riding on a motorcycle, and they are positioned in the middle of the street, surrounded by cars. A car is located to the right of the motorcycle, closer to the edge of the street, while another car is behind the first car, closer to the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": " There is an image of a large commercial kitchen that appears to have multiple appliances and utensils. The kitchen is filled with silver metallic equipment and bowls placed around various areas of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 28842, "caption": "The image captures an outdoor skate park where a group of people is gathered, enjoying various activities. Two individuals are skateboarding down a slope, while others are watching or standing around the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 319607, "caption": "A street scene with a traffic light atop the pole in the middle of the road is being viewed. There are multiple people riding bicycles around the traffic light, with some of them near the bottom of the pole, and others in the background. A train is also visible in the scene, approaching from the left side.", "mscoco_hallucinated_words": [["train", "train"]], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "car", "backpack", "person"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 210372, "caption": " The image depicts a small kitchen with red and white cabinets, a sink, and a window near the refrigerator. The sink is located in the corner of the kitchen near the window, with a drain nearby. A dish and a few utensils are placed near the sink, while a handful of items are scattered around the room. Some of these items are a bottle, a wine glass, and a wine bottle, along with a few more items.", "mscoco_hallucinated_words": [["wine glass", "wine glass"]], "mscoco_gt_words": ["sink", "refrigerator", "bottle"], "hallucination_idxs": [73], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 156915, "caption": "The image features a large bouquet of flowers, with a red rose standing out in the center. There are other roses, white flowers, and greenery mixed throughout the arrangement, giving it a full and vibrant look.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 349642, "caption": "According to the image, it features a woman sitting on a wooden bench and a dog next to her. The bench is situated outside and a brick path is in the background. In the foreground, there are several dotted lines representing the paths and the sidewalks, as well as another bench and a person walking away from the scene. Additionally, a knife and a handbag can be spotted in the image.", "mscoco_hallucinated_words": [["knife", "knife"], ["handbag", "handbag"]], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [68, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 423008, "caption": "The image captures a busy scene on the water, with numerous small boats docked near each other. Many people are standing on the boats, enjoying the view or conversing. There are umbrellas providing shade and protection from the sun, covering several of the boats. The boats are arranged closely to each other, with people standing on various platforms and enjoying the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "umbrella", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45435, "caption": " The image shows a person holding up a cellphone in front of them, with the phone's display side facing towards the camera. It appears to be a newer model of cellphone, with a unique design featuring a mix of silver and black colors. The phone is held in the left hand, with the person's arm and hand occupying the right side of the image. The scene also features two books, one closer to the person and the other further back. The books are placed on a table or desk surface, partially visible behind the person's hand.", "mscoco_hallucinated_words": [["book", "book"], ["book", "book"], ["table", "dining table"], ["desk", "dining table"]], "mscoco_gt_words": ["cell phone", "cup", "person"], "hallucination_idxs": [77, 91, 96, 98], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": " The image displays a room with various computer equipment, including a laptop sitting on a desk, which is being operated by a man in a white t-shirt. The desk is equipped with a chair, and there is another chair located near the side of the room. The atmosphere is work-oriented as the man is likely using the laptop for work or school.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "tv", "dining table", "laptop", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 72656, "caption": " The image features a group of three children, two boys and one girl, sitting together at two small dining tables in a playground setting. They are enjoying a meal together, likely on a warm day.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "suitcase", "dining table", "bowl", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 372087, "caption": " The scene takes place at a train station, where a green and yellow train is on the railroad tracks. The train is occupying the majority of the image, stretching across the entire width of the track. Several people can be seen around the train, either walking by or waiting for the train to pass. There are two individuals near the left side of the train, one closer to the front and another towards the back. Another person is visible on the right side of the train, closer to the middle. These people seem to be waiting or preparing to board the train as it passes through the station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "suitcase", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 336209, "caption": "The image features a young man performing a trick on a skateboard, captured in the act of jumping while riding a skateboarding ramp. The ramp is placed towards the left side of the scene, with the man balancing himself by leaning back slightly. In the background, several benches and a car can be seen, indicating a possible skateboarding area or park.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 46031, "caption": "The image displays a white table with various computer-related items placed on top of it. There is a laptop sitting on the left side of the table, with a mouse positioned next to it. A computer monitor and mouse can be seen towards the center of the table. On the right side, there is another mouse, this time closer to the edge of the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "tv", "dining table", "laptop", "mouse", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 283118, "caption": " The image depicts a family gathering at a dining table with various food items, including pizza and hot dogs. There are three people in the scene, a man, a woman, and a child. They are sitting together, with the man on the left, the woman in the middle, and the child on the right. The dining table is covered with various plates and cups, as well as a bottle placed near the child. Some utensils, like a fork and knife, can be seen on the table. The table is set up with blue tiles and has a chair on both sides.", "mscoco_hallucinated_words": [["hot dog", "hot dog"], ["cup", "cup"], ["fork", "fork"], ["knife", "knife"]], "mscoco_gt_words": ["pizza", "handbag", "dining table", "bottle", "cell phone", "car", "backpack", "chair", "person"], "hallucination_idxs": [18, 73, 90, 92], "metrics": {"CHAIRs": 1, "CHAIRi": 0.21052631578947367, "Recall": 0.5555555555555556}}, {"image_id": 429598, "caption": "This image features a kitchen with brick-patterned walls. The kitchen has a small refrigerator, a sink, a white oven, and a toilet in the back room. The stove is positioned closer to the sink, while the refrigerator and toilet occupy the center of the room. The white oven is visible to the left side of the image.", "mscoco_hallucinated_words": [["toilet", "toilet"], ["toilet", "toilet"]], "mscoco_gt_words": ["sink", "oven", "refrigerator", "bottle"], "hallucination_idxs": [25, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 0.75}}, {"image_id": 333365, "caption": " This is a scene from an airport, featuring several airplanes parked on the ground and a couple of them on the runways. There are multiple large passenger airplanes parked across the scene, including some closer to the front and others farther back, while a few of them are on the runways. A few smaller airplanes can also be observed, such as one located in the middle of the picture and another towards the right edge. The planes are placed in various positions, with some appearing to be closer together and others dispersed further apart.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 424174, "caption": " The image shows a large bowl of assorted fruit on a dining table. It is filled with various fruits such as apples, bananas, oranges, and possibly grapes. The fruits are arranged in different locations within the bowl, with some being concentrated in the center, while others are scattered throughout the rest of the bowl. A dining chair can also be seen in the scene, possibly used for guests during a meal or entertaining. The scene represents a welcoming and hospitable gathering with a table full of delicious fruit.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "bowl", "apple", "orange", "banana"], "hallucination_idxs": [64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 3035, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "airplane", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 177213, "caption": " The image features a delicious-looking pizza placed on a plate with a fork near it. The pizza appears to be freshly prepared, with melted mozzarella cheese on top, which gives it a white color. The fork adds a finishing touch, making it easier to eat the pizza. The plate sits on a dining table, likely in a restaurant, indicating the dish is ready to be enjoyed by guests.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "cup", "knife", "dining table", "fork", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 438535, "caption": "This image features a street scene with a large clock prominently placed in the middle of the street near some buildings. The clock is positioned at an angle, facing towards the top right corner of the photo. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 490711, "caption": "The image features a clean kitchen with a white refrigerator, a stove, a sink, a dishwasher, and various countertop items. The refrigerator is positioned near the center of the room, while the stove occupies the left side of the kitchen. The sink is placed towards the right side of the kitchen. A dishwasher can also be found, situated near the refrigerator.", "mscoco_hallucinated_words": [["stove", "oven"], ["stove", "oven"]], "mscoco_gt_words": ["sink", "refrigerator", "banana"], "hallucination_idxs": [12, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "The image features a sleeping cat lying on a table with other items, including a mouse and a backpack, present in the scene. The kitten appears to be curled up and quite comfortable in its position. The computer mouse is also on the table, with a keyboard in close proximity. The backpack is located near the right side of the table, possibly representing a luggage item for travel.", "mscoco_hallucinated_words": [["backpack", "backpack"], ["keyboard", "keyboard"], ["backpack", "backpack"], ["luggage", "suitcase"]], "mscoco_gt_words": ["cup", "dining table", "laptop", "mouse", "cat"], "hallucination_idxs": [19, 51, 57, 71], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.8}}, {"image_id": 456756, "caption": " The image features two giraffes, one adult and one baby, standing in a zoo enclosure. They appear to be facing each other and the baby giraffe is closer to the center of the scene, while the adult one is slightly off to the side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": " The image features a large empty train car with several blue seats arranged throughout the vehicle. There are three seats in the front of the car and four more seats located in the middle of the car, with the last two chairs towards the back of the vehicle. There are several bottles placed on a table or countertop. One person is visible in the front of the train car, possibly checking out the interior or getting ready to board. ", "mscoco_hallucinated_words": [["bottle", "bottle"], ["table", "dining table"]], "mscoco_gt_words": ["train", "person", "chair", "car"], "hallucination_idxs": [54, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.15384615384615385, "Recall": 1.0}}, {"image_id": 343937, "caption": " The image features a snow-covered slope with a person wearing snow goggles snowboarding down the hill. The woman, wearing a white snowboard helmet, is dressed in a blue winter hat and pants. She appears to be enjoying the ride, as her snowboard is visible underneath her. The scene is set against a backdrop of snowy mountains and the sky.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "person", "snowboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": "In the image, there are two small birds standing on a dining table. They are both positioned close to the edge of the table, one towards the left side and the other towards the right side. The birds are looking down and appear to be focusing on something on the table in front of them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "bottle", "bird", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 145666, "caption": "The image shows a large pot filled with a variety of vegetables, including broccoli and carrots. There is an abundance of both vegetables throughout the pot, covering much of its surface area. The broccoli is placed in a range of different locations, and carrots are distributed in various sizes and locations as well. This indicates that the dish is likely to be a blend of both vegetables, cooked together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 304984, "caption": " The image shows a white plate topped with a sandwich and a bowl, likely filled with a side dish. There are also two spoons placed on the table, one on the left side and the other on the right side, possibly for eating or serving. ", "mscoco_hallucinated_words": [["spoon", "spoon"], ["table", "dining table"]], "mscoco_gt_words": ["bowl", "sandwich", "cake", "fork", "orange", "wine glass"], "hallucination_idxs": [25, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 122467, "caption": "The image features a close-up view of a person's hand holding a Nintendo Wii remote. This person is likely holding the remote while sitting on a couch. The remote has been damaged, likely from use or an accident. The person's hand is placed near the bottom of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 518287, "caption": "According to the image, there is a toothbrush in a cup with its top half removed to resemble a face with a toothy grin. The toothbrush is on an orange background, which makes up most of the image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["toothbrush", "cup", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 145448, "caption": "According to the picture, there is a cat sitting on a tarp located on the floor of a room, in front of a motorcycle. A person can be seen sitting close to the cat. There are also three other people in the scene, situated around the room in different areas. The motorcycle is placed to the right side of the room, partially obscuring it. There are other objects and furniture in the room, including a handbag on the right side and a few potted plants around the space. The overall setting seems to be a relaxed and casual atmosphere, with the cat resting in the tarp and the people in various positions.", "mscoco_hallucinated_words": [["handbag", "handbag"], ["potted plant", "potted plant"]], "mscoco_gt_words": ["dining table", "motorcycle", "cat", "car", "chair", "person"], "hallucination_idxs": [84, 92], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 352900, "caption": "The dish features a mix of pasta and broccoli, prepared and ready to be served. The pasta is placed at the center of the plate, while the broccoli covers a significant portion of the dish. It appears that the dish is finished and ready to be served. A bottle of wine or an alcoholic beverage can be seen nearby, placed on the right side of the plate.", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "fork", "knife", "broccoli"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 153607, "caption": " The image features three identical glass vases, each with a different orange flower floating inside, creating a stunning visual display. These vases have the word \"inspiration\" engraved in them. The flowers are situated in the middle of the vases, appearing as though they were suspended or placed inside the glass containers. There is a fourth vase in the middle of the other three vases, but it does not contain a flower or any other item.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "orange", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 157581, "caption": "The image features a long table topped with a variety of wine bottles, all empty, arranged in a line. The table appears to be a restaurant dining area. There are several chairs surrounding the table, some placed closer to the table while others are farther away. The chairs are likely for customers to sit and enjoy their meals. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "dining table", "bottle", "wine glass", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 227438, "caption": "The image depicts a bedroom with a large bed placed in the center of the room. The bed is neatly made and appears to be quite large, taking up most of the space in the room. The bedroom is darkly lit with a few lamps, giving it an intimate and cozy feel. There is a chair near the bed, situated close to the left edge of the image. The room also has a window that extends towards the top of the scene. Overall, the bedroom is well-furnished with the bed being the primary piece of furniture.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "couch", "bed", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 116037, "caption": "The image captures a baseball game in progress. In the foreground, a batter is swinging a bat, attempting to hit the baseball that is approaching. The catcher, an essential member of the team, is positioned close to the bat to catch the ball as soon as it's hit. Behind the batter and catcher, there are several other players on the field. These players could be teammates or opponents.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 202658, "caption": "In the image, there is a white toilet sitting in the middle of a parking area. The toilet has a hello kitty seat on it, which adds a cute touch to the overall scene. There are several other items in the vicinity, including a chair, two buckets, and a suitcase. ", "mscoco_hallucinated_words": [["suitcase", "suitcase"]], "mscoco_gt_words": ["bowl", "chair", "toilet"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 18150, "caption": "According to the picture, it is an image of a father and son sharing a piece of pizza. The father is sitting on the floor while the child sits beside him on the couch, possibly helping the son eat his slice of pizza. There are several other people in the background of the image. One person is located towards the left side of the image, another towards the center, and two others towards the right side.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "bottle", "backpack", "person", "couch"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 418711, "caption": "The scene takes place in a dark alley where a man is pushing a bike, filled with several bunches of bananas. The bike has bananas hanging from the basket attached to the front and from the frame itself. It seems like the bananas were collected from the bunches and gathered on the bike to be sold or distributed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bicycle", "car", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image features a bowl of soup that appears to contain broccoli and corn. There is a piece of bread in the scene as well. The bowl of soup and bread are sitting on a table or counter, possibly within a dining room or restaurant setting. The scene appears to be a food-related image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 491000, "caption": "According to the image, there is a long, narrow street filled with people walking, shopping, and socializing. Multiple individuals are walking on the street, some alone and others in small groups, as they engage in their daily activities. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 401163, "caption": " The image depicts a park setting with two people sitting on a bench, with one sitting closer to the left and the other towards the right. There is a small dog sitting to the left of the two people, sitting closer to the left individual. The scene features a number of pine trees dispersed throughout the park, including a couple of tall ones, as well as several smaller ones. In addition, there are a few other benches scattered throughout the park. The atmosphere of the park appears to be peaceful and relaxing, with the people and the dog enjoying the outdoor setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": " The image displays a backpack containing various items, including a laptop, school supplies, and personal items. There are two cell phones, one placed on top of the backpack and another situated slightly to the right of the first one. Among the assorted items, there is a multi-colored pen, several scissors, a notebook, and a handbag, possibly belonging to the owner of the backpack. Some of the objects, such as the laptop, are positioned near the edge of the backpack, while others are spread out across the entire surface.", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["laptop", "book", "cell phone", "backpack", "scissors"], "hallucination_idxs": [64], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 0.8}}, {"image_id": 268293, "caption": "The image features a man swinging a tennis racket at a tennis ball, trying to hit it. He appears to be focused on the ball, which is mid-air as he completes the swing. There are also a few other people in the background, but their exact positions and activities are not clear.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": " This image depicts a small, narrow kitchen featuring wooden cabinets and appliances. There is an oven to the right, a refrigerator to the left, and a toaster above the oven. Additionally, there is a sink with two faucets on the left side and a microwave situated towards the back. The kitchen countertops have multiple bottles on them, with one located close to the sink and several others around the countertop. There is a washing machine visible towards the front of the room. A clock is also present near the wall above the sink.", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"], ["toaster", "toaster"], ["microwave", "microwave"]], "mscoco_gt_words": ["clock", "oven", "spoon", "bottle", "vase", "apple", "sink"], "hallucination_idxs": [23, 30, 50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3, "Recall": 0.5714285714285714}}, {"image_id": 358086, "caption": " The image portrays a group of young women playing a game of frisbee, with several players actively participating in the game. They are jumping in the air or close to the ground, attempting to catch the frisbee in various spots.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "frisbee", "backpack", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 89541, "caption": "According to the picture, there is a variety of foods placed in a yellow container, including bananas, strawberries, and muffins. The different fruits and baked goods are presented in various sizes and locations within the container. There are two bananas, one on the left side and another one towards the center, with some muffins occupying various spots. Among these, there are several strawberries, including one in the upper-left corner, another one towards the middle left, and another one located towards the upper right. The selection of foods creates a colorful and appetizing assortment, suitable for serving or eating.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "cake", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 239445, "caption": " In the image, a young man is standing in a park area, holding a frisbee and smiling. He seems to be playing with the frisbee, asking another person to play with him. The surroundings feature a few people spread out throughout the area, indicating a public space like a park.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": " The scene takes place in a grassy park where a giraffe is standing tall, enjoying its time on the field. The giraffe is in the center of the image, while there are also a few cars and trucks parked around the area, giving a sense of a parking lot nearby. The park is likely a place where the giraffe can wander around freely and fly a kite, providing some entertainment and joy.", "mscoco_hallucinated_words": [["giraffe", "giraffe"], ["giraffe", "giraffe"], ["truck", "truck"], ["giraffe", "giraffe"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [10, 23, 40, 63], "metrics": {"CHAIRs": 1, "CHAIRi": 0.6666666666666666, "Recall": 0.5}}, {"image_id": 487990, "caption": " There is an image, possibly a photograph, displayed at a picnic table or dining table filled with various food items, including sandwiches, fruits, bottles, and cups. A guitar, resembling a ukulele, is also present in the image. The table is surrounded by multiple chairs, with a woman sitting in one of them and holding the guitar.", "mscoco_hallucinated_words": [["sandwich", "sandwich"], ["chair", "chair"]], "mscoco_gt_words": ["cup", "knife", "dining table", "bowl", "spoon", "bottle", "cake", "fork", "donut", "person"], "hallucination_idxs": [24, 53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 138834, "caption": "According to the picture, there is a man walking with his ski equipment along a snowy area. The man is carrying two pairs of skis on his back, one larger pair and another smaller one. He is moving away from a car that is parked behind him, presumably heading towards the mountains. There is another person nearby, who seems to be walking alongside the main subject. The scene is set against the snowy backdrop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 492057, "caption": " The image shows a man standing in a bathroom, taking a photo of himself using his cell phone. The man is wearing a black jacket, and a tie can be seen at the bottom left of the mirror. The mirror reflects both the man and the bathroom behind him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "cell phone"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388215, "caption": " In the image, a man is playing tennis on a tennis court, holding a tennis racquet and reaching out to hit a ball. The ball appears to be near the right side of the court, possibly ready for the player to strike it. There is a pole in the background of the image, possibly located along the edge of the tennis court. The player appears to be in the middle of the court, positioned for his serve. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 111409, "caption": "The image features a man wearing a motorcycle helmet riding on a motorcycle down a road. The motorcycle appears to be orange and has a small, round backrest. The man is dressed in a light blue t-shirt and has an appearance of being comfortable and relaxed while on his ride. The scene is surrounded by various parked cars and a truck, with one car in the foreground, one in the background, and the others placed around the road.", "mscoco_hallucinated_words": [["orange", "orange"], ["car", "car"], ["truck", "truck"], ["car", "car"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [22, 61, 64, 68], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4444444444444444, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": " The image features a large green plant or vegetable bunched together in the center of the scene. It appears to be made up of numerous small, green vegetables. There is also a person standing behind the green bunch, seemingly in a grocery store or market environment, giving the impression of someone shopping for produce. The individual appears to be reaching for the vegetables, perhaps to grab a closer look or to examine the produce more closely.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 92462, "caption": " In the image, there is a group of cows walking down a dirt road or street with a car driving alongside them. A person can be seen standing nearby. The cars appear to be part of the scene, possibly passing by or driving alongside the cows.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "cow", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 397268, "caption": "The scene depicts two bikes placed in front of a store window, with two chairs sitting alongside them. The bikes appear to be different from one another, as they have different designs and colors. The chairs are positioned close to the bikes, with one slightly further back than the other. In the front of the store, there is a sign hanging above the entrance. A few plants are also visible, including one close to the left side of the scene and another closer to the center.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "chair", "potted plant", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 127659, "caption": "The scene takes place at a train station where the main feature is a large yellow and white train that is moving along the tracks. The train is in the center of the image, pulling a set of passenger cars. The station has a bench near the tracks where people can sit and wait for their train. There are several people waiting around the station, some near the bench and others dispersed in different locations. Some individuals can be seen wearing backpacks, indicating their presence or preparation to board the train.", "mscoco_hallucinated_words": [["car", "car"], ["backpack", "backpack"]], "mscoco_gt_words": ["train", "person", "bench", "dog"], "hallucination_idxs": [41, 87], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 251857, "caption": "The image captures a skateboarder in action as he stands on a skateboard in the middle of a park. In the background, there are several people scattered throughout the park, some closer to the foreground, and others more distant. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 243212, "caption": "A small child is standing in a grassy area, holding a blue frisbee. The child seems to be playing with the frisbee, as they are standing in the middle of the field and holding it close to their body. In the scene, there are a couple of other people, but they appear to be further away from the child.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "frisbee", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 444304, "caption": "The image depicts a person, likely a man, crouching down and looking at a screen, which might be a television, possibly watching someone on the screen. There are two Wii remotes lying next to each other, likely for playing video games, on the bed where the person is sitting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "person", "bed", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 305000, "caption": "The image depicts an outdoor fruit and vegetable market where two women are shopping. They are positioned near the center of the image, surrounded by various produce such as apples, bananas, and pears. These fruits are arranged on multiple tables and dining chairs. ", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [44, 47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 44136, "caption": " The image features two men dressed in suits and sitting at a table during a party or event. The table is set with multiple chairs, as well as a variety of dishes and glasses. In addition to the table, there is another chair positioned on the side.", "mscoco_hallucinated_words": [["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [25, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 154846, "caption": " The image depicts a silver pizza tray with a cheese pizza topped with mushrooms in the center of the frame. Several people are in the vicinity of the pizza, with a few of them interacting directly with the pizza. There are four people within the scene: one person in the foreground, two in the middle, and another in the background. They appear to be gathered around a dining table or countertop. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 567254, "caption": " In the image, a young child stands in front of a television screen, holding a toy in her hand. There is a baby or toddler sitting or standing behind her, possibly being displayed on the TV screen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "vase", "tv", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 79261, "caption": "According to the picture, there is a man pushing a broom in front of a building, possibly at a plaza or a park. The man is walking in the water, most likely trying to clean the area around the building. There are other people in the background, most of them seem to be observing the man at work. Some of them are standing in various positions, while others appear to be walking around in the vicinity. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": " The image depicts a man sitting in a chair with a laptop open on his lap. He is wearing a black hat and appears to be sitting in a living room setting. Behind him, there are two chairs positioned diagonally in the room. A person can be seen standing near one of the chairs and sitting next to the wall.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "potted plant", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 415828, "caption": "The image shows a group of people standing near a city street under an umbrella, protecting them from the rain. The umbrella is set up with several people gathered around it, creating an outdoor gathering atmosphere. The people are on the right side of the scene, close to the buildings and a car parked nearby. Some of the individuals have backpacks on their backs, while others are carrying bags.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "umbrella", "traffic light", "stop sign", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 543231, "caption": " In the image, a woman with long hair is standing in a living room, holding a Wii remote and taking a picture with it. Behind her, there are two people: one person on the left side of the scene and another on the right side. All three people are looking at the woman taking the picture, creating a dynamic scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "bowl", "book", "bed", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 495357, "caption": "The image captures a scene where a man is riding a motorcycle down a street or avenue, with cars and other traffic around him. There are several people in the scene, with one person standing near a vehicle in the middle of the street. The street appears to be busy with various people walking, including at least one close to the motorcycle rider.", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["motorcycle", "person", "handbag", "dog"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.5}}, {"image_id": 523175, "caption": "A bowl of vegetable soup is the main attraction of the image, filled with various ingredients, including carrots and tomatoes. The soup has a spoon in it, likely filled with the soup's delicious contents. The bowl is placed on a dining table, and the spoon can be seen close to the bottom of the bowl.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "carrot", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 184945, "caption": " The image features a parking lot filled with motorcycles and scooters of different sizes, including a red motorcycle with a sidecar, as well as several smaller motorcycles and scooters. Some of the motorcycles are situated near a few bicycles and bicycle helmets. The scene suggests a parked motorcycle parking lot along the side of a road. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 49942, "caption": "The image shows a man wearing a tie and a pink shirt, with his head under an umbrella. He appears to be staring intensely at something, possibly a light or an illuminated area, which is outside the frame. There are other people in the background, some closer to the man and others more towards the edges of the image. A few umbrellas are visible, including one located behind the man and another umbrella placed further back in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "According to the image, there is a blue train sitting at a train station. The train is positioned close to the end of the station platform, and some passengers are approaching the train, likely disembarking or boarding it. There are a total of 11 people visible in the image, and they are spread out across the train station, with some closer to the train and others further away. A few of them are holding bags, likely containing their belongings or shopping purchases.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["suitcase", "handbag", "train", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 42889, "caption": " In the image, there are two computer-related items: a computer keyboard and a computer mouse. There is also a teddy bear with headphones on top of them, positioned close to the keyboard and mouse. The teddy bear appears to be wearing glasses as well.", "mscoco_hallucinated_words": [["computer", "laptop"], ["computer", "laptop"], ["mouse", "mouse"], ["mouse", "mouse"]], "mscoco_gt_words": ["keyboard", "teddy bear", "bear"], "hallucination_idxs": [11, 15, 16, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": " The image features multiple vases and sculptures placed on a dining table. The main focal point is a yellow bowl-shaped vase filled with green grass and a variety of sculptures. A few vases and sculptures are situated around the yellow vase, creating an interesting composition. Some of the sculptures are located on the left side, a few are on the right side, and one is at the back of the table. The table seems to be used for displaying artwork or decorating the room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "potted plant", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 247587, "caption": "In the image, there is a young man standing on a surfboard, apparently attempting to surf. He is the central figure in the scene, with others scattered around him. The beach setting is surrounded by trees, giving it a more natural and lively feel.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "There is a large blue city bus stopped on the street, with a bicycle parked at the edge of the bus. The bus appears to be empty, as it's parked on the street with the passenger door open. A man is standing on the left side of the bus, possibly waiting or walking by. Another person can be seen in the distance on the right side of the street, possibly getting off another bus or just passing by. The scene is a mix of city life and the presence of a bicycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 545056, "caption": "In the scene, there is an expansive field filled with black and white cows grazing in various positions. Some are standing, while others are lying down, dispersed throughout the grassy area. The cows can be found throughout the field, with some closer to the left side and others closer to the right side. In the foreground, there is a fence separating the cows and the field from the rest of the area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "sheep", "cow", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": "There is a man standing in a kitchen, likely preparing food or attending to ingredients, and a woman is present nearby. There are multiple bottles in various positions around the kitchen, and there are two spoons in the image.", "mscoco_hallucinated_words": [["bottle", "bottle"], ["spoon", "spoon"]], "mscoco_gt_words": ["cup", "oven", "dining table", "bowl", "book", "carrot", "person"], "hallucination_idxs": [27, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": "The image shows a group of people standing outdoors in a snowy setting. They appear to be skiers and are wearing backpacks. There are eight people in total, spread out in different parts of the scene, with four of them standing near the snow, holding skis.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 246963, "caption": " The image depicts a group of motorcyclists riding down a road with the traffic sign of a \"STOP\" sign nearby. There are seven motorcycles visible in the scene, with a couple of them closely followed by the rest. The motorcycles are positioned along a road, with some of them closer to the \"STOP\" sign and others farther away.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "stop sign", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 498225, "caption": " In the image, a group of people is standing in front of a shop, likely a cake store, with a sign displaying a cake inside. There are three men in the crowd, one closer to the left side, one more centered in the group, and the last person at the far right. There are also a couple of women in the mix, with one on the right side and another standing in the middle. A handbag and a hand can be seen among the crowd. The scene appears to be outdoors, possibly in front of a bakery store.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cake", "cell phone", "donut", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124601, "caption": "The image displays a nicely appointed small kitchen area, featuring wooden cabinets, countertops, and appliances. There is a laptop sitting on the counter, possibly being used by the occupants or for working remotely. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["keyboard", "cup", "dining table", "laptop", "bottle", "wine glass", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 581749, "caption": " The image features a table full of fruits and vegetables. Among the produce, there are multiple types of vegetables such as broccoli, oranges, and bananas. Some of the broccoli is placed on the top of the table, with a large amount covering most of it, while others are scattered around the rest of the table. The oranges are also scattered around, with some located near the broccoli and others closer to the edges of the table. Bananas can be seen in the background, placed on the left side of the table.", "mscoco_hallucinated_words": [["banana", "banana"], ["banana", "banana"]], "mscoco_gt_words": ["broccoli", "orange", "dining table"], "hallucination_idxs": [28, 86], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 227960, "caption": " The image displays a tray of food with various vegetables and carrots on a dining table. There are several spoons in the pan, indicating different levels of preparation. One spoon appears to be closer to the top of the food, while another spoon can be found towards the middle of the dish. The carrots can be seen in the upper and lower parts of the pan, with a few scattered throughout the dining table.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["cup", "bowl", "spoon", "wine glass", "carrot"], "hallucination_idxs": [15, 79], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.4}}, {"image_id": 286503, "caption": " In the image, an elephant stands outdoors near a fence, likely in a zoo or park. The elephant is occupying the center of the scene, with two people standing nearby: one near the left edge of the image and the other to the right. Both people are standing and watching the elephant eat a bird nest.", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["person", "sports ball", "elephant"], "hallucination_idxs": [60], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": " There is a large, clean kitchen with granite countertops in the picture. The kitchen features stainless steel appliances including a sink and a refrigerator. A dining area is situated nearby, complete with chairs and a couch, creating a comfortable seating arrangement. The couch is located in the middle of the room, while the chairs are arranged around it. ", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["oven", "tv", "bowl", "potted plant", "sink", "microwave", "chair", "couch"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.375}}, {"image_id": 142108, "caption": " The image depicts a group of people in a kitchen with several bananas spread out over a long table. They are gathered together to eat the bananas and are wearing blindfolds, possibly playing a game or completing a dare. The scene includes three main people among the group, each wearing a backpack.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 542933, "caption": " The image captures a baseball game scene, with several people gathered around a field. A baseball player is getting ready to hit a ball, holding a bat and standing at home plate. There are several other individuals in the area, including an umpire, a catcher, and a coach who are all focused on the action happening on the field. ", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "baseball bat", "sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 475043, "caption": "In the image, there is a woman sitting on a couch with a laptop in front of her. She is holding a beer and is about to drink it, while using the laptop. In addition to the woman, there are several chairs around the room, with one chair placed near the laptop and the other chairs scattered around. ", "mscoco_hallucinated_words": [["chair", "chair"], ["chair", "chair"], ["chair", "chair"]], "mscoco_gt_words": ["bowl", "spoon", "laptop", "bottle", "person", "couch"], "hallucination_idxs": [46, 53, 61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 456784, "caption": "The image features a person holding a knife and cutting a chocolate cake. The knife is used to remove the cake's top, revealing its frosting. The person appears to be intent on completing the cutting task with precision.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": " In the scene, there is a park bench sitting on grass with a remote control placed on top of it. The remote control is sitting near the armrest of the bench, giving the impression that it has fallen onto the bench. The bench seems to be made of wood and features the words \"wood\" and \"iron\" in various locations.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "bench", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 448694, "caption": " In the image, a large crowd of people is gathered at an open field, with a group of boys actively playing a frisbee game. Among the crowd, some people appear to be watching the frisbee players closely, while others are scattered throughout the scene, enjoying the day or waiting for their turn to play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["frisbee", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": "The image features a bowl filled with food, specifically noodles and vegetables. The noodles are likely stir-fried noodles, and the vegetables include broccoli and carrots. There are several carrots visible in the dish, along with several pieces of broccoli distributed throughout the bowl.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 503210, "caption": "This is an image of a nicely furnished living room filled with a variety of seating options. The room has a couch on the left side and a chair placed in front of it. Two chairs can be seen on the right side of the room, one closer to the couch and another towards the center. A bench or loveseat can also be spotted towards the center of the living space. The room has two potted plants, one placed near the center and the other near the right side.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["clock", "potted plant", "vase", "orange", "book", "chair", "couch"], "hallucination_idxs": [61], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.42857142857142855}}, {"image_id": 142257, "caption": "There are several people waiting at a stop light in this scene. Numerous cars can be seen along the street, each with a traffic light above them. A person stands near a car, and there are others around the traffic light, likely waiting to proceed. The cars are parked or driving in various directions, indicating that it's a busy street with several cars in traffic.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}], "overall_metrics": {"CHAIRs": 0.33867735470941884, "CHAIRi": 0.10801393728222997, "Recall": 0.5586490187129165, "num_hallucinated_caps": 169.0, "num_caps": 499.0, "hallucinated_word_count": 310.0, "coco_word_count": 2870.0, "length_response": 69.64529058116233, "hallucinated_caps_ls": [144305, 308003, 54870, 324008, 234433, 524656, 401027, 253630, 326021, 161227, 343534, 254368, 231534, 209347, 523292, 209925, 2154, 104326, 233888, 277479, 300784, 169169, 344942, 465601, 300124, 490491, 133146, 498747, 227204, 185327, 348654, 482432, 37907, 506401, 406611, 278365, 368367, 21779, 405183, 148977, 399058, 325836, 437910, 401010, 567562, 313928, 346940, 122602, 183709, 59710, 527497, 411564, 285302, 258209, 285820, 245664, 453731, 253262, 418692, 259345, 443299, 545594, 437720, 288576, 535403, 426172, 180764, 147762, 425447, 443259, 301837, 240681, 278890, 47781, 14773, 521863, 250540, 298067, 563938, 261318, 272728, 188136, 99874, 557447, 357, 134223, 301266, 475813, 539951, 143479, 230983, 564629, 53465, 381031, 530520, 9628, 118124, 288729, 252625, 493610, 30667, 366111, 532481, 70815, 88269, 463211, 396269, 98979, 536154, 560018, 256628, 364783, 531327, 451449, 51054, 575406, 436814, 63934, 275611, 478380, 422326, 45550, 444913, 82709, 303099, 205193, 170099, 354540, 191301, 124236, 344930, 14285, 82488, 150703, 226802, 49005, 319607, 210372, 349642, 45435, 336209, 283118, 429598, 424174, 490711, 163728, 281028, 304984, 145448, 352900, 202658, 39663, 474499, 105782, 487990, 111409, 127659, 305000, 44136, 495357, 42889, 401411, 581749, 227960, 286503, 315740, 142108, 475043, 503210]}}}