{"overall_metrics": {"CHAIRs": 0.15430861723446893, "CHAIRi": 0.06597434331093463, "Recall": 0.45458694659972615, "num_hallucinated_caps": 77.0, "num_caps": 499.0, "hallucinated_word_count": 108.0, "coco_word_count": 1637.0, "length_response": 37.33066132264529, "hallucinated_caps_ls": [144305, 54870, 524656, 253630, 326021, 254368, 231534, 209347, 523292, 209925, 277479, 344942, 465601, 185327, 348654, 482432, 37907, 506401, 368367, 325836, 401010, 567562, 346940, 122602, 411564, 259345, 545594, 437720, 535403, 426172, 443259, 563938, 301266, 475813, 539951, 143479, 230983, 564629, 53465, 381031, 530520, 9628, 288729, 252625, 493610, 30667, 366111, 532481, 396269, 536154, 256628, 575406, 275611, 478380, 45550, 444913, 82709, 344930, 14285, 150703, 49005, 283118, 429598, 490711, 163728, 304984, 105782, 487990, 111409, 127659, 44136, 495357, 42889, 401411, 581749, 227960, 315740]}, "results": {"sentences": [{"image_id": 144305, "caption": "There are a few computer devices displayed in the image. A laptop, placed on top of a table, is sitting in the center of the scene, connected to various cables and a monitor", "mscoco_hallucinated_words": [["monitor", "tv"]], "mscoco_gt_words": ["laptop", "keyboard", "dining table"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 164091, "caption": "In the image, there are two dolls placed side by side in a baby carriage, a white one and a teddy bear, along with a bed in the background. The white doll, dressed in a blue outfit, seems to be sleeping in the carriage while the teddy bear is sitting next to it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 308003, "caption": "The image shows a wooden bench with a splintered, worn surface. It appears to be in a state of disrepair, possibly indicating a public park or a picnic area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 54870, "caption": "There is a plate with a sandwich and salad on it, sitting on a table", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["knife", "sandwich", "fork"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 95908, "caption": " The image features a desk with a mouse pad placed in front of it, and a computer keyboard resting on the desk's surface. A computer mouse is sitting on top of the keyboard, while a small metal cup with a handle is positioned nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 329614, "caption": "This is a large image, featuring a cobblestone street with multiple lanes and a few buildings surrounding it. In the foreground is a cute, yellow school bus travelling down the street, possibly passing by a group of people near a church or other building", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 324008, "caption": "The image displays a tray full of food with various items. There are three bananas neatly arranged on the tray, as well as a cup filled with yogurt", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "banana", "train", "orange", "airplane", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 52700, "caption": "A group of people is standing around a dining table in a large, open room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "cake", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 501311, "caption": "The image captures a baseball game in progress, with multiple people playing on the field. The main focus is a pitcher throwing a baseball while wearing baseball cleats, which can be seen in the center of the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 247179, "caption": "The scene is a city street at night where a group of people dressed in white is gathered around two elephants walking down the street. The elephants are decorated with garlands or lanterns on their backs", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "elephant", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 577912, "caption": " It is a well-lit bathroom scene with a toilet and a sink positioned along the walls", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toothbrush", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 196341, "caption": " The image shows a large clock with two circles on its face, surrounded by a group of people standing close to it. The clock appears to be the primary focus of the scene, with many people gathered around it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 234433, "caption": "According to the image, it depicts a street scene with a couple of cars parked on the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 455610, "caption": "The scene depicts a group of people gathered around a red and green motorcycle parked on the street. Several individuals are standing near the motorcycle, engaged in conversation or observing the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 39053, "caption": "The image presents a table filled with a variety of food items. There are several dishes on the table, including bread and a bowl filled with a yellow substance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "spoon", "cake", "apple", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 493862, "caption": "According to the picture, it features a person standing in a field with a fly swatter, seemingly enjoying the outdoors. The scene is calm, and the person is standing still, surrounded by green grass in the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 269873, "caption": "The image is a black and white photo of a group of people standing in the street near the intersection of Avenue [IDK]. There are several people in the scene, with some of them wearing ties, as well as some others standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 524656, "caption": " The image depicts a small kitchen filled with various items and appliances, such as a sink, an oven, and a microwave. In the center of the kitchen, there is a counter with two bowls, multiple cups and wine glasses placed on top, creating a display of bottles and vessels", "mscoco_hallucinated_words": [["vessel", "boat"]], "mscoco_gt_words": ["bowl", "oven", "spoon", "bottle", "sink", "wine glass", "knife", "microwave", "toaster", "refrigerator", "cup"], "hallucination_idxs": [55], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.6363636363636364}}, {"image_id": 369547, "caption": "The image shows a beach scene where a dog and a cat are sitting on the beach together. The dog is near the left side of the image, while the cat is more centered in the shot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "bird", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 181446, "caption": " In this image, a man is leaning on the open refrigerator door while standing in front of it. The scene appears to take place in a kitchen setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "bottle", "sink", "person", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 128119, "caption": " The image captures a large bus on a city street with a bike and a number of bicycles parked on the sidewalk in front of it. Some of the bikes have been locked to the posts while others are unlocked", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "bus", "bicycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 401027, "caption": " The image features a table with a plate of food on it, including a sandwich and a cup of dressing. There is also a knife on the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "sandwich", "knife", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 253630, "caption": "The scene features a small square table sitting in the middle of the room, surrounded by various items including a television, a remote control, and a bowl. The table is positioned next to a window on the left side of the room", "mscoco_hallucinated_words": [["remote", "remote"], ["bowl", "bowl"]], "mscoco_gt_words": ["chair", "dining table", "potted plant", "tv"], "hallucination_idxs": [24, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 326021, "caption": " A woman sits at a table wearing a blue denim jacket with a teddy bear beside her. The table is draped with a red tablecloth and is surrounded by chairs on all sides", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["teddy bear", "dining table", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 317320, "caption": " The scene depicts a man riding a bicycle down a street with several other people, some of whom are walking. The streets have been marked with yellow crosswalks", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 507171, "caption": "In the image, a man is riding a surfboard on a wave in a large body of water. The waves are near a bridge, which can be seen in the background of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 161227, "caption": " According to the picture, there is a chair sitting in the room with two remotes and a cell phone on it. One of the remotes is located on the left side of the chair while the other two remotes are placed on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "remote", "chair", "couch", "keyboard", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 25812, "caption": " The scene features a person snowboarding down a ski slope, skiing with a snowboard rather than skis. There is a ski lift nearby, possibly at a ski resort", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 267537, "caption": "The scene depicts a group of women sitting on bleachers, with one wearing a coat. The women are sitting next to each other and covering themselves with rain umbrellas", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "umbrella", "person", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 195645, "caption": " There is a group of people sitting outside at a table, dining and having a conversation. They are gathered around a dining table, which is placed in the middle of the outdoor space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "wine glass", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 155671, "caption": " A sunny day finds a traffic light hanging over the middle of the street, near the intersection with several roads. Cars, trucks, and a semi-truck are driving on various parts of the city street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412894, "caption": "A busy street scene is taking place with a large group of people walking along the sidewalk near a subway entrance, creating a bustling atmosphere. Some people are carrying bags and purses, while others are just walking around in the vicinity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "backpack", "traffic light", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 32625, "caption": "There is an image in front of a table, displaying several plates with food. There are various pieces of food on the plates, including two pieces of food in the upper left corner, a bowl and another plate of food", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "book", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 18688, "caption": "The image shows a baseball game taking place in a field. There are several people standing around the area, some near the batter and others scattered throughout the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 343534, "caption": "The image depicts a small doll sitting next to two stuffed animals. The doll is wearing a pink outfit, possibly a top and skirt, and is holding onto one of the stuffed animals", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "potted plant", "chair", "book", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 370657, "caption": "According to the image, there are several people riding motorcycles, one of which has a young child sitting on it. Some of the people are wearing masks", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 481596, "caption": " In a crowded sports stadium, a man is standing in the bleachers while eating a hot dog. There are several people in the background, spread throughout the stadium, suggesting a busy atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 460347, "caption": "The scene depicted is a large bus driving down a road with many cars following it. There are a total of eight cars, two of them are in front of the bus and the others are positioned further behind it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 398753, "caption": "The scene is an outdoor baseball field with a large crowd watching a game. There is a baseball player on the pitcher's mound, while the batter is holding a baseball bat ready to hit the ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 67833, "caption": " The image features a large pizza placed on a tabletop, likely in a restaurant or home setting. There is another pizza sitting on top of a plate placed near the first pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "bottle", "knife", "wine glass", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.1111111111111111}}, {"image_id": 272880, "caption": " The image shows a piece of pizza sitting on a plate that is placed on a table or dining table. The pizza appears to be mystic pizza, an artistic and hand-painted pizza that is famous for its unique and intricate designs", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 254368, "caption": " The image features a man sitting in a bed and eating a dish, which could be either a piece of cake or a sandwich. He is dressed in a robe, which adds a comforting and warm atmosphere to the scene", "mscoco_hallucinated_words": [["cake", "cake"], ["sandwich", "sandwich"]], "mscoco_gt_words": ["pizza", "person", "bed"], "hallucination_idxs": [21, 24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 193108, "caption": " The scene takes place at a baseball field where a couple of players are standing on the field. There are two main players, one of them holding a baseball bat in his hand, swinging at an object near him, likely an incoming ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 459141, "caption": "According to the picture, there is a woman wearing a conical hat, riding a bicycle, and carrying a large bag on the bike's rack", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 231534, "caption": "The image features an airport runway with a few vehicles, including a large truck and a few cars. The truck is positioned near the runway, loading luggage", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["handbag", "airplane", "person", "truck", "suitcase"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 227031, "caption": "The image depicts a man and a woman cutting a three-tiered wedding cake together, possibly in front of an audience. The man is wearing a black suit, and the woman is wearing a gray suit", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "tie", "cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 209347, "caption": "The image depicts a woman in a black leather jacket sitting on a bench in a subway car, holding her cellphone and looking down at it while wearing earrings. The woman is surrounded by various people on the subway car, including two people to her left and one person in front of her", "mscoco_hallucinated_words": [["bench", "bench"], ["car", "car"], ["car", "car"]], "mscoco_gt_words": ["cell phone", "train", "tie", "handbag", "person"], "hallucination_idxs": [13, 17, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 523292, "caption": "The scene features a cat sitting on a couch, which is located at the center of the image. It is leaning its head onto a large open laptop that is on top of a table in the living room", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["dining table", "cat", "chair", "laptop", "tv"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6}}, {"image_id": 12375, "caption": "According to the picture, a plate with food on it is a meal featuring multiple items including meat, potatoes, and vegetables. The meal includes a variety of items like potatoes, carrots, and broccoli", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 13550, "caption": "In the image, a woman is standing on top of a large rock, surrounded by two elephants. The woman appears to be kneeling down and leaning over the rock, close to the water", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "elephant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 146837, "caption": " The image features a young man riding a skateboard on a rail, which is a horizontal surface that extends from the sidewalk, possibly a concrete edge or metal rail. The skateboarder is positioned on the rail, focusing on his ride", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 67042, "caption": " In the image, a man is standing in a corridor or walkway and checking his cell phone. The man is wearing a white shirt and tie", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209925, "caption": "A man is standing on a baseball field, wearing an orange uniform. He is holding a baseball and appears to be getting ready to throw it during an inning", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [11], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 2154, "caption": "The image features a shopping cart sitting on the sidewalk near a curb, next to a park bench. The bench appears to be a common area where people gather or rest", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "car", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 274931, "caption": "The image is a black and white photo of a sidewalk with parked cars along the side of the street. There are several cars parked next to each other on the sidewalk, creating a row of vehicles", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "traffic light", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 104326, "caption": "In a scene filled with green, a small but cozy kitchen has been renovated with wood floors. A dining table with six chairs is situated near the kitchen, occupying most of the space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "oven", "chair", "clock", "sink", "knife", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 233888, "caption": " The image features a young boy, likely in his home, playing baseball in front of a large TV. He is holding a bat and has just swung it at a television screen that displays a baseball game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "baseball bat", "person", "baseball glove", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 213905, "caption": "The scene features two female tennis players, both dressed in white, standing on a tennis court together. They are holding tennis rackets and appear to be friends", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "backpack", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 277479, "caption": " The image shows a young girl standing near the refrigerator, looking down into the refrigerator and about to open the door to inspect the contents. The fridge is positioned in the left-hand corner of the scene, surrounded by various bottles and a bowl placed on the shelf", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["bottle", "refrigerator", "person"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 300784, "caption": "According to the picture, there is a group of four children and a pet dog sitting on a couch. The children are smiling, indicating a friendly atmosphere among them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "dog", "person", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 183766, "caption": "The image features a collection of small cakes with fruit on display. They appear to be decorated with various types of berries, including cherries, as well as ornaments", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "cake", "knife", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 169169, "caption": "A busy sidewalk is the setting of the image with people walking near different businesses. The street is lined with green banners, with one banner saying \"spring kkt", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 409346, "caption": "The image depicts two women standing in a room, attentively serving food from a buffet table. The table is filled with various desserts and sweet treats, like cupcakes, cake slices, and cookies", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 522262, "caption": "In the image, there are several bicycles parked against a fence, with some located closer to the viewer and others farther away. A handful of these bicycles are colorful, and most of them appear to be the same size", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 79014, "caption": "According to the picture, there is a man, wearing glasses, standing behind a mirror and holding an orange in front of his face, creating an interesting optical illusion. The orange creates the illusion that the man is holding a real lemon in front of him, while the reflection shows a different face or another object", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 344942, "caption": "The image is of a small robot toy sitting on top of a laptop computer. The robot is positioned prominently in the scene, appearing as if it is attached to the keyboard", "mscoco_hallucinated_words": [["keyboard", "keyboard"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 275929, "caption": " A series of white toilets can be seen on the stairs in this image, with several of them broken and placed against the steps in an unusual arrangement. The toilets appear to be scattered up the stairs, with some situated closer to the bottom, while others are closer to the top", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 465601, "caption": " The scene depicts a group of people standing outside under umbrellas, which provides shelter from the rain. Most of them are wearing backpacks and walking around", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["handbag", "bench", "person", "umbrella"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 300124, "caption": "The image shows a microwave sitting on a couch, which is placed on the sidewalk outside a building. The couch is empty except for the microwave, which appears to be in the process of being removed or repaired", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "dining table", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 46919, "caption": "According to the image, a group of people is hiking up a snow-covered hill. The group consists of at least three individuals, with one of them standing at the front of the group while the others are trailing behind", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 434022, "caption": " In the picture, a train is moving along the train tracks. The train is quite long and has multiple railroad tracks beneath it, extending off in different directions", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 490491, "caption": "There is an image with various signs on a red brick wall. The image includes several street signs in various positions, and a sign with a cross on it is visible", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["stop sign", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 175878, "caption": " In the image, a group of four people, two men and two women, are standing together at a porch, enjoying the outdoors. One of the men is looking at a cell phone, possibly using it to take a picture of the group", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 12669, "caption": "A woman in a white top and white pants is standing in front of a large artwork, possibly a painting or a statue, holding a wine glass. She is surrounded by various other objects and sculptures, including a vase on the right side of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "vase", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 192496, "caption": " There is a train station in the image, with two trains parked at a platform next to a building. The first train is closer to the building, occupying a central position along the platform", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 133146, "caption": "The image displays a large, comfortable living room with several pieces of furniture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "chair", "couch", "book", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 53744, "caption": "The image displays a cutting board with pieces of tomato, lettuce, and a knife on it. The tomato is cut into several pieces, with some pieces lying close together, and others scattered on the cutting board", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "dining table", "sandwich"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 375902, "caption": "The image features a winter scene with skiers gathered around an opening in the mountainside. There is a yellow jacket being worn by one of the skiers, who is positioned at the front of the group with their skis and poles on display", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498747, "caption": " The image features a young man in the center of the picture, talking on a cell phone. He is surrounded by several people, some of whom are looking directly at the man while others are in the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bottle", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 52435, "caption": "There are numerous people enjoying a beach day, with several people in the water, some surfing, and others watching from the shore", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "bird", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301977, "caption": "The scene features a brown cow, standing next to a store selling food and drinks, likely a grocery store or a food stand. The cow appears to be positioned close to the front of the store, with some of its body on the sidewalk and the remainder standing in the store's area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 261097, "caption": " A man is standing on a tennis court holding a tennis racket, possibly waiting for a match to begin. The court is crowded, with multiple people spread throughout the space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 227204, "caption": " The image depicts a group of three women, possibly of Asian origin, sitting around a dining table and talking on their cell phones. They are in a relaxed and casual setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "banana", "spoon", "bottle", "knife", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 353981, "caption": "The image is of an old airplane sitting on the ground. It appears to be a military aircraft, with a white and black coloring", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 185327, "caption": "The scene depicts a man walking in the middle of a street with a herd of domestic animals, including a few cows and an overwhelming number of zebras. They appear to be walking on the street, perhaps to be led by the man", "mscoco_hallucinated_words": [["zebra", "zebra"]], "mscoco_gt_words": ["bus", "horse", "car", "person", "cow"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 348654, "caption": "There is a cell phone lying on a stand in front of a computer. The cell phone appears to be turned on and connected to a keyboard and a mouse, which are also placed in close proximity to the phone", "mscoco_hallucinated_words": [["keyboard", "keyboard"], ["mouse", "mouse"]], "mscoco_gt_words": ["cell phone", "laptop", "tv", "dining table"], "hallucination_idxs": [27, 30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 482432, "caption": "The image shows two bottles placed close to each other on a bathroom sink, creating an interesting contrast. One bottle appears to be containing toothpaste, while the other is a dental product such as toothbrush", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["bowl", "toothbrush", "sink"], "hallucination_idxs": [4, 21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 454161, "caption": "The image features a piece of pizza cut in half on a plate, accompanied by a fork and a wine glass. The pizza sits on a dining table on a dining room rug", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "fork", "dining table", "wine glass"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 144723, "caption": "In the image, a dog is in the middle of a field, leaping forward to catch a frisbee that is being held by another dog close to its side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "dog", "frisbee", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 129707, "caption": "A person is holding a large cordless phone with a button-type keypad. The phone is placed in front of the person's face, taking up a significant portion of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": "The image displays two Wii video games, one on top of the other, with the Wii controller partially inserted into the top one. The games are arranged side-by-side, creating an interesting scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 203110, "caption": " The image captures a yellow trolley moving down a city street on a sunny day. The trolley has a sleek, modern appearance, featuring a large wheel on one side and an opening door on the other side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "train", "orange", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 556478, "caption": "In the image, there are two pizzas in metal pans, placed side by side on a kitchen stove. The tops of the pizzas appear crusted and browned, and they likely have been cooked and are ready to be served", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "oven", "bottle", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 103815, "caption": "The image captures a baseball player on the mound, actively pitching the ball. The batter is positioned on the opposite side of the field, holding a baseball bat and swinging it towards the ball during a game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": "The scene depicts a spacious bedroom featuring a large bed, likely a king-size or queen-size bed, with matching bed sheets and white bedding. The bed is situated towards the center of the room and is accompanied by two bedside lamps, one positioned on each side of the bed", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "dining table", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 268854, "caption": " The scene takes place near a beach or lake where a woman is carrying a large basket, walking on the sand. She is near the water and there are a few small boats parked or moored near her, which have caught her attention", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 34464, "caption": " The image features a group of people and a surfboard at the beach", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "surfboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 495513, "caption": " The scene in the picture features two people, a man and a woman, standing next to each other holding up hot dogs on sticks for a photo. They appear to be posing for a picture, perhaps as a fun way to capture a moment or for a social event", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "hot dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 529256, "caption": " The scene features a red bedspread with two dogs on it, one on the left side and another on the right. The left dog is positioned near the edge of the bed, while the right dog is closer to the middle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 37907, "caption": "According to the picture, there is a cat sitting on various electronic devices near a television. The cat appears to be sitting on top of a TV, as well as other electronics, such as two TV remotes, a remote control, and a CD player", "mscoco_hallucinated_words": [["remote", "remote"], ["remote", "remote"], ["player", "person"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [40, 43, 49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.375, "Recall": 0.6666666666666666}}, {"image_id": 506401, "caption": "The image features a traffic light in a small town or village, situated among some buildings and near a street. There are two dogs nearby, with one dog close to the traffic light and another further away", "mscoco_hallucinated_words": [["dog", "dog"], ["dog", "dog"]], "mscoco_gt_words": ["cat", "traffic light", "car", "person", "truck"], "hallucination_idxs": [25, 30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.2}}, {"image_id": 406611, "caption": " The image features a snow-covered hill where a large group of people is gathered. Some people are sitting in the snow, while others are skiing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 267225, "caption": "The scene shows several trucks and vehicles parked outside, including a truck with a white body and satellite dishes mounted on top of the roof. There are at least five people walking around in the area, some closer to the truck and others further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "car", "person", "truck", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 278365, "caption": "In the image, there are two people seated at a dining table enjoying pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "chair", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 69757, "caption": "The image portrays a family gathering around a dining table, where an elderly woman is cutting a piece of chocolate cake. The family members appear to be interested in the cake-cutting process, with one man looking closely at the knife", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "cake", "knife", "wine glass", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 432213, "caption": "The image features a plate with a sandwich and fries as the main food items on the menu. The sandwich appears to be served on a bun, with two slices and several toppings, giving it an appealing appearance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "sandwich", "wine glass", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 368367, "caption": " The image depicts a smiling woman holding a cell phone, talking on the phone while in an outdoor setting. She is standing close to a table, likely the dining table, and has her arms close to her body", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [27, 32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 201555, "caption": "There is a dirt road right next to a sandy area where construction is occurring. A few traffic lights are scattered throughout the area, and multiple road signs are present, including a \"Royal National Boating Championships\" sign near the construction site", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 92131, "caption": "The image captures a public transit scene, with a green and white city bus parked at the edge of a parking lot. The bus is quite large and occupies a significant amount of space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 21779, "caption": "In the image, there's a woman who appears to be lying down on the floor. She is holding a large red and white suitcase and is positioned near the left side of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 565148, "caption": "The scene takes place in a baseball stadium where a baseball player is holding a baseball bat on the baseball field. The player is getting ready to hit a baseball and take a swing during a baseball game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 160183, "caption": "The scene features a fire truck parked in a closed parking lot. It is positioned close to the building, near the front", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 555456, "caption": " The image depicts a person skiing on a snowy hill. They are dressed in a blue parka and black pants, and their snow skis are visible on the ground", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 235595, "caption": "The image shows several motorcycles parked next to a building along a street. One of the motorcycles appears to be red, while the others are black", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "potted plant", "car", "parking meter", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 489909, "caption": "The image displays a basket filled with various fruits, specifically oranges, bananas, and cucumbers. These three fruits are spread out within the basket, each occupying a different section", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "orange", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501080, "caption": "The image shows a large, white vase filled with a beautiful, pink flower arrangement. The pink flowers are arranged in a way that creates a striking, colorful composition", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 405183, "caption": "A group of people, including a woman, are gathered in a living room playing a video game. The woman is holding a Wii remote, presumably controlling the game, while others watch her", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "potted plant", "chair", "couch", "book", "handbag", "person", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 409138, "caption": "The scene is set in front of a park or a parking lot, with a large white and purple bus positioned in the center of the image. The bus has various colored stripes, giving it a distinct appearance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 411840, "caption": "The scene is set on a tennis court, where two women in tennis outfits are walking across the court. Both players are holding tennis rackets, indicating that they are playing a match", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 333712, "caption": "The image is a pizza topped with cheese and onions, with a sliced slice taken out of it. The pizza appears to be crispy and well-cooked", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "spoon", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 148977, "caption": "In the image, there is a teddy bear with headphones on. It is sitting on a table or surface, possibly indicating that the environment is someone's home or workplace", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "dining table", "bottle", "bear", "keyboard", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 399058, "caption": " The image shows a man riding on the back of an elephant while walking down a road. The elephant is positioned near the center of the image, and the man appears to be sitting on its back as they travel together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "orange", "person", "truck", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 183571, "caption": " The scene features a boat with a red and black design, likely a houseboat, on which a man is standing, looking down at the water on the other side of the boat. A dog is situated on the other side of the boat, near the man", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "The scene shows a motorcycle prominently displayed at the front of a room, along with other items such as a wine bottle, cups, and pots placed around it. There is a crowd of people in the room, some of them closer to the motorcycle while others are more spread out", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cup", "cup"]], "mscoco_gt_words": ["handbag", "motorcycle", "bicycle", "person"], "hallucination_idxs": [22, 24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.5}}, {"image_id": 437910, "caption": "The image features a small, narrow bathroom with a toilet and sink occupying a significant portion of the room. The toilet is positioned towards the right side, and the sink is near the left side, both visible from the bathroom door", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toilet", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 155885, "caption": "The image is an old black and white photo of a small harbor with several boats docked in it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 467887, "caption": "The image depicts a man sitting in a chair, wearing glasses, and using a laptop computer. He is positioned at the edge of a couch, in front of a table with various items on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "couch", "mouse", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 174402, "caption": " The image features a young man standing on a tennis court, holding a tennis racket in his hand as he is ready to play. He is dressed in white clothing and appears to be waiting for the ball to come to him or for the game to begin", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 352734, "caption": " In the image, a young boy is eating some kind of sweet dessert, likely cake, with a spoon", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "cake", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 346716, "caption": "The image features a busy city street with two red double-decker buses moving down the street, following a route close to one another. The first bus is travelling from left to right near the center of the scene, while the second bus is located towards the right side of the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 451144, "caption": "The image features two snowboarders on a snow-covered slope, with their skis on their backs. The scene is a beautiful backdrop for the mountains, with some clouds visible in the sky", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 295656, "caption": "According to the image, there are two people in the scene who seem to be in the process of eating slices of pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "tie", "clock", "car", "person", "pizza", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 129886, "caption": " The image shows a group of people dressed in snow gear and standing near ski equipment on a ski slope. One person is wearing skis and two others are standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 557998, "caption": "According to the picture, it features a young boy sitting at a table with a bottle placed next to him, possibly alcohol or a drink. He is holding his hand close to his face, seeming to be taking a picture or having a conversation", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 401010, "caption": "The scene is a bedroom with a single bed and a television, giving the appearance of a hotel room or a room for a disabled person", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["chair", "sink", "tv", "bed"], "hallucination_idxs": [26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 567562, "caption": "According to the picture, there is a woman sitting at a table with two children, enjoying a meal together. On the table, there is a bowl of food and a knife", "mscoco_hallucinated_words": [["knife", "knife"]], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "spoon", "bottle", "book", "broccoli", "person", "pizza", "cup"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.2727272727272727}}, {"image_id": 313928, "caption": "The image depicts a balcony or patio area overlooking a beach with a surfboard placed on the beach. There are three chairs surrounding the table, with one chair located closer to the surfboard, another chair near the edge of the balcony, and a third chair further back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "dining table", "chair", "bottle", "wine glass"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 490284, "caption": "In the image, a black dog is laying across a couch, possibly resting or sleeping. The dog is positioned in the foreground and extends towards the left side of the frame", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 346940, "caption": " The image depicts a kitchen scene with a stovetop featuring a steaming kettle and a frying pan, along with a few other kitchen utensils. There are several knives scattered around the kitchen, with a bottle and a spoon nearby", "mscoco_hallucinated_words": [["bottle", "bottle"], ["spoon", "spoon"]], "mscoco_gt_words": ["bowl", "knife", "oven"], "hallucination_idxs": [37, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 222782, "caption": "The image features a street scene with a person standing next to an old fashioned street sign that says \" amoco\" and another person walking on the sidewalk. The people are not doing anything special or significant in the picture, just standing there on the sidewalk", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 255934, "caption": " The image features an elephant and a smaller elephant walking next to each other. They are standing in a dirt area, possibly an enclosure, surrounded by various people and other elephants", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "elephant", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 278627, "caption": "The image is a collage of various dishes and people eating and posing. The scene has two men standing close to each other, a woman in a red blouse, a man standing in front of a plate of food, a person holding a plate, and a person holding two cups of food", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "potted plant", "spoon", "chair", "bottle", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 400851, "caption": " The image features a large group of people enjoying a day at the beach, with a handful of tents and some chairs providing shade. The beach area is surrounded by various tents and umbrellas, likely set up for those attending the event", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "boat", "chair", "sports ball", "umbrella", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 472610, "caption": " In the image, a woman stands next to a white dog while a group of cows surrounds them. The cows are spread across the field, some closer to the woman and others further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": "The image portrays a group of people standing on a sandy beach. There are several small sailboats floating around in the water nearby, giving the impression of a windy day", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 388237, "caption": " The image shows a man riding his skateboard down the street, with the surrounding area filled with people in various poses", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 289640, "caption": "The image features a man standing in a public setting, holding a frisbee over his face. In the background, several people can be seen sitting at dining tables arranged around the area, some close to the foreground, and others a little further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "wine glass", "orange", "person", "cup", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 122602, "caption": "A person is sitting and working on a large table covered with several different items such as books, scissors, glue, and a glue bottle. The table is filled with various objects related to crafts or paper projects", "mscoco_hallucinated_words": [["person", "person"], ["book", "book"]], "mscoco_gt_words": ["scissors", "bottle", "dining table"], "hallucination_idxs": [1, 17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 24104, "caption": "The image features a desk with three computer monitors neatly arranged on it. On the left side of the desk, there is a laptop with a screen occupying the central position, while another laptop sits towards the top right corner", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bed", "mouse", "keyboard", "book", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 183709, "caption": "According to the picture, there is a group of people standing next to each other near a crosswalk. They appear to be walking along the sidewalk, likely crossing the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 59710, "caption": " The image shows a refrigerator with its door open, revealing its contents. The fridge sits on top of a wooden floor in a bedroom", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 279407, "caption": "According to the picture, the scene features a young boy in a baseball uniform standing on a field and holding a baseball ball. Another child, also in a baseball uniform, is standing in the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "sports ball", "bench", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 314729, "caption": "The image is a vintage baseball card featuring a baseball player posing for the photograph. The player is holding a baseball bat, swinging it and waiting for the ball to be thrown", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 527497, "caption": "A computer is sitting on a desk, with two computer monitors displaying different screens. There is a keyboard sitting next to the computer, possibly indicating an office or work setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 411564, "caption": "According to the picture, there is a man sitting by a window, likely looking outside. The man is sitting on a couch or chair placed underneath the window", "mscoco_hallucinated_words": [["couch", "couch"]], "mscoco_gt_words": ["cell phone", "dining table", "chair", "bottle", "person", "refrigerator"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.3333333333333333}}, {"image_id": 320350, "caption": "The image displays a snowy scene where two people, presumably a man and a woman, are skiing with their two dogs. They are standing next to each other and their skis are positioned on the snowy ground, indicating their presence", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": " A baseball game is taking place on a grassy field. There are several people on the field, including players, coaches, and umpires", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "sports ball", "bench", "person", "baseball glove", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 285302, "caption": " The image features a young boy sitting at a dining table, possibly playing or engaging in an art project. He is cutting up a paper sheet using scissors, located near the table's center", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "scissors", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 61959, "caption": "The image features a bus traveling down the street, driving alongside several cars parked near the sidewalk. The bus appears white, and some of the cars are parked on both sides of the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "traffic light", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 229659, "caption": "This is an image of a kitchen and restaurant. The kitchen contains a table and several chairs, placed throughout the space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "clock", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 473118, "caption": "In the image, there is a person on a skateboard performing a trick, likely riding a skate ramp, as they are in the process of riding up the ramp. The person is in the center of the scene, with their back towards the right side of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 391222, "caption": "The image features two people in winter clothing, possibly skiing or wearing cross-country skiing equipment, standing in the snow and looking at a sign", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 416104, "caption": "According to the picture, there is a large group of people seated outside at various tables under blue umbrellas, possibly to shield them from the sun. Some of the people are eating, likely with food in front of them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 511852, "caption": " Two people are skiing down a snowy hill in the image, with two sets of skis on the snow-covered slope. The first person appears on the left side of the hill, while the second person is closer to the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 59386, "caption": "The image features a group of giraffes standing together in a grassy field. There are five giraffes visible, some standing close to each other, while others are more spread out across the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 540180, "caption": "According to the picture, there is a traffic light with a green smiley face in the middle of a road located in a city street. The light is positioned on the side of the road next to a building, and is easily visible in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 153692, "caption": " The image depicts a man on a bicycle with the dog, passing through an intersection. They are in the middle of a busy street, surrounded by traffic lights, while crossing the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bicycle", "traffic light", "car", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 265628, "caption": "According to the picture, there is a toilet visible in a small room under a circular mirror. The toilet takes up a significant portion of the space, occupying roughly half of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "toilet", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 499594, "caption": " The image features a dinner meal with meat, fish, and vegetables on a white plate. The meal includes meat, broccoli, and cheese", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "bottle", "knife", "broccoli", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 5934, "caption": "The image features a baseball and a baseball glove sitting on a bench in the dugout. The baseball is positioned close to the glove, suggesting that it might have just been caught or thrown in the game", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47828, "caption": "In the image, there are several people walking along the side of a water body. The water body, likely a river or canal, is surrounded by a variety of buildings and bridges, and some people are walking near the water", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 47916, "caption": "The image depicts a grassy area filled with several dogs of different sizes, all enjoying the playtime together. One dog is in front of the group, playing with a red frisbee, while a second dog is farther behind and further to the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "car", "truck", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 255483, "caption": "The image is of a person sitting at a table with various pieces of paper laid out in front of them. There are several books and sheets on the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["scissors", "dining table", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 131138, "caption": "The image displays a workspace with various computer and other electronic devices placed on a desk. There are two monitors \u2013 a computer monitor and a television \u2013 both placed on the desk", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "spoon", "mouse", "keyboard", "person", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 258209, "caption": "A man is sitting on a bench in front of a building. The bench is located in the middle of a sidewalk, while the man appears to be sitting on the edge of the sidewalk", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 347453, "caption": "The scene captures a busy city street filled with multiple red buses and a taxi. The buses are parked at various points on the street, while the taxi is positioned near the middle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bench", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 285820, "caption": "In a large room with windows, a couch is positioned near one of the windows. The couch is brown and leather-covered", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bowl", "dining table", "couch", "book", "bench", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 245664, "caption": "In the image, there is a desk with a laptop and a computer set-up on it. The laptop is placed towards the right side of the desk, while the computer is situated near the left side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "wine glass", "book", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 453731, "caption": "According to the image, there is a sleeping dog on the floor in front of a large wooden barrels, possibly wine barrels. The dog is in a relaxed position, taking a nap", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "dog", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 524601, "caption": "The image features a group of young people gathered in a large room, actively playing a video game. They are spread out across the room, some standing closer to the camera and others a bit further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "chair", "couch", "book", "person", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 253262, "caption": "The image features a small white plane parked on the tarmac near a group of people. There are three people, two men and one woman, standing near the aircraft", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "airplane", "person", "truck", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 87052, "caption": "In the image, a man is looking at a refrigerator in a kitchen. He is wearing a gray sweatshirt", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "vase", "oven", "bottle", "person", "refrigerator", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 357578, "caption": "In this image, a red fire truck is driving down a street past the car and truck. There are several cars in the scene, including one near the front of the car, one partially visible to the left of the car, another one closer to the right side of the car, and two more towards the back of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 418692, "caption": " The scene depicts a living room with a couch, a chair, and a lamp. There are two couches in the room, one positioned closer to the left side and the other farther to the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "remote", "chair", "couch", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 259345, "caption": " A man in a colorful dress is sitting under a pink umbrella, holding it open to provide shade. He is sitting on a bench and there is a second bench in the background, possibly indicating an outdoor setting", "mscoco_hallucinated_words": [["bench", "bench"], ["bench", "bench"]], "mscoco_gt_words": ["chair", "person", "umbrella"], "hallucination_idxs": [25, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.6666666666666666}}, {"image_id": 119709, "caption": "According to the picture, there is a woman standing in a kitchen, actively involved in the baking process. She appears to be preparing or working on some donuts, arranging them on a dining table nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "person", "donut"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 443299, "caption": "According to the picture, there is a man sitting on a bench with a cell phone in his hand, and a woman sitting next to him. Both individuals are wearing caps, which suggests that they might be from an Asian culture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 120776, "caption": "The image captures an outdoor scene with a large group of people gathered near a modern building and a couple of tall umbrellas. The people are standing in various locations, some closer to the street and others further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411765, "caption": " The scene is set on a tennis court, where a man is playing tennis. He is holding a tennis racket, swinging to hit a ball that is in the air", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": " A person in a blue shirt is standing on the sidewalk in front of a row of shops, including a large shopping mall and a smaller shop nearby. The street is a bit dark and there are some streetlights on it, providing light to the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bicycle", "handbag", "traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 276719, "caption": " The scene takes place on a street with a dog, surrounded by various cars. The dog is walking across the street, leading the group of people who appear to be following the dog", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "truck", "chair", "handbag", "dog", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 422328, "caption": " The image features a horse pulling a large sack or bag filled with snow while wearing a blanket. The horse appears to be outside near a building or a house, possibly on a path or a road", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "horse", "skis", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 354559, "caption": "The image depicts a woman standing in front of a white wall, possibly taking a selfie or a photo. She is wearing a black and white striped shirt and has her long brown hair held back with braces, giving her a stuffed animal a closer look", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 438788, "caption": "The image features a cake placed on a table, with two pieces of it missing. There are two knives, one placed near a piece of the cake, and the other knife close to the other piece of cake", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 468005, "caption": "The image features a large elephant, possibly an Indian elephant, with its trunk open as if it's about to spray water. It's walking down some steps that are located near a body of water", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "elephant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 545594, "caption": "In the image, a group of people is seen riding a public bus, with a woman wearing a backpack in the front row and multiple people sitting behind her. Some of them have handbags and backpacks, and there are a few people sitting or standing on the right side of the bus", "mscoco_hallucinated_words": [["handbag", "handbag"]], "mscoco_gt_words": ["chair", "orange", "backpack", "person"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.5}}, {"image_id": 219546, "caption": " The image showcases a table covered with various types of food, including a bowl with meat, another bowl with different types of vegetables, and a bottle of red wine. The table is spread out across a dining room and appears to be set for a festive event or a special occasion", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "spoon", "bottle", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 386645, "caption": "The image features a view of a road with a building on the side, as well as a few vehicles. A truck is positioned in front of the building, and a few cars can be seen driving along the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 155131, "caption": "The scene takes place on a sports court where a woman in a white outfit is making a save while playing tennis, using a tennis racket in her hand. She is in the center of the court, while various people are scattered around the area, watching and possibly waiting for their turn to play", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "sports ball", "bench", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 188752, "caption": " The scene features a banana stand with numerous bananas hanging from the ceiling and a person standing nearby. There are numerous bananas displayed throughout the market, with some arranged in bunches and others hanging individually", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "apple", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 228350, "caption": " A group of people are sitting at tables outside under a series of red umbrellas. Some are near the edge of the patio, while others are in the center", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "chair", "tie", "wine glass", "umbrella", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 437720, "caption": "According to the image, the painting shows a nude person sitting at a table in a room with a red chair. There is a painting on the wall, and a vase and some books are displayed on a shelf in the background", "mscoco_hallucinated_words": [["book", "book"]], "mscoco_gt_words": ["vase", "dining table", "chair", "person", "laptop"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.8}}, {"image_id": 378831, "caption": "According to the picture, there is a woman walking down a street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "fire hydrant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 375503, "caption": "According to the image, the scene is an upstairs view of a house, possibly a home or a residence, with a wooden staircase leading to the upper level. There is a room on the left side of the image and another on the right side, both featuring ornate wooden staircases and ornate wallpaper", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "clock", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 330369, "caption": "The image features a red and white boat docked in the water near the shore. The boat appears to be in the foreground of the scene, with the ocean stretching out further into the distance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 288576, "caption": "According to the picture, there is a man standing near a tree stump, with a frisbee thrown in front of him. Another person is also visible in the background, possibly watching the frisbee being thrown or catching it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 32190, "caption": "A toy horse is on the sidewalk in front of a street corner, near a curb. There are several cars parked or driving on the street, some close to the horse, and a few further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 535403, "caption": "A person has taken a photo in a bathroom, capturing the sink, mirror, and several bottles arranged on a counter. The bathroom features a combination of clean white walls and tiled surfaces, creating a unique look", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["potted plant", "clock", "bottle", "sink", "hair drier", "toilet"], "hallucination_idxs": [1], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 449798, "caption": "According to the picture, there is a white computer placed in the center of a room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "mouse", "keyboard", "book", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 375769, "caption": " There is a young boy eating at a restaurant, possibly on a street. He is sitting at a table with a drink near him, surrounded by numerous chairs", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bicycle", "chair", "hot dog", "umbrella", "car", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 426172, "caption": " The image depicts a dining room setting with a table, various dishes, and a countertop display of wine bottles. The dining room is furnished with chairs, and a TV can be seen in the background", "mscoco_hallucinated_words": [["tv", "tv"]], "mscoco_gt_words": ["dining table", "chair", "bottle", "knife", "book"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6}}, {"image_id": 20650, "caption": "The image showcases a meal consisting of a sandwich placed on a colorful plate, surrounded by a mix of vegetables, including broccoli, which is prominently featured in the dish. There is also a fork on the plate, which might be used to eat the sandwich and its accompaniments", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "sandwich", "orange", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 138078, "caption": "The image features a young man in red and white baseball uniform standing on a grass field. The boy is wearing a baseball glove and appears to be focused, likely waiting for the ball to be thrown", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 152499, "caption": "In the image, there is a family gathering with several people, including children, around a pool filled with water. Some of the people have drinks and a party hat, suggesting they are enjoying a fun occasion together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "dining table", "potted plant", "chair", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 180764, "caption": "The image depicts a man sitting in a kitchen, taking a bite of a pink glazed donut. The donut is positioned in front of the man, highlighting the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "microwave", "person", "donut"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 147762, "caption": "This image is a close-up view of a plate, likely to be found on a table, with a piece of food or dessert topped with a yellow substance", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "fork", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 113860, "caption": "According to the picture, there is a boat floating down a river. The boat is very visible in the center of the image, occupying the main area of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 460866, "caption": " The image depicts various toothbrushes in a cup, which is filled with them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "toothbrush", "sink", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 177015, "caption": "The image depicts a man sitting on a couch, with a bearded man sitting on his lap holding a small cat. They are both looking at a laptop, with the cat resting on the man's lap while he uses the laptop", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "chair", "couch", "person", "apple", "laptop", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 333998, "caption": "The image portrays a small and cozy kitchen with white appliances, including a refrigerator to the left and a microwave oven located above it. There is also a sink located on the opposite side of the room, just behind the refrigerator", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "oven", "bottle", "sink", "microwave", "person", "suitcase", "refrigerator", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4444444444444444}}, {"image_id": 47131, "caption": "The image depicts a black and white photo of an overturned train, likely a passenger train or freight train. Many people, some of whom appear to be in their 20s or 30s, are gathered around the overturned train, which is located on a grassy area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425447, "caption": "A large white truck is driving down the street, spraying water near a tree. The truck is parked in the middle of the street, blocking traffic as it performs the cleaning task", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fire hydrant", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 295105, "caption": " The image captures a scene of a man standing in a kitchen", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "oven", "bottle", "sink", "person", "apple", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 443259, "caption": "The image shows a large kitchen with a stove and refrigerator situated at the back of the room. There is a dining area, consisting of several chairs placed around a wooden table with a vase on top", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "oven", "chair", "bottle", "sink", "orange", "refrigerator"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4444444444444444}}, {"image_id": 537861, "caption": "The image features a train station, with a large train parked on the tracks near the station. Two other trains can be seen in the scene; one parked closer to the end of the train track, and the other partially visible further down the track", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 145020, "caption": "The image shows a toy donald duck wearing a christmas sweater next to a large group of people, possibly at a disneyland attraction", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "mouse", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 249969, "caption": "The image features a plate filled with a variety of food items, including green beans, carrots, turkey, and potatoes. There is also some meat, likely chicken, visible on the plate", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "knife", "wine glass", "broccoli", "carrot", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 146504, "caption": "In the image, a person is sitting in front of a table or desk. They are holding their hand on top of a computer keyboard and using a computer mouse, positioning themselves near the keyboard", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "person", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 40037, "caption": "This is an image of a clock tower, which is a distinctive building with a clock on top. It's tall and has a steeple on top, which is a common feature of churches and some other buildings", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 343218, "caption": " The scene is set on a tennis court with a man standing on the court holding a tennis racquet. He appears to be getting ready to hit the tennis ball, as it is near him in the air", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "car", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 301837, "caption": "The image displays a long narrow kitchen with a skylight above. There are two refrigerators, one placed to the left of the kitchen and the other closer to the middle of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "clock", "sink", "apple", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 309071, "caption": " The image captures a man, possibly a boy, leaning down to touch the pedals of a bicycle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "potted plant", "bicycle", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 240681, "caption": "According to the picture, a man with a backpack is standing next to a red motorcycle. The motorcycle is located on the side of a road and appears to be parked", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 178753, "caption": " The scene depicts a young girl, possibly a child, sitting in a red chair in an amusement park or stadium, holding her cell phone. She appears to be using it while sitting down in a seating area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278890, "caption": " The image depicts a person holding a cellphone open in their hand. The cellphone appears to be placed on a table or surface, as the person's hand is not entirely visible", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 47781, "caption": "There is an image of a bedroom featuring a modern bed with a white bedspread and red blanket. The room is also adorned with a couple of nightstands", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "bed", "book", "person", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 542792, "caption": " The image features a large brown stick, which is prominently displayed in the middle of a street, surrounded by buildings and other city elements. The brown stick resembles a baseball bat and is located near the edge of the road, partially blocking traffic", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 293703, "caption": " There are three people sitting around a dining table at a restaurant. One man is sitting on the left, another man is seated in the center, and a third person is on the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "chair", "bottle", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 318777, "caption": " The image features a baseball game being played in a large stadium filled with spectators. The action is taking place on a grassy field, with several baseball players scattered across the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "chair", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 14773, "caption": "There is a unique hot dog dish served on a plate, which features a replica car filled with various toppings. The plate is placed on a table in the center of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "hot dog", "bottle", "car", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 521863, "caption": "The image captures a skateboarder riding his skateboard in a busy area. There are several people in the vicinity, walking or standing around, and multiple bicycles can be seen in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 250540, "caption": " The image depicts a group of people sitting on motorcycles, occupying both sides of a sidewalk during a rainy day. There are three motorcycles in total, and several people are riding them, including a man and woman closer to the front and another group further back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "motorcycle", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 211604, "caption": " In the image, there is a woman laying on the floor, holding a tennis racket in her hand. She is wearing a hat and has two additional hats positioned behind her, suggesting a group setting or a gathering", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 298067, "caption": "This is a picturesque image featuring a combination of food and drink. The main image is a plate with food on it, including meat pieces and a beverage", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "dining table", "bowl", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 563938, "caption": "The image portrays a scene with a man and a woman sitting on a bench or couch. Both of them are wearing sunglasses, and there are some cups and bottles visible in the background", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["tie", "couch", "bottle", "bench", "person"], "hallucination_idxs": [29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8}}, {"image_id": 461311, "caption": " There is a large crowd of people standing in a field, with many people flying kites of various sizes. The kites fill the entire sky, with some kites being higher and others lower in the air", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "kite", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 47940, "caption": " The image is of a person sitting on a bed with their legs and feet resting on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "laptop", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 261318, "caption": " In the image, a man is sitting on a red chair, using a cell phone while leaning against a bench. Near him, two large suitcases are positioned in close proximity to one another, one on the left and another on the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "handbag", "bench", "person", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 134768, "caption": "The image features a keyboard with speakers placed near the left side. The speakers occupy the top right and bottom right corners of the picture, while the keyboard is located towards the top left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["mouse", "laptop", "keyboard", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 526414, "caption": "The image depicts a small room with a dining table situated in the middle of the room. A sewing machine, a television, a couch, and a computer are present in the room, contributing to the converted living space turned workspace atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["scissors", "bowl", "dining table", "cat", "remote", "couch", "book", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 273132, "caption": " In the image, a woman is sitting on a park bench, resting and relaxing. She appears to be wearing a hat, and there are several bags placed around her", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 505213, "caption": "The image shows a man holding a hot dog with several toppings, such as lettuce, ketchup, and mustard, as he stands in front of a brick building. The man seems to be enjoying his meal, making for a fun and entertaining scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "sandwich", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 192656, "caption": " The image depicts a baseball game, with multiple people visible on the field. A baseball player is seen standing near the center of the field, getting ready to throw the ball to a specific spot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 272728, "caption": " The image features a large, potted flower arrangement in a beautifully decorated, multi-tiered glass vase", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 509811, "caption": " The image shows a school room or library filled with various furniture, including desks and chairs arranged in a couple of rows. The tables are adorned with numerous laptops placed in different spots, creating an environment of learning", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "book", "apple", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 412407, "caption": "According to the picture, several people are present at the event or gathering, with one person holding a cell phone close to their hand, which has a black screen with white pixel squares on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 353658, "caption": "The image shows a woman standing in the rain under an umbrella. There are several other people in the background, likely walking or passing by on the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "suitcase", "handbag", "umbrella", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 441363, "caption": "The image captures a tennis player swinging his racket on a red tennis court, possibly practicing or engaged in a game. There are two other people on the court, likely his teammates or opponents, who are also watching the action", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 331084, "caption": "The image displays a dining room table filled with a variety of pizzas, vegetables, and other toppings. There are two large, tray-style pizzas on the table, with the first being cut in half", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "spoon", "knife", "person", "pizza", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 282553, "caption": "According to the image, there are two women walking down a dirt road in a field. The women are walking in opposite directions, with one closer to the viewer and the other further away from the viewer", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 328818, "caption": "According to the picture, the scene shows a woman in pink shirt and blue jeans standing by a park bench. She is leaning over the bench, tying her shoelaces as she prepares to get dressed", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "bottle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 3103, "caption": "A young man wearing a yellow helmet is performing a trick on a skateboard, possibly a ramp. There are several other people in the background, some of them seem to be watching the skateboarder while others might be waiting or standing around", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "The image features a very empty kitchen with a few appliances and a table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "sink", "orange", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 449675, "caption": " The image features a dining table with a piece of food, specifically a sandwich, and a bottle on it. There's a fork nearby, with one end protruding from the plate, indicating that the sandwich has been partially consumed", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "bottle", "sandwich", "knife", "carrot", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 191691, "caption": "According to the picture, it features a man in a bathroom standing in front of a mirror, getting ready for his day", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toothbrush", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 89378, "caption": "In the image, there are two motorcycles \u2013 one in front of the other", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "backpack", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 99874, "caption": " The image features a woman sitting at a table in a cafe, using her cell phone. She has sunglasses on and is talking on her phone, possibly taking a call or using it to communicate with someone else", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "cat", "bottle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 437895, "caption": "According to the picture, the image depicts a large bed with a canopy above it, making it a four-poster bed", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557447, "caption": " The image depicts a man wearing a suit and tie sitting at a table with his hand pointing towards the left side. There is another person sitting behind him in the background, and another person is partially visible further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "tie", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 159979, "caption": " The image depicts a busy harbor scene with various boats docked on the water", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "bowl", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 357, "caption": "The image depicts a baseball game in progress, with several players on the field", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 538891, "caption": " The image features a woman in a white shirt standing on a tennis court. She is holding a tennis racket in her hand, getting ready to hit the ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "Scene description", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "traffic light", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 565778, "caption": "According to the picture, there is a train passing by on the train tracks. The train appears to be blue and long, traveling from right to left through the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 7125, "caption": "According to the image, there is a bike path along the road by the water. The bike path extends for some distance and a group of people are riding their bicycles on this path, enjoying a leisurely ride", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bicycle", "bottle", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 221350, "caption": " The image depicts a man wearing a red tie in front of a large window. The tie has hearts drawn all over it, possibly written by the person wearing it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "tie", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 280930, "caption": "In the image, there is a young girl standing in a kitchen, laughing while holding a pan filled with baked dough. She appears to be enjoying her experience of cooking or baking", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "person", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 151081, "caption": "This image showcases an airplane flying above a field as it drives slowly near the edge of the road. There is a tractor working on the ground in the same area, near the middle of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 122232, "caption": "A person is standing on a green tennis court wearing a yellow raincoat and a hat, holding a tennis racket in each hand. There are eight tennis balls around them scattered across the court", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": "The image features a white platter with a meal consisting of a hot dog and fries. The hot dog is served with ketchup and two bite-sized fries are placed alongside it, creating a satisfying and tasty meal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "broccoli", "hot dog", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 198163, "caption": "The image features a man in a gray-blue jacket sitting inside a damaged building, surrounded by numerous small plates filled with different colors of paint. The man is focused on the plates, examining them closely, as if he is working on a project or painting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 3742, "caption": "The scene features a vase filled with different kinds of apples arranged in a beautiful pattern. The vase is placed against the wall, and some of the apples can be found both inside and outside the vase", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "apple", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 217593, "caption": "The image shows a young boy in a baseball uniform, wearing a baseball glove and a baseball cap. The boy appears to be ready to throw a baseball or play the position of the catcher", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "chair", "handbag", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 279078, "caption": "The image depicts a young person standing in front of an open refrigerator filled with bottles and beverages. There are various bottles of various sizes on the refrigerator's shelves, suggesting a well-stocked fridge", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "refrigerator", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": " The image features a tennis match being played on a court with multiple people on both sides. Two tennis players are on the same side of the net, with one of them holding a tennis racket while standing close to the right side of the court and the other one holding a tennis ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 477016, "caption": " The image depicts a group of people in a small kitchen area, standing in different positions around a toy oven. A mother and her child are the main subjects in the scene, while the other people in the area are also attentively looking at the oven", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "potted plant", "oven", "sink", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 301266, "caption": "In the image, there are four people standing at the table, actively engaged in working together. There are three baskets at the table filled with various items, such as apples and plastic bottles", "mscoco_hallucinated_words": [["apple", "apple"], ["bottle", "bottle"]], "mscoco_gt_words": ["fork", "dining table", "spoon", "sink", "book", "person", "cup"], "hallucination_idxs": [33, 36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.2857142857142857}}, {"image_id": 475813, "caption": "A young child is standing in a room, possibly in their bedroom or another area, wearing a nightgown and holding a bottle. There are two cell phones nearby, with one held by the child and the other on the left side of the scene", "mscoco_hallucinated_words": [["bottle", "bottle"], ["cell phone", "cell phone"]], "mscoco_gt_words": ["handbag", "remote", "person"], "hallucination_idxs": [23, 28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 558784, "caption": "According to the picture, there is a man walking along a path next to a pond, with a long stick in his hand. The man is standing in the grass, enjoying the outdoors", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 308543, "caption": "In the image, a person's hand is holding a pair of silver spoons, with one spoon in front of the other. The spoons appear to be large and in close proximity, suggesting they are being handled or manipulated", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 359164, "caption": "In the image, there are two dogs, one brown and one white, enjoying a sunny day playing outside on the grass. The brown dog is the bigger of the two, and the white dog is smaller", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "dining table", "dog", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 539951, "caption": "The image features a toy kitchen that has a checkerboard floor and a fridge to the left. There are various kitchen appliances in the scene, including an oven to the right and a microwave in the middle", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["refrigerator", "clock", "sink", "oven"], "hallucination_idxs": [35], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 143479, "caption": "In the image, an elderly man is holding a tray of hot dogs, with some on the left side and others scattered around the tray. He is wearing a tie and a white shirt", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["chair", "hot dog", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 475024, "caption": "The image features a vase filled with different colored and arranged flowers sitting on a table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 238065, "caption": "The image captures a young man on a skateboard, riding on top of a skateboard ramp, possibly performing a trick or simply enjoying the ride. He is dressed in a blue shirt and blue jeans, and the skateboard is situated in the middle of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 314016, "caption": "According to the picture, a small bathroom contains a toilet with a sign \"Magic Eye\" next to it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toilet", "sink", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 322163, "caption": " The image depicts a group of people in a kitchen area, with one person focused on making a pizza and others standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "remote", "oven", "bottle", "person", "pizza", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 230983, "caption": " The image captures a person on a skateboard performing a trick, in mid-air while jumping over a green bottle. The skateboard appears to be the main focus of the scene, and the person riding it is captured in a dynamic, exciting pose", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["skateboard", "car", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 564629, "caption": "According to the image, there is a kitchen located in a mobile home. The kitchen features a wooden table with two chairs on the left side and a black counter on the right side", "mscoco_hallucinated_words": [["table", "dining table"], ["chair", "chair"]], "mscoco_gt_words": ["oven", "refrigerator", "sink", "microwave"], "hallucination_idxs": [20, 23], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 53431, "caption": " In the image, a man is sitting on a chair, using a laptop. He has a t-shirt and a pair of shoes, and a pair of trousers with the pants legs folded up and tied around his legs", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "laptop", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 441840, "caption": "The image captures a young child swinging a baseball bat and holding it with confidence on a field. There are several people present in the scene, including another child standing near the little boy, as well as an adult in the background, possibly watching over them or enjoying the action", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "baseball glove", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 53465, "caption": "In the image, there is a festive cake with orange writing on it, sitting on a plate on a table. The cake appears to have a message written on it, which suggests a welcome or personalized theme", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["fork", "bowl", "dining table", "spoon", "cake", "person", "cup"], "hallucination_idxs": [10], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.2857142857142857}}, {"image_id": 381031, "caption": "There is a table that's been set up with different food items. The table features a bowl filled with multiple apples, a plate with two slices of bread on top of it, a cup, and a bottle", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["dining table", "donut", "spoon", "bottle", "microwave", "apple", "cup"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.5714285714285714}}, {"image_id": 530520, "caption": "In the image, a group of men is sitting on the lawn using laptops, with one man looking at his laptop on the grass and two others around him. There are a few potted plants located in the scene, one at the top-right corner, another at the center-left, and a third one near the top", "mscoco_hallucinated_words": [["potted plant", "potted plant"]], "mscoco_gt_words": ["backpack", "motorcycle", "clock", "person", "laptop"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.4}}, {"image_id": 539555, "caption": "The image features a man in an action pose, trying to catch a frisbee in mid-air. He is the focus of the scene as he leaps towards the disc with both hands extended", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 279521, "caption": " The image features a red train traveling along a railroad track. Three people are standing close to the train, with one person in the foreground and two others further back in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "train", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 560439, "caption": " The image depicts a group of three people snowboarding down a snowy hill. They are wearing snowshoes and are all close to each other while enjoying their activity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "backpack", "skis", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 248353, "caption": "The image portrays a snowy ski slope where several people are skiing or snowboarding. There is a group of at least 13 people gathered together on the slope, with various ski and snowboard brands scattered around the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 9628, "caption": "The image captures a thrilling water skiing scene, with a man on water skis and the skis attached to a power boat in the background. The man appears to be flying through the air, likely doing a trick or landing on the water after a high jump", "mscoco_hallucinated_words": [["ski", "skis"], ["ski", "skis"]], "mscoco_gt_words": ["boat", "surfboard", "person"], "hallucination_idxs": [14, 17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 330050, "caption": "According to the picture, the man and woman are standing on a tennis court, both holding tennis rackets. They appear to be enjoying each other's company while posing for the camera", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254161, "caption": " The image depicts a man standing on a street or plaza and flying a kite, with several buildings visible in the background. The man is wearing a red and white shirt and appears to be enjoying the outdoors, possibly in the middle of a park or public space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 275749, "caption": "A group of people are gathered around a white table in a room, sitting and standing in various positions, possibly chatting and catching up with each other. There are four individuals, with three standing and one sitting at the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "mouse", "person", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 163518, "caption": "In the image, a woman is working in a kitchen environment", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "oven", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 156696, "caption": "The image is a black and white photograph of a train station, likely located at the platform. There are two trains present in the image, one at the left end of the station and the other on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "train", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 525202, "caption": " The image showcases a large pizza placed on a pizza pan, sitting on a table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "spoon", "bottle", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2222222222222222}}, {"image_id": 423984, "caption": "The image depicts a plate of food that contains pasta and vegetables. The plate features a variety of pasta, carrots, and carrot pieces, with some bread placed around it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["carrot", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 499486, "caption": " The image portrays two cats sitting on a bed, looking tired, worn out and possibly exhausted after their day on the bed. The cats are positioned next to each other, almost touching", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 118124, "caption": "The image captures a man wearing glasses and a tie in a vehicle. He is smiling and appears to be enjoying himself while traveling in the car", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 288729, "caption": " The image depicts a cake that is shaped like a small circus tent or a big top. The cake is covered with a yellow frosting and is sitting on a table", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["teddy bear", "cake", "bear", "person", "elephant"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.2}}, {"image_id": 177721, "caption": " The image captures a man on a skateboard, performing a trick in the air, likely at a skate park. He is positioned in the center of the picture, balancing himself while maintaining the position of his skateboard, making it a captivating display of skill and coordination", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 372220, "caption": "The scene depicts a large group of people standing together in an outdoor setting, with several men and women present. One of the men, the central figure, is a tall man with a long beard and gray hair, wearing a priest costume", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "cell phone", "horse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 252625, "caption": "The image captures a quiet city street lined with tall buildings on both sides, creating a narrow alleyway with a sidewalk on the side. Several bicycles are parked near the sidewalk, as well as a couple of motorcycles, all likely to be ridden in the area", "mscoco_hallucinated_words": [["motorcycle", "motorcycle"]], "mscoco_gt_words": ["bicycle", "traffic light", "potted plant", "person"], "hallucination_idxs": [40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.25}}, {"image_id": 493610, "caption": "According to the picture, there is a large group of people gathered on a dirt road in front of a dirt-covered blue truck. The truck appears to be quite old and has a large back area with bench seats, creating a space for several people to sit", "mscoco_hallucinated_words": [["bench", "bench"], ["seat", "chair"]], "mscoco_gt_words": ["bus", "car", "person", "truck"], "hallucination_idxs": [39, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 302789, "caption": " The image depicts a small room with a bed and suitcase placed in the corner", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bed", "bottle", "sink", "handbag", "suitcase", "refrigerator"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 30667, "caption": " A person sits with their legs crossed, enjoying a peaceful moment on a bench. Sitting next to them is a brown dog that seems to be relaxing as well", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["boat", "dog", "person"], "hallucination_idxs": [14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 114108, "caption": "The image features a table with a plate on it, which holds a pizza with two slices of mozzarella on it. There are several people surrounding the table, some of them close to the pizza and others further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "spoon", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.375}}, {"image_id": 328110, "caption": "The image features a woman holding a baby on a wooden bench. Another person is present in the scene, but it's unclear what they are doing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 366111, "caption": " In the picture, several people are riding bicycles down a street. They are wearing hats, including a couple of hats on one person, and one person wears a tie", "mscoco_hallucinated_words": [["tie", "tie"]], "mscoco_gt_words": ["bicycle", "traffic light", "car", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 532481, "caption": " The image shows a person airborne in the ocean with their parasail out, next to a pair of boats. There is also another parasailer further in the distance, to the right of the main person", "mscoco_hallucinated_words": [["boat", "boat"]], "mscoco_gt_words": ["surfboard", "kite", "car", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 291634, "caption": "The image shows a row of motorcycles lined up along a street or sidewalk, near the sidewalk with several people standing around them. The motorcycles appear to be parked as they line the street, creating an interesting sight for passersby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463678, "caption": " The image shows a young male tennis player on a tennis court, actively engaged in the game. He is holding a tennis racket and is preparing to hit a tennis ball, which is in his vicinity", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "In a black and white image, a small dog is walking on a leash along the sidewalk, next to a bicycle and a couple of people. The bicycle can be seen in the background, while the people are further away from the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "dog", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 367872, "caption": "The image captures a baseball game in progress, with multiple players on the field. There are several players scattered around the field, with one of them swinging at the ball with a baseball bat", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "sports ball", "umbrella", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 353898, "caption": "In the image, a man is sitting on a public bench surrounded by a large group of birds, primarily white doves. The birds are spread out in various positions across the scene, with some located in the background, others on the ground, and a few close to the man sitting down", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "bench", "person", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 88269, "caption": "There's a delicious dish of grilled cheese sandwiches and tomato soup on the menu. On the plate is a large bowl of soup and multiple slices of grilled cheese sandwiches", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "bowl", "sandwich"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 463211, "caption": "The image features a young girl, possibly in a backpack, walking towards the left side of the picture and another girl further to the right. The yellow school bus can be seen in the background, parked or traveling on the road", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": "The image features a beautiful arrangement of flowers in a vase. The vase sits on a table in front of a sofa, creating a luxurious and ornate setting", "mscoco_hallucinated_words": [["sofa", "couch"]], "mscoco_gt_words": ["chair", "vase", "dining table"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 98979, "caption": "The image features a microwave in the kitchen, sitting on a countertop. The microwave is silver in color and has a front-facing window", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "knife", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 244455, "caption": "In the image, there are two men riding motorcycles down a road. They are positioned side by side, with one rider on the left and the other on the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 379443, "caption": "The image features a large black holey piece of fabric laid out on a table, with a pair of scissors situated near the center of the table, creating a fun and playful atmosphere. The pair of scissors is closed and positioned close to the black fabric, giving the illusion of a scissor-style hand gesture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["scissors", "dining table", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 536154, "caption": "According to the image, a young boy is sitting on a bed surrounded by a large number of stuffed animals", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["teddy bear", "cat", "person"], "hallucination_idxs": [12], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 374061, "caption": "According to the picture, there is a black cat sleeping on a bed. The cat is holding a stuffed animal tightly and cuddling with it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "cat", "bed", "couch", "bear"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 560018, "caption": " The scene features a large group of people surfing and having fun in the waves at an outdoor location. There are at least eight surfers, some with boards and others without, scattered throughout the ocean", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 353320, "caption": " The image shows a guitar case placed on a bench, surrounded by a few pieces of luggage. There are several bags in various sizes and positions around the bench, including some near the guitar case and others slightly further away", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 256628, "caption": "According to the picture, a golden retriever dog is laying down on a wooden table top. It is positioned in the center of the table and its head is on the left side", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["chair", "bowl", "dog"], "hallucination_idxs": [15, 26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 158806, "caption": " In the image, there is a big brown dog standing and eating from a blue plate. The plate contains two slices of toasted sandwiches placed side by side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 424270, "caption": "A young child, possibly a toddler, is sitting in a bed in the middle of the image. The bed seems to be a queen-sized bed and appears to be on the left side of the picture", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tv", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368335, "caption": "In this image, a horse is walking towards a car in the middle of a parking lot. The horse is positioned on the left side of the image, crossing from the top left corner to the bottom center of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["horse", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 483227, "caption": "The scene is set on a tennis court with several young people standing on the court, each holding a tennis racquet. They are positioned close to each other, possibly in a group setting for a sports activity or team bonding", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 228306, "caption": "The scene features a large red bus traveling down a city street. It is a city bus with passengers on board, possibly on their way to reach their destination", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 416384, "caption": "The image features a young woman sitting in a white plastic chair and holding a cell phone, possibly texting or browsing the internet. She is wearing pink shorts and appears to be enjoying the outdoors in this hot summer day", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 364783, "caption": " The image depicts a small bathroom painted all red, from the walls to the toilet and sink. The bathroom features a toilet in one corner of the room, and a sink positioned next to it on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "chair", "bottle", "toothbrush", "sink", "toilet", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 335041, "caption": " In the image, a person in a red jacket is skiing down a snow-covered street. The individual is wearing snow skis and has his arms spread out as he moves down the street", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 531327, "caption": "The image features a plate of food on a dining table, filled with various types of vegetables. There are multiple pieces of broccoli scattered throughout the dish, and carrots can be found in a mixture with other vegetables", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "spoon", "knife", "wine glass", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 451228, "caption": " The image features a busy street with double-decker buses and people on bicycles. Some of the buses are in motion while others are parked", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "traffic light", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 256365, "caption": "The image features a large plate filled with numerous cupcakes with white frosting on them, decorated with blue whipped cream swirls. There are at least seven cupcakes on the plate, which covers most of the image's area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "dining table", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 502936, "caption": "The image captures an outdoor setting in a backyard, featuring a man playing tennis on a court. The man is holding a tennis racket, standing on the left side of the court, as he is about to swing and hit the ball", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "According to the picture, a small dog is lying next to a bookshelf in a bedroom or similar setting. The dog is situated at the center of the scene and appears to be resting in a dog bed, possibly taking a nap or sleeping comfortably", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 51054, "caption": " In a bedroom, a woman is sitting on a bed and talking on her cellphone. She is wearing a plaid shirt and her hair appears to be greying", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "bed", "bottle", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 1268, "caption": "According to the picture, there is a group of people, consisting of at least five people, standing near a bridge. They are positioned near a body of water that flows under the bridge", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "backpack", "boat", "handbag", "person", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 62692, "caption": " The image features a woman playing tennis on a tennis court. She is holding a tennis racket and appears to be in the middle of a match", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 542771, "caption": " The image depicts a large, spacious room with several chairs and tables arranged around various objects. The focal point of the room is a tall vase on the right side of the image, sitting on a table, and another vase in the foreground of the photograph", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "dining table", "potted plant", "chair", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 575406, "caption": "The image shows a blue and white bus traveling down a street with a red car parked beside it. The bus is driving on the road while the car is parked next to it", "mscoco_hallucinated_words": [["car", "car"], ["car", "car"]], "mscoco_gt_words": ["bus", "motorcycle", "potted plant", "person"], "hallucination_idxs": [15, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 1.0, "Recall": 0.0}}, {"image_id": 244834, "caption": "According to the picture, a couple of people are walking down a rain-soaked street holding umbrellas. Two umbrellas are visible, with one being large and white, while the other is smaller and red", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "bicycle", "clock", "umbrella", "handbag", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 151159, "caption": " Scene Description", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 277073, "caption": "In the image, a man and woman are riding a motorcycle together, possibly on a street in a city. The man is in the front, while the woman is riding on the back of the motorcycle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "motorcycle", "bicycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 377879, "caption": "In the image, there is a train traveling past a field, which features several military tanks of different sizes and styles on a flatbed train. Some of the tanks are positioned near the front of the train, while others are farther back or off to the sides", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bed", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 436814, "caption": "The image features a conference room containing various objects", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "mouse", "laptop", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 63934, "caption": "The scene is set in a city street, with several vehicles, such as a car, truck, and van, parked alongside the road. The street is empty except for the person on their skateboard who is riding the skateboard on the sidewalk", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 275611, "caption": "The image features a delicious Italian cheese pizza placed on a metal panning at a dinner table. It appears to be the center of attention, with a few people gathered around it, enjoying their meals", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["pizza", "dining table", "knife"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 516248, "caption": "A person is sitting at a dining table or similar area, holding a computer mouse with their hand, suggesting they might be interacting with the computer or possibly taking a break from work or other activities. The mouse is placed on the table in front of the person, while a computer keyboard is also situated near the person on the left side of the table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "mouse", "keyboard", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 478380, "caption": "The image shows a person driving a vehicle down a road with a herd of sheep following behind. The person is seated in the driver's seat and there are at least a dozen sheep in the scene, spread across the scene", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["sheep", "car", "person"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 422326, "caption": " The scene features a parking lot filled with various vehicles, including a motorcycle, a car, and two motorcycles with unique design elements. The motorcycle with an unusual form, like a cart, is parked next to a car in the center of the parking lot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 110449, "caption": " The image features two men, one on the right and the other on the left, sitting at a table with a remote control in front of them. Both of the men appear to be looking at the remote control, possibly trying to figure out how to operate it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "person", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 45550, "caption": "The image depicts a man, likely a chef or restaurant owner, standing in front of a table full of food. He is holding up a plate of deep-fried food, likely French fries, and he is smiling at the camera", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["clock", "bowl", "sandwich", "person"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.25}}, {"image_id": 444913, "caption": "According to the picture, a man is standing on a grassy area, looking up at a bird in flight while holding a red parasol. The surrounding environment is minimal, with no buildings or significant features visible in the area", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["car", "person", "frisbee"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 82709, "caption": " A black and white dog is captured in the middle of a lively scene, leaping into the air and catching a pink frisbee while it is in mid-air. The dog stands out among the people and benches, while a bottle is visible near the left edge of the image", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["backpack", "frisbee", "bottle", "dog", "person", "cup"], "hallucination_idxs": [38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 132796, "caption": "The image captures a man and two women riding on the back of an elephant, possibly in a park or other outdoor setting. The elephant is large and takes up most of the scene, with the people seated in the back riding area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "bench", "person", "elephant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 303099, "caption": "The image features two cats sitting on top of a toilet, appearing to be in a bathroom setting. One of the cats is on the left side, and the other is on the right side of the toilet", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "toilet", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 271565, "caption": " There are several motorcycles on a road in the image, with several riders on the vehicles. The road appears to be a residential street, possibly near a neighborhood or in a suburban area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 205193, "caption": " In the image, there is a brown granite countertop in front of an old-style toaster oven. A bag of green tea sits next to the toaster oven, while a black tea is nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "microwave", "person", "toaster"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 272566, "caption": " The image features two men sitting on a sidewalk next to baskets filled with bananas and other fruits. The bananas are placed in front of the men and are also scattered around the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "person", "apple", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 170099, "caption": "In the image, a man is seen wearing a red tie and standing in front of a couch", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "orange", "couch", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 14940, "caption": "According to the picture, it features a cat and a stuffed animal, likely a dog, cuddling together. The cat is resting on top of the other animal, with its face partially covered by the dog", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "chair", "couch", "orange", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 408449, "caption": " The image shows a man skateboarding down a city street. He is surrounded by several people in the area, with some standing near the street and others further back", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "handbag", "skateboard", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 140702, "caption": " The scene features two baby elephants walking through the grass. One baby elephant is near the left side, standing close to a small bird, while the other baby elephant is closer to the right side of the image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "bird", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 354540, "caption": "The image depicts a person sitting at a kitchen counter, using a laptop. In the background, there is a toaster oven and a microwave", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "oven", "chair", "person", "toaster", "laptop", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 19763, "caption": "The image features a close-up of a piece of deep-dish pizza on a plate. There are two pizza slices on the plate, with one slice located on the right and the other on the left", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "chair", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.14285714285714285}}, {"image_id": 191301, "caption": "The image features a plate with two desserts \u2013 a chocolate cake and ice cream \u2013 on a table or countertop. The desserts are accompanied by two silver spoons, positioned on either side of the dish", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "spoon", "cake", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 124236, "caption": "The image depicts a kitchen area with sunlight shining in through the window. Several bowls are arranged on a counter, and a table sits near the windows with some pots on it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "oven", "spoon", "toaster"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 344930, "caption": "The scene is on a city street, where a red fire hydrant is positioned on the side of the road. There are cars parked nearby, with a large truck closer to the fire hydrant and a smaller car further back", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["orange", "car", "fire hydrant"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 14285, "caption": " The scene features a bedroom with a desk and some books, a cat laying on a bed next to a dog, and a few other items in the room. There are two televisions present, one located towards the top left and the other at the top right of the image", "mscoco_hallucinated_words": [["desk", "dining table"], ["dog", "dog"]], "mscoco_gt_words": ["remote", "cat", "bed", "chair", "book", "tv"], "hallucination_idxs": [7, 21], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 82488, "caption": "A hotel room with two beds is featured in the image. Both beds are equipped with white comforters and yellow and orange pillows, creating a warm and welcoming atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "dining table", "bed", "clock", "book", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 348251, "caption": " The image features a person wearing skis standing in a snowy area. They appear to be smiling and enjoying their time in the snow", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 150703, "caption": " The image features a delicious-looking hot dog sitting on top of a table alongside various cups. A cup on the right contains a drink, possibly beer, while another cup and a bottle can be seen on the left side of the scene", "mscoco_hallucinated_words": [["bottle", "bottle"]], "mscoco_gt_words": ["hot dog", "dining table", "sandwich", "cup"], "hallucination_idxs": [34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.75}}, {"image_id": 381615, "caption": "This image features a window looking out onto a garden with a wide variety of plants. The scene displays multiple pots and several plants hanging in them, creating a lush atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "chair", "book", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 352703, "caption": "In the image, a woman is cooking a meal outside under a tent, as several people watch her and enjoy the food. The woman is in the foreground, actively stirring a pot in a frying pan, likely adding seasoning or finishing touches to the meal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "oven", "spoon", "bottle", "handbag", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 111109, "caption": "The image depicts a young boy posing for a photo, likely during a birthday celebration or an important event. The boy is wearing a tie and shorts, and he is standing on a stool", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "chair", "tie", "person", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 226802, "caption": " The scene is of an outdoor street red from being red, situated in an Asian city. A large group of people can be seen in various locations throughout the street, with some standing, walking, and riding vehicles", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 49005, "caption": "In a baseball game in progress with several players on the field, a baseball player is preparing to pitch the ball. He is standing on the mound with the ball in his hand, while several baseball players can be seen scattered around the field, including catcher, pitcher, and outfielders", "mscoco_hallucinated_words": [["ball", "sports ball"], ["ball", "sports ball"]], "mscoco_gt_words": ["baseball bat", "person", "baseball glove"], "hallucination_idxs": [21, 31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.3333333333333333}}, {"image_id": 410114, "caption": "The image features a large herd of cows and people walking down the road. There are numerous cows scattered along the road and surrounding it, with some closer to the right side and others in the middle or left side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 100274, "caption": " The image features a train on a railroad track traveling near a large parking lot, which is in the vicinity. There are several vehicles of various types parked in the parking lot, including a truck, a car, and another vehicle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "car", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317254, "caption": "The scene features a person wearing a motorcycle helmet riding on a motorcycle, and they are positioned in the middle of the street, surrounded by cars. A car is located to the right of the motorcycle, closer to the edge of the street, while another car is behind the first car, closer to the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 425569, "caption": " There is an image of a large commercial kitchen that appears to have multiple appliances and utensils. The kitchen is filled with silver metallic equipment and bowls placed around various areas of the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "sink", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 28842, "caption": "The image captures an outdoor skate park where a group of people is gathered, enjoying various activities. Two individuals are skateboarding down a slope, while others are watching or standing around the area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 319607, "caption": "A street scene with a traffic light atop the pole in the middle of the road is being viewed. There are multiple people riding bicycles around the traffic light, with some of them near the bottom of the pole, and others in the background", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "bus", "bicycle", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 210372, "caption": " The image depicts a small kitchen with red and white cabinets, a sink, and a window near the refrigerator", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bottle", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 156915, "caption": "The image features a large bouquet of flowers, with a red rose standing out in the center. There are other roses, white flowers, and greenery mixed throughout the arrangement, giving it a full and vibrant look", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "dining table", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 349642, "caption": "According to the image, it features a woman sitting on a wooden bench and a dog next to her", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": "The image captures a busy scene on the water, with numerous small boats docked near each other. Many people are standing on the boats, enjoying the view or conversing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "umbrella", "handbag", "person", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 45435, "caption": " The image shows a person holding up a cellphone in front of them, with the phone's display side facing towards the camera. It appears to be a newer model of cellphone, with a unique design featuring a mix of silver and black colors", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 35827, "caption": " The image displays a room with various computer equipment, including a laptop sitting on a desk, which is being operated by a man in a white t-shirt. The desk is equipped with a chair, and there is another chair located near the side of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "person", "laptop", "tv", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 72656, "caption": " The image features a group of three children, two boys and one girl, sitting together at two small dining tables in a playground setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "chair", "person", "pizza", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 372087, "caption": " The scene takes place at a train station, where a green and yellow train is on the railroad tracks. The train is occupying the majority of the image, stretching across the entire width of the track", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "bench", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 336209, "caption": "The image features a young man performing a trick on a skateboard, captured in the act of jumping while riding a skateboarding ramp. The ramp is placed towards the left side of the scene, with the man balancing himself by leaning back slightly", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "bench", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 46031, "caption": "The image displays a white table with various computer-related items placed on top of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "dining table", "mouse", "keyboard", "laptop", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.16666666666666666}}, {"image_id": 283118, "caption": " The image depicts a family gathering at a dining table with various food items, including pizza and hot dogs. There are three people in the scene, a man, a woman, and a child", "mscoco_hallucinated_words": [["hot dog", "hot dog"]], "mscoco_gt_words": ["cell phone", "backpack", "dining table", "chair", "bottle", "handbag", "car", "person", "pizza"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.3333333333333333}}, {"image_id": 429598, "caption": "This image features a kitchen with brick-patterned walls. The kitchen has a small refrigerator, a sink, a white oven, and a toilet in the back room", "mscoco_hallucinated_words": [["toilet", "toilet"]], "mscoco_gt_words": ["sink", "bottle", "refrigerator", "oven"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 333365, "caption": " This is a scene from an airport, featuring several airplanes parked on the ground and a couple of them on the runways. There are multiple large passenger airplanes parked across the scene, including some closer to the front and others farther back, while a few of them are on the runways", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 424174, "caption": " The image shows a large bowl of assorted fruit on a dining table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "banana", "orange", "apple"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 3035, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "airplane", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 177213, "caption": " The image features a delicious-looking pizza placed on a plate with a fork near it. The pizza appears to be freshly prepared, with melted mozzarella cheese on top, which gives it a white color", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "dining table", "knife", "person", "pizza", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 438535, "caption": "This image features a street scene with a large clock prominently placed in the middle of the street near some buildings", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 490711, "caption": "The image features a clean kitchen with a white refrigerator, a stove, a sink, a dishwasher, and various countertop items. The refrigerator is positioned near the center of the room, while the stove occupies the left side of the kitchen", "mscoco_hallucinated_words": [["stove", "oven"], ["stove", "oven"]], "mscoco_gt_words": ["sink", "banana", "refrigerator"], "hallucination_idxs": [12, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.6666666666666666}}, {"image_id": 163728, "caption": "The image features a sleeping cat lying on a table with other items, including a mouse and a backpack, present in the scene. The kitten appears to be curled up and quite comfortable in its position", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["dining table", "cat", "mouse", "laptop", "cup"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6}}, {"image_id": 456756, "caption": " The image features two giraffes, one adult and one baby, standing in a zoo enclosure. They appear to be facing each other and the baby giraffe is closer to the center of the scene, while the adult one is slightly off to the side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "giraffe", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 281028, "caption": " The image features a large empty train car with several blue seats arranged throughout the vehicle. There are three seats in the front of the car and four more seats located in the middle of the car, with the last two chairs towards the back of the vehicle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["train", "chair", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 343937, "caption": " The image features a snow-covered slope with a person wearing snow goggles snowboarding down the hill. The woman, wearing a white snowboard helmet, is dressed in a blue winter hat and pants", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 277051, "caption": "In the image, there are two small birds standing on a dining table. They are both positioned close to the edge of the table, one towards the left side and the other towards the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "knife", "bird"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 145666, "caption": "The image shows a large pot filled with a variety of vegetables, including broccoli and carrots. There is an abundance of both vegetables throughout the pot, covering much of its surface area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "broccoli", "dining table", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 304984, "caption": " The image shows a white plate topped with a sandwich and a bowl, likely filled with a side dish. There are also two spoons placed on the table, one on the left side and the other on the right side, possibly for eating or serving", "mscoco_hallucinated_words": [["spoon", "spoon"], ["table", "dining table"]], "mscoco_gt_words": ["fork", "bowl", "cake", "sandwich", "wine glass", "orange"], "hallucination_idxs": [25, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.3333333333333333}}, {"image_id": 122467, "caption": "The image features a close-up view of a person's hand holding a Nintendo Wii remote. This person is likely holding the remote while sitting on a couch", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "remote", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 518287, "caption": "According to the image, there is a toothbrush in a cup with its top half removed to resemble a face with a toothy grin", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "toothbrush", "dining table", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 145448, "caption": "According to the picture, there is a cat sitting on a tarp located on the floor of a room, in front of a motorcycle. A person can be seen sitting close to the cat", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "dining table", "cat", "chair", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 352900, "caption": "The dish features a mix of pasta and broccoli, prepared and ready to be served. The pasta is placed at the center of the plate, while the broccoli covers a significant portion of the dish", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "broccoli", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 153607, "caption": " The image features three identical glass vases, each with a different orange flower floating inside, creating a stunning visual display. These vases have the word \"inspiration\" engraved in them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["orange", "bowl", "dining table", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 157581, "caption": "The image features a long table topped with a variety of wine bottles, all empty, arranged in a line. The table appears to be a restaurant dining area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bottle", "wine glass", "person", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 227438, "caption": "The image depicts a bedroom with a large bed placed in the center of the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "couch", "clock", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 116037, "caption": "The image captures a baseball game in progress. In the foreground, a batter is swinging a bat, attempting to hit the baseball that is approaching", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "person", "sports ball", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 202658, "caption": "In the image, there is a white toilet sitting in the middle of a parking area. The toilet has a hello kitty seat on it, which adds a cute touch to the overall scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bowl", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 18150, "caption": "According to the picture, it is an image of a father and son sharing a piece of pizza. The father is sitting on the floor while the child sits beside him on the couch, possibly helping the son eat his slice of pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "couch", "bottle", "person", "pizza"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 418711, "caption": "The scene takes place in a dark alley where a man is pushing a bike, filled with several bunches of bananas. The bike has bananas hanging from the basket attached to the front and from the frame itself", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "bicycle", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 92107, "caption": "The image features a bowl of soup that appears to contain broccoli and corn. There is a piece of bread in the scene as well", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "bowl", "broccoli", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 491000, "caption": "According to the image, there is a long, narrow street filled with people walking, shopping, and socializing. Multiple individuals are walking on the street, some alone and others in small groups, as they engage in their daily activities", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "car", "person", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 401163, "caption": " The image depicts a park setting with two people sitting on a bench, with one sitting closer to the left and the other towards the right. There is a small dog sitting to the left of the two people, sitting closer to the left individual", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": " The image displays a backpack containing various items, including a laptop, school supplies, and personal items. There are two cell phones, one placed on top of the backpack and another situated slightly to the right of the first one", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "scissors", "backpack", "book", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 268293, "caption": "The image features a man swinging a tennis racket at a tennis ball, trying to hit it. He appears to be focused on the ball, which is mid-air as he completes the swing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": " This image depicts a small, narrow kitchen featuring wooden cabinets and appliances", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "oven", "spoon", "bottle", "clock", "sink", "apple"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 358086, "caption": " The image portrays a group of young women playing a game of frisbee, with several players actively participating in the game. They are jumping in the air or close to the ground, attempting to catch the frisbee in various spots", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "chair", "bottle", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 89541, "caption": "According to the picture, there is a variety of foods placed in a yellow container, including bananas, strawberries, and muffins. The different fruits and baked goods are presented in various sizes and locations within the container", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "cake", "bowl", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 239445, "caption": " In the image, a young man is standing in a park area, holding a frisbee and smiling. He seems to be playing with the frisbee, asking another person to play with him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "sports ball", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 105782, "caption": " The scene takes place in a grassy park where a giraffe is standing tall, enjoying its time on the field. The giraffe is in the center of the image, while there are also a few cars and trucks parked around the area, giving a sense of a parking lot nearby", "mscoco_hallucinated_words": [["giraffe", "giraffe"], ["giraffe", "giraffe"], ["truck", "truck"]], "mscoco_gt_words": ["cat", "kite", "dog", "car"], "hallucination_idxs": [10, 23, 40], "metrics": {"CHAIRs": 1, "CHAIRi": 0.75, "Recall": 0.25}}, {"image_id": 487990, "caption": " There is an image, possibly a photograph, displayed at a picnic table or dining table filled with various food items, including sandwiches, fruits, bottles, and cups. A guitar, resembling a ukulele, is also present in the image", "mscoco_hallucinated_words": [["sandwich", "sandwich"]], "mscoco_gt_words": ["fork", "bowl", "dining table", "donut", "spoon", "bottle", "cake", "knife", "person", "cup"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.3}}, {"image_id": 138834, "caption": "According to the picture, there is a man walking with his ski equipment along a snowy area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skis", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 492057, "caption": " The image shows a man standing in a bathroom, taking a photo of himself using his cell phone. The man is wearing a black jacket, and a tie can be seen at the bottom left of the mirror", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388215, "caption": " In the image, a man is playing tennis on a tennis court, holding a tennis racquet and reaching out to hit a ball. The ball appears to be near the right side of the court, possibly ready for the player to strike it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tennis racket", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 111409, "caption": "The image features a man wearing a motorcycle helmet riding on a motorcycle down a road. The motorcycle appears to be orange and has a small, round backrest", "mscoco_hallucinated_words": [["orange", "orange"]], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [22], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.6666666666666666}}, {"image_id": 451391, "caption": " The image features a large green plant or vegetable bunched together in the center of the scene. It appears to be made up of numerous small, green vegetables", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["broccoli", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.0}}, {"image_id": 92462, "caption": " In the image, there is a group of cows walking down a dirt road or street with a car driving alongside them. A person can be seen standing nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "car", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 397268, "caption": "The scene depicts two bikes placed in front of a store window, with two chairs sitting alongside them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "chair", "bench", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 127659, "caption": "The scene takes place at a train station where the main feature is a large yellow and white train that is moving along the tracks. The train is in the center of the image, pulling a set of passenger cars", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["train", "bench", "dog", "person"], "hallucination_idxs": [41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 251857, "caption": "The image captures a skateboarder in action as he stands on a skateboard in the middle of a park. In the background, there are several people scattered throughout the park, some closer to the foreground, and others more distant", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 243212, "caption": "A small child is standing in a grassy area, holding a blue frisbee. The child seems to be playing with the frisbee, as they are standing in the middle of the field and holding it close to their body", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "car", "truck", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 444304, "caption": "The image depicts a person, likely a man, crouching down and looking at a screen, which might be a television, possibly watching someone on the screen. There are two Wii remotes lying next to each other, likely for playing video games, on the bed where the person is sitting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "remote", "tv", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 305000, "caption": "The image depicts an outdoor fruit and vegetable market where two women are shopping. They are positioned near the center of the image, surrounded by various produce such as apples, bananas, and pears", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "apple", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 44136, "caption": " The image features two men dressed in suits and sitting at a table during a party or event. The table is set with multiple chairs, as well as a variety of dishes and glasses", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "tie", "bottle", "wine glass", "person"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.4}}, {"image_id": 154846, "caption": " The image depicts a silver pizza tray with a cheese pizza topped with mushrooms in the center of the frame. Several people are in the vicinity of the pizza, with a few of them interacting directly with the pizza", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 567254, "caption": " In the image, a young child stands in front of a television screen, holding a toy in her hand. There is a baby or toddler sitting or standing behind her, possibly being displayed on the TV screen", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bottle", "tv", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 79261, "caption": "According to the picture, there is a man pushing a broom in front of a building, possibly at a plaza or a park. The man is walking in the water, most likely trying to clean the area around the building", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "backpack", "clock", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 301402, "caption": " The image depicts a man sitting in a chair with a laptop open on his lap. He is wearing a black hat and appears to be sitting in a living room setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "chair", "bottle", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 415828, "caption": "The image shows a group of people standing near a city street under an umbrella, protecting them from the rain. The umbrella is set up with several people gathered around it, creating an outdoor gathering atmosphere", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "stop sign", "umbrella", "handbag", "traffic light", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 543231, "caption": " In the image, a woman with long hair is standing in a living room, holding a Wii remote and taking a picture with it. Behind her, there are two people: one person on the left side of the scene and another on the right side", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "bowl", "bed", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 495357, "caption": "The image captures a scene where a man is riding a motorcycle down a street or avenue, with cars and other traffic around him. There are several people in the scene, with one person standing near a vehicle in the middle of the street", "mscoco_hallucinated_words": [["car", "car"]], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "person"], "hallucination_idxs": [19], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.5}}, {"image_id": 523175, "caption": "A bowl of vegetable soup is the main attraction of the image, filled with various ingredients, including carrots and tomatoes. The soup has a spoon in it, likely filled with the soup's delicious contents", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["spoon", "bowl", "carrot", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 184945, "caption": " The image features a parking lot filled with motorcycles and scooters of different sizes, including a red motorcycle with a sidecar, as well as several smaller motorcycles and scooters. Some of the motorcycles are situated near a few bicycles and bicycle helmets", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "potted plant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 49942, "caption": "The image shows a man wearing a tie and a pink shirt, with his head under an umbrella. He appears to be staring intensely at something, possibly a light or an illuminated area, which is outside the frame", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "umbrella"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "According to the image, there is a blue train sitting at a train station. The train is positioned close to the end of the station platform, and some passengers are approaching the train, likely disembarking or boarding it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "train", "handbag", "car", "person", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 42889, "caption": " In the image, there are two computer-related items: a computer keyboard and a computer mouse. There is also a teddy bear with headphones on top of them, positioned close to the keyboard and mouse", "mscoco_hallucinated_words": [["computer", "laptop"], ["computer", "laptop"], ["mouse", "mouse"], ["mouse", "mouse"]], "mscoco_gt_words": ["teddy bear", "bear", "keyboard"], "hallucination_idxs": [11, 15, 16, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5714285714285714, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": " The image features multiple vases and sculptures placed on a dining table. The main focal point is a yellow bowl-shaped vase filled with green grass and a variety of sculptures", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "dining table", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 247587, "caption": "In the image, there is a young man standing on a surfboard, apparently attempting to surf. He is the central figure in the scene, with others scattered around him", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 171962, "caption": "There is a large blue city bus stopped on the street, with a bicycle parked at the edge of the bus", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bus", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 545056, "caption": "In the scene, there is an expansive field filled with black and white cows grazing in various positions. Some are standing, while others are lying down, dispersed throughout the grassy area", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sheep", "dog", "car", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 401411, "caption": "There is a man standing in a kitchen, likely preparing food or attending to ingredients, and a woman is present nearby. There are multiple bottles in various positions around the kitchen, and there are two spoons in the image", "mscoco_hallucinated_words": [["bottle", "bottle"], ["spoon", "spoon"]], "mscoco_gt_words": ["bowl", "dining table", "oven", "book", "person", "carrot", "cup"], "hallucination_idxs": [27, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.14285714285714285}}, {"image_id": 2867, "caption": "The image shows a group of people standing outdoors in a snowy setting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "skis", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 246963, "caption": " The image depicts a group of motorcyclists riding down a road with the traffic sign of a \"STOP\" sign nearby. There are seven motorcycles visible in the scene, with a couple of them closely followed by the rest", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["stop sign", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 498225, "caption": " In the image, a group of people is standing in front of a shop, likely a cake store, with a sign displaying a cake inside. There are three men in the crowd, one closer to the left side, one more centered in the group, and the last person at the far right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "donut", "cake", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.4}}, {"image_id": 124601, "caption": "The image displays a nicely appointed small kitchen area, featuring wooden cabinets, countertops, and appliances. There is a laptop sitting on the counter, possibly being used by the occupants or for working remotely", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "keyboard", "sink", "wine glass", "microwave", "laptop", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 581749, "caption": " The image features a table full of fruits and vegetables. Among the produce, there are multiple types of vegetables such as broccoli, oranges, and bananas", "mscoco_hallucinated_words": [["banana", "banana"]], "mscoco_gt_words": ["orange", "broccoli", "dining table"], "hallucination_idxs": [28], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 227960, "caption": " The image displays a tray of food with various vegetables and carrots on a dining table. There are several spoons in the pan, indicating different levels of preparation", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "spoon", "wine glass", "carrot", "cup"], "hallucination_idxs": [15], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.4}}, {"image_id": 286503, "caption": " In the image, an elephant stands outdoors near a fence, likely in a zoo or park. The elephant is occupying the center of the scene, with two people standing nearby: one near the left edge of the image and the other to the right", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "person", "sports ball"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 315740, "caption": " There is a large, clean kitchen with granite countertops in the picture. The kitchen features stainless steel appliances including a sink and a refrigerator", "mscoco_hallucinated_words": [["refrigerator", "refrigerator"]], "mscoco_gt_words": ["bowl", "potted plant", "oven", "chair", "couch", "sink", "tv", "microwave"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.5, "Recall": 0.125}}, {"image_id": 142108, "caption": " The image depicts a group of people in a kitchen with several bananas spread out over a long table. They are gathered together to eat the bananas and are wearing blindfolds, possibly playing a game or completing a dare", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "dining table", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": " The image captures a baseball game scene, with several people gathered around a field. A baseball player is getting ready to hit a ball, holding a bat and standing at home plate", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball bat", "chair", "sports ball", "bench", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 475043, "caption": "In the image, there is a woman sitting on a couch with a laptop in front of her. She is holding a beer and is about to drink it, while using the laptop", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "spoon", "couch", "bottle", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 456784, "caption": "The image features a person holding a knife and cutting a chocolate cake. The knife is used to remove the cake's top, revealing its frosting", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cake", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": " In the scene, there is a park bench sitting on grass with a remote control placed on top of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bench", "remote", "tv"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 448694, "caption": " In the image, a large crowd of people is gathered at an open field, with a group of boys actively playing a frisbee game. Among the crowd, some people appear to be watching the frisbee players closely, while others are scattered throughout the scene, enjoying the day or waiting for their turn to play", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": "The image features a bowl filled with food, specifically noodles and vegetables. The noodles are likely stir-fried noodles, and the vegetables include broccoli and carrots", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "bowl", "dining table", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 503210, "caption": "This is an image of a nicely furnished living room filled with a variety of seating options. The room has a couch on the left side and a chair placed in front of it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "potted plant", "chair", "couch", "clock", "book", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2857142857142857}}, {"image_id": 142257, "caption": "There are several people waiting at a stop light in this scene. Numerous cars can be seen along the street, each with a traffic light above them", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "person", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}], "overall_metrics": {"CHAIRs": 0.15430861723446893, "CHAIRi": 0.06597434331093463, "Recall": 0.45458694659972615, "num_hallucinated_caps": 77.0, "num_caps": 499.0, "hallucinated_word_count": 108.0, "coco_word_count": 1637.0, "length_response": 37.33066132264529, "hallucinated_caps_ls": [144305, 54870, 524656, 253630, 326021, 254368, 231534, 209347, 523292, 209925, 277479, 344942, 465601, 185327, 348654, 482432, 37907, 506401, 368367, 325836, 401010, 567562, 346940, 122602, 411564, 259345, 545594, 437720, 535403, 426172, 443259, 563938, 301266, 475813, 539951, 143479, 230983, 564629, 53465, 381031, 530520, 9628, 288729, 252625, 493610, 30667, 366111, 532481, 396269, 536154, 256628, 575406, 275611, 478380, 45550, 444913, 82709, 344930, 14285, 150703, 49005, 283118, 429598, 490711, 163728, 304984, 105782, 487990, 111409, 127659, 44136, 495357, 42889, 401411, 581749, 227960, 315740]}}}