{"overall_metrics": {"CHAIRs": 0.1557377049180328, "CHAIRi": 0.03623718887262079, "Recall": 0.9012059369202227, "num_hallucinated_caps": 76.0, "num_caps": 488.0, "hallucinated_word_count": 99.0, "coco_word_count": 2732.0, "length_response": 42.42827868852459, "hallucinated_caps_ls": [95908, 401027, 253630, 161227, 343534, 2154, 104326, 213905, 344942, 498747, 261097, 348654, 482432, 454161, 129707, 451943, 37907, 506401, 368367, 295656, 401010, 122602, 24104, 527497, 265628, 499594, 5934, 245664, 357578, 443299, 120776, 219546, 386645, 437720, 32190, 138078, 180764, 249969, 14773, 134768, 526414, 273132, 191691, 159979, 217593, 477016, 475813, 359164, 539951, 230983, 381031, 163518, 423984, 302789, 88269, 536154, 353320, 368335, 1268, 275611, 478380, 272566, 354540, 344930, 381615, 45435, 304984, 39663, 388215, 397268, 305000, 567254, 543231, 227960, 475043, 33753]}, "results": {"sentences": [{"image_id": 144305, "caption": "itting on a dining table, there is a laptop and a keyboard.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "keyboard", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 164091, "caption": "itting a small caption for the image, I see a young girl sitting in a crib with a teddy bear by her side, surrounded by a bed. The girl is wearing a dress, and there is a doll on the bed, adding to the cozy and warm atmosphere of the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["teddy bear", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 308003, "caption": "A wooden bench, dining table, and a wooden chair are all positioned together in the same area. The wooden chair is on the ground, while the dining table is raised on a bench. The bench and the chair are placed near the dining table, giving the impression of an outdoor", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 54870, "caption": ". The image features a sandwich on a plate, a knife, and a fork. It appears to be a delicious meal, with the knife and fork ready to be used for cutting and eating the sandwich.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "sandwich", "fork"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 95908, "caption": "itting on the dining table, a person is using their laptop and mouse while a cup of coffee is sitting next to the laptop.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "laptop", "cup", "mouse", "keyboard"], "hallucination_idxs": [7], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8}}, {"image_id": 329614, "caption": "A car and a bus are traveling down the street in a busy city, with a person walking towards the bus stop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 324008, "caption": "A man is enjoying a meal on a dining table on an airplane while a banana and other food items are placed in front of him. The man is also sitting on a train, which suggests that he may be traveling to or from his destination. The presence of an orange in the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "banana", "cup", "airplane", "orange", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 52700, "caption": "A person is holding a fork while standing at a dining table surrounded by various desserts. In the image, there are multiple pieces of cake, and they are placed on a table with white tablecloths.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "fork", "spoon", "handbag", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 501311, "caption": "ports, a baseball player throws the ball while wearing a baseball glove, and a person watches him from a distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 247179, "caption": "urring through the city streets, a person rides a decorated elephant, accompanied by a group of people in colorful attire. The elephant is adorned with lights, and a truck can be seen in the background. The scene takes place at night, adding to the festive", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "elephant", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 577912, "caption": "itting in the bathroom, the image captures a sink, a toothbrush, a toilet, a bottle, and a bathroom sink. The sink is filled with water, and the toothbrush is placed on the counter. The bathroom sink has a mirror attached to it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "sink", "toilet", "toothbrush"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 196341, "caption": "A woman is walking down a hallway, carrying a handbag, and looking at a large clock hanging on the wall. The clock has a face with Roman numerals, adding to the historic and timeless atmosphere of the scene. The woman's focus on the clock suggests she might be checking the time or", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 234433, "caption": "itting a bicycle, a person is walking in the street, and a car is parked nearby. In the distance, a traffic light can be seen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bicycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 455610, "caption": ", a man is standing next to a motorcycle, while a car is parked in the background. A traffic light is visible in the scene, and a person is looking at the motorcycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39053, "caption": "A delicious breakfast spread is being enjoyed at a dining table. A bowl, a spoon, a cup, and a cell phone are present on the table, along with a variety of food items like a cake, an apple, and a bowl of cereal. The scene is a co", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "apple", "cell phone", "spoon", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 493862, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 2.3333333333333335}}, {"image_id": 269873, "caption": ", people are walking down the street and gathering around a traffic light, with a car and a bus present. The scene shows a busy and vibrant atmosphere with people going about their daily activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 524656, "caption": "A well-equipped kitchen with various cooking tools, appliances, and dining essentials is showcased in this image. The bowl, spoon, knife, and sink are some of the essential items for meal preparation, while the refrigerator, toaster, and", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "bowl", "wine glass", "bottle", "refrigerator", "cup", "microwave", "sink", "spoon", "oven", "toaster"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5454545454545454}}, {"image_id": 369547, "caption": "In this image, a dog is sitting on the beach, surrounded by a bird and a cat. The dog is looking at the bird, while the cat is either staring at the bird or looking away from it. The scene is a quirky and unique interaction between the three animals, capturing their different", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "cat", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 181446, "caption": ", a man is standing in front of a refrigerator, holding a bottle of soda and a bowl, possibly filled with food. The man is also holding a glass of water, which could be a refreshing drink for the day ahead.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "bottle", "refrigerator", "sink", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 128119, "caption": ", I see a woman standing on the sidewalk with her backpack, handbag, and bicycle. She is waiting for the bus to arrive. She is using her cell phone to stay in touch with her friends or family while waiting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bicycle", "cell phone", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 401027, "caption": "The image features a sandwich on a plate, a fork, a dining table, a knife, and a cup. The sandwich is being eaten by an unidentified person at a dining table, and the fork is being used to cut the sandwich. The presence of the kn", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "sandwich", "cup", "fork", "knife"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 253630, "caption": ", there is a living room area with a brown couch, a dining table, a potted plant, and a TV. The TV is currently turned off, and a brown chair is placed in front of it. The room also features a red cushion on the couch and a rug. The presence", "mscoco_hallucinated_words": [["couch", "couch"], ["couch", "couch"]], "mscoco_gt_words": ["dining table", "chair", "potted plant", "tv"], "hallucination_idxs": [10, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 326021, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "teddy bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.3333333333333333}}, {"image_id": 317320, "caption": "In this image, we see a person riding a bicycle on the road, and a motorcycle in the distance. The street is covered in a yellow line that indicates the lane markings on the road. The presence of these vehicles and the yellow lines suggest a busy and active street environment. The", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 507171, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 161227, "caption": "A person is sitting on a couch, using a laptop and cell phone, while there is a remote and a keyboard nearby. A chair is also present, and there is a cell phone and a keyboard on the table. The room is tidy and organized, with the laptop and keyboard on a desk.", "mscoco_hallucinated_words": [["person", "person"], ["table", "dining table"], ["desk", "dining table"]], "mscoco_gt_words": ["remote", "laptop", "couch", "chair", "cell phone", "keyboard"], "hallucination_idxs": [1, 42, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.23076923076923078, "Recall": 1.0}}, {"image_id": 25812, "caption": "In the image, a man is skiing down a snowy slope while a snowboarder is skiing up the same slope. Both of them are enjoying their time on the snowy mountain, with the man skiing down and the snowboarder skiing up.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 267537, "caption": ", four women are seated on a bench at a baseball game, while one of them holds an umbrella. The group appears to be enjoying the game and each other's company, enjoying the shared experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "bench", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 195645, "caption": "- A group of people, including a woman, are gathered at a dining table under a tree. They are enjoying a bottle of wine and sharing laughter, with some of them holding glasses. The woman has a handbag on her shoulder, while a baby is sitting on her lap. The image", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bench", "wine glass", "bottle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 155671, "caption": "In the image, there is a street scene with a traffic light at a crossroads. A truck is parked on one side of the street, and a car is parked on the other side. The image shows a busy intersection, which suggests that it is a bustling area with a lot", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "truck", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412894, "caption": "A person is walking down the street, carrying a backpack and a handbag, while a traffic light is visible in the background. The scene is busy, with many people around the person, and there is a lively atmosphere in the area. The image showcases the diversity of urban life, where people", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 32625, "caption": "itting in a dining area, there is a bowl with food on a table, a fork, and a cup of beverage. A book is also present, which may be a part of the dining experience. The setting seems cozy and comfortable, inviting guests to enjoy their meals and relax in", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "fork", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 18688, "caption": ", a baseball player is wearing a baseball glove, and he is ready to catch a baseball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 343534, "caption": "itting a living room setting, a young girl and her teddy bear sit together on a chair in front of a book, with a potted plant and a dog nearby. The dog seems to be happy and relaxed, while the girl is enjoying her time with the teddy bear. This cozy scene suggests", "mscoco_hallucinated_words": [["girl", "person"], ["girl", "person"]], "mscoco_gt_words": ["dog", "potted plant", "chair", "book", "teddy bear"], "hallucination_idxs": [8, 44], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 370657, "caption": "A young woman and her family are riding motorcycles, and she is holding a handbag on her lap while riding.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 481596, "caption": "A man is sitting in a chair, holding a hot dog, and smiling at the camera. He is enjoying a baseball game at a stadium, possibly enjoying a snack or a moment of entertainment. The man's presence adds a sense of fun and enjoyment to the atmosphere of the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "hot dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 460347, "caption": ", a man walks down the street while cars and a bus pass by him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 398753, "caption": "The baseball player in the white jersey is holding a baseball bat, while a person in the background wears a baseball glove. The image shows the ball being thrown and caught during a game, emphasizing the importance of teamwork and coordination between the players.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 67833, "caption": "A person is enjoying a delicious meal, consisting of a pizza and a glass of wine, on a dining table with a fork and a knife. The pizza is accompanied by a cup and a wine bottle, making it a complete and satisfying dining experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "wine glass", "bottle", "cup", "fork", "pizza", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7777777777777778}}, {"image_id": 272880, "caption": "itting on a dining table, a person holds a pizza slice in one hand and a fork in the other. The person is enjoying their meal, possibly with a friend or family member. The dining table is covered with a white tablecloth, which adds a touch of elegance to the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "fork", "pizza", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 254368, "caption": "A man in a bed is eating a pizza, enjoying a delicious meal while lounging in bed.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 193108, "caption": ", a young baseball player is standing behind a fence while wearing a baseball cap, holding a baseball bat. He is about to swing at a baseball, ready to hit the ball out of the field.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 459141, "caption": ", a person rides a bicycle down a city street, holding a potted plant while wearing a green hat. In the background, a motorcycle can be seen, which adds to the urban landscape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "motorcycle", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 231534, "caption": ", a man is loading a handbag and suitcase onto a truck at the airport, while a person is walking nearby. The airplane is parked in the background, ready to be boarded.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "airplane", "person", "handbag", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 227031, "caption": "itting a slice of cake, the happy couple stands in front of a dining table with a potted plant, surrounded by guests. The man holds a knife, while the woman wears a tie and a beautiful smile. The potted plant adds a touch of nature to the setting, making the scene more", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "tie", "knife", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209347, "caption": ", a woman is sitting on a train, looking at her cell phone while holding her handbag. She is wearing a leather jacket and a tie, which adds a touch of elegance to her attire.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "cell phone", "handbag", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 523292, "caption": "A cat is sitting on a dining table next to a laptop and a TV. The cat is likely enjoying the warmth and comfort of the table, while the laptop and TV provide entertainment and connectivity for the owner. The presence of these electronic devices in the same space as the cat adds a unique", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "chair", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 12375, "caption": "The dining table is filled with a delicious meal, consisting of a roasted chicken, potatoes, carrots, and broccoli. This is a healthy and satisfying meal that provides a balanced combination of protein, carbohydrates, and vegetables, making it", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 13550, "caption": "- \nIn the image, a woman is sitting on a rock, holding a handbag while petting an elephant. She is smiling, and the elephant seems to be enjoying the interaction. The woman is likely enjoying her time with the elephant, which could be a positive", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 146837, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "motorcycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 67042, "caption": "A man is standing in a narrow corridor, holding his cell phone in his hand and looking at it intently. He is wearing a tie and appears to be engaged in a conversation on his phone. The corridor has a stone wall and is surrounded by a tall building, giving the impression that", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 209925, "caption": ", a baseball player is shown with his glove, ready to catch the ball thrown by his teammate. The image captures the moment of the player's focus on the ball, with the ball in the air and the glove held up to catch it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 2154, "caption": "A man is sitting on a bench in the middle of a city street. There is a car parked nearby, and an umbrella is laying on the ground. The scene is in front of a small caf\u00e9, where a person is looking at a computer. The presence of the man, the", "mscoco_hallucinated_words": [["computer", "laptop"]], "mscoco_gt_words": ["umbrella", "bench", "person", "car"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 274931, "caption": "In this image, a car is parked on a city street, with a traffic light, a fire hydrant, and a streetlight nearby. The city street is filled with vehicles, and the car is a classic model, suggesting that it might be an older model or one that holds sentimental value. The", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "fire hydrant", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 104326, "caption": "A cozy and well-equipped kitchen scene features a dining table, a chair, a clock, a sink, a microwave, an oven, and a potted plant. This image captures a moment of everyday life in a comfortable and functional kitchen, where people can gather, cook,", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "knife", "potted plant", "microwave", "clock", "chair", "sink", "oven"], "hallucination_idxs": [47], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.875}}, {"image_id": 233888, "caption": "itting the baseball, a young boy is playing baseball on the television screen, surrounded by a potted plant, baseball glove, baseball bat, and a baseball-themed t-shirt.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "potted plant", "baseball glove", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 213905, "caption": "itting the tennis racket, two women are in the middle of a tennis match. One of them is about to hit the ball back to the other woman, who is ready to return it. They are standing in front of a chair, which might be used for seating or for holding the rackets. The woman", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["chair", "person", "backpack", "tennis racket"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.75}}, {"image_id": 277479, "caption": ", a young child is seen reaching into the refrigerator, likely to grab a bottle of juice. The bottle is placed on the top shelf of the refrigerator, while the child's mother is also in the room, possibly supervising or helping the child with the task", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 300784, "caption": ", a group of children, and a dog, are sitting in a living room setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 183766, "caption": "itting cake on a plate with a fork and knife, a dining table is set up for a special occasion. The cake is decorated with fresh berries, giving it a beautiful and inviting appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "knife", "fork", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 169169, "caption": "A person is walking down the sidewalk, passing a traffic light while a car is parked on the street. There is a potted plant nearby, and a sign with the words \"Spring in the City\" can be seen in the background. This scene suggests a peaceful and vibrant urban environment,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 409346, "caption": "itting, a woman and a man are enjoying a delicious cake at a dining table. The cake is served on a large platter, and they are using spoons to eat it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "spoon", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 522262, "caption": ", a bicycle is parked next to a car, and a person is walking down the street. The image also features a traffic light, which is indicating a green light, suggesting that the street is safe for crossing. The presence of multiple bicycles and a car in the scene indicates that this is a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bicycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 79014, "caption": "A man is holding a yellow, plastic orange ball in his hands while standing in front of a fence.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "orange", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 344942, "caption": "In this image, we can see a TV, a clock, a laptop, and a small toy. The toy is sitting on the desk next to the laptop, while the clock and TV are on the wall. This image captures a moment in time when people are enjoying their daily activities, such", "mscoco_hallucinated_words": [["desk", "dining table"], ["person", "person"]], "mscoco_gt_words": ["clock", "laptop", "tv"], "hallucination_idxs": [27, 51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 275929, "caption": "Three toilets are arranged in a stair-like formation, with one sitting on top of a sink. The bowl is empty, and there is a sign on the wall nearby. The image presents a surreal and unusual scene, suggesting that the toilets might be part of a creative installation or", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toilet", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 465601, "caption": "A woman is walking down the street with her handbag, while an elderly man is sitting on a bench, enjoying the rain. The woman is holding an umbrella, possibly to protect herself from the rain. The scene captures the interaction between the two individuals, with the woman walking and the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "bench", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 300124, "caption": "A dining table, microwave, and couch are all visible in the image. The couch has a cardboard box on top of it, and there is also a chair nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "dining table", "chair", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 46919, "caption": "A man is walking uphill with his skis on, wearing a backpack, and carrying a ski pole. The snowy terrain provides a challenging environment for his outdoor adventure.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 434022, "caption": "itting the train and car at a railroad crossing, a person is walking down the track, and a train is moving in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 490491, "caption": "A man is standing in front of a car, looking at a stop sign on the side of the road. The stop sign has a red background and a white letter \"S\" on it. The man appears to be waiting for something or someone, possibly a bus or another vehicle, while the car is parked nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["stop sign", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 175878, "caption": "A group of friends stand together, smiling and looking at their cell phones, possibly taking a selfie or engaging in a conversation. The man in the middle wears a backpack, adding a casual and outdoor vibe to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 12669, "caption": "In this image, a person is holding a wine glass, while standing in front of a vase with a handbag hanging from their shoulder.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["vase", "handbag", "person", "wine glass"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 192496, "caption": "itting on the train tracks, a person watches a train passing by, with a blue car parked nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 133146, "caption": "Focusing on the visible elements in the image, a short caption for this image could be: \"Relaxing in the living room, surrounded by comfortable furniture, a TV, and a few decorative pieces. The vase adds a touch of elegance to the room, while the book and the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "vase", "couch", "chair", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 53744, "caption": "A knife is placed on top of a sandwich, and it is sitting on a dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "knife", "sandwich"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 375902, "caption": "A man in a blue jacket, backpack, and skis is walking towards the finish line of a skiing competition, holding a pair of skis.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 498747, "caption": "In this image, a person is talking on their cell phone while standing in front of a bird. There is a bottle on the table, and other people are present in the room. The person on the phone appears to be engaged in a conversation, and the bird is sitting on the table.", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["bird", "cell phone", "person", "bottle"], "hallucination_idxs": [26, 57], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 52435, "caption": "I see a person on a surfboard riding a wave at the beach, accompanied by a bird flying above the water.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "surfboard", "umbrella", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 301977, "caption": "A cow stands by a refrigerator filled with bottles, indicating a possible connection between the two. Perhaps the cow is grazing on the bottles, or the refrigerator is used for storing and transporting the bottles. The presence of the cow and the bottles together in the same space", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "bottle", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261097, "caption": ", a man is standing on a tennis court, looking down at his tennis racket. He is wearing white clothing and is positioned in front of a blue backdrop, likely a tennis court. The image captures his moment of concentration while preparing to hit a tennis ball.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["chair", "bench", "person", "tennis racket"], "hallucination_idxs": [50], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.5}}, {"image_id": 227204, "caption": "A group of three women are enjoying a meal together at a dining table. They have a bowl of food, a spoon, and a cup, and one of them is holding a banana. One woman is using a cell phone, while the other two women are eating and enjoying their", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "banana", "bottle", "cup", "cell phone", "spoon", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7777777777777778}}, {"image_id": 353981, "caption": ", a truck and a person are parked next to a large airplane, with a small person standing next to the aircraft.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 185327, "caption": "A man is walking on the sidewalk next to a street, accompanied by a group of horses. In the background, a car is driving past the horses, and a cow is visible. In the distance, a bus is also present, adding a sense of movement and activity to the scene. The presence of", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "horse", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 348654, "caption": ", the scene captures a dining table with a laptop, cell phone, and TV. The laptop is placed on the table, while the cell phone is resting on top of a speaker and a dining chair. A TV is also visible in the background, creating an interesting and multifaceted setup", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["dining table", "cell phone", "laptop", "tv"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 482432, "caption": ", I see a sink, toothbrush, bowl, and a bottle of toothpaste. The sink is filled with water, and there is a toothbrush placed on top of it. The bowl is also filled with water, and there is a bottle of toothpaste sitting", "mscoco_hallucinated_words": [["bottle", "bottle"], ["bottle", "bottle"]], "mscoco_gt_words": ["sink", "bowl", "toothbrush"], "hallucination_idxs": [12, 46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 454161, "caption": "A delicious pizza is sitting on a dining table, accompanied by a wine glass and a fork. The wine glass is filled with a glass of red wine, and the pizza is topped with a variety of ingredients. A person is enjoying the pizza and a glass of wine on", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "pizza", "fork", "wine glass"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 144723, "caption": "A dog is flying with a frisbee in its mouth, while a truck and a car are parked in the background. The dog is jumping and playing in the grassy field, enjoying the outdoor setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "dog", "car", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 129707, "caption": "is holding a cell phone in the middle of a room, with a book on a table nearby.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cell phone", "person", "book"], "hallucination_idxs": [17], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 0.6666666666666666}}, {"image_id": 379842, "caption": ", the dining table is full of various objects, including a book, a remote, and two video games. The white book is likely a book about video games, which may be a reference book for the video games on the table. The remote is an electronic device used to control the video games, and the two", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "remote", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 203110, "caption": "A city street is bustling with activity, as a truck, a person, a train, a car, and an orange bus pass by a streetcar. The image captures the vibrant and lively atmosphere of the city, with people going about their daily routines while enjoying the s", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "orange", "truck", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 556478, "caption": "I am a helpful language and vision assistant, and I am here to help you write a caption for the image. Let's start by identifying the key elements in the image. There is a pizza, a bottle, an oven, and a cup. Now, we can create a ca", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["pizza", "oven", "cup", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 103815, "caption": "I see a baseball glove, a baseball player, a baseball, and a baseball bat. The baseball player is throwing a baseball, and the baseball is flying through the air.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 451943, "caption": ", a person is sitting on a bed with their backpack and suitcase, possibly getting ready for a trip. The bed is placed in a dining room, and there is a table nearby for eating. This scene could indicate that the person is packing and preparing to leave for a journey, perhaps a", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["dining table", "suitcase", "backpack", "bed"], "hallucination_idxs": [2, 45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 1.0}}, {"image_id": 268854, "caption": "A person is standing on the shore of a body of water, holding a handbag and looking out at a boat. The person appears to be taking a break from their activities, perhaps enjoying the scenery and the surrounding landscape. The presence of a boat adds to the overall tranquility of the atmosphere,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 34464, "caption": "ever wonder what it's like to be a surfer, riding the waves and enjoying the ocean's beauty? This image captures the essence of that experience, showcasing a group of people, including a surfer, a man with a surfboard, and a boat. The sur", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 495513, "caption": ", a man and a woman are enjoying a hot dog at a bonfire. The man is holding a fork while the woman is holding a hot dog, and they are both smiling at each other. The scene suggests a happy and relaxed atmosphere, where they are enjoying each other's company and the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "person", "hot dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 529256, "caption": "A dog is laying on a red bed, and a book is next to it. This scene is likely to evoke a sense of comfort and relaxation, as the dog is cozy and comfortable, and the book is nearby to provide mental stimulation. The presence of the book suggests that the dog owner values", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "dog", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 37907, "caption": "itting on top of a television, a cat is sitting on a chair, while a potted plant is sitting on the ground.", "mscoco_hallucinated_words": [["chair", "chair"]], "mscoco_gt_words": ["cat", "potted plant", "tv"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 506401, "caption": "itting through the intersection, a person is walking while a truck and a car pass by. A cat is sitting on a bench near the traffic light, while a car and a truck cross the intersection.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["traffic light", "car", "cat", "truck", "person"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 406611, "caption": "ki, the person is wearing a backpack and skiing down the mountain, surrounded by a crowd of people.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 267225, "caption": "ever in the parking lot, a person is seen next to a truck, holding a remote while a car is parked nearby. The truck has a large satellite dish on top, and there is also a TV on the back of it. The scene suggests that the person might be in charge of monitoring", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278365, "caption": "A young woman sits on a chair at a dining table, enjoying a pizza and a drink while holding her cell phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "pizza", "chair", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 69757, "caption": "A group of people, including an older man, are gathered around a dining table, celebrating a birthday with a delicious chocolate cake and various beverages. One person is cutting a slice of the cake while another person is enjoying a glass of wine.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "cup", "chair", "knife", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.42857142857142855}}, {"image_id": 432213, "caption": "- A person is enjoying a sandwich, along with a glass of wine, at a dining table, while a bowl of french fries is placed in front of them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "wine glass", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 368367, "caption": "itting her phone, a woman sits at a table with a cup of coffee.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [9], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 201555, "caption": "ever seen a traffic light next to a road construction site, complete with a truck and a car? This scene showcases the dynamic nature of urban development, as construction projects often require a variety of vehicles and equipment to ensure their successful completion. The presence of the traffic light at the construction site suggests that the area is", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "truck", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 92131, "caption": "A woman is standing by a bus stop, holding a handbag. There are people walking around the area, and the bus is parked nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 21779, "caption": "itting on the floor, a young woman is holding a suitcase and has her backpack on her lap, while a large red backpack lies nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "suitcase", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 565148, "caption": "itting into the baseball field, the man is preparing to catch a baseball during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.25}}, {"image_id": 160183, "caption": "- A fire truck is parked in front of a building, with a fireman standing next to it. A car is parked next to the fire truck. There are people standing around the scene, possibly watching the fire truck or waiting for it to be put into service.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 555456, "caption": "kiwi, a person in a black and white ski suit, is skiing down a snow-covered mountain slope. He is holding a ski pole and appears to be in a moment of concentration as he navigates through the snow.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 235595, "caption": "A person stands in front of a car, holding a cup and surrounded by several motorcycles parked on the sidewalk. A potted plant is also visible nearby, along with a parking meter.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "potted plant", "car", "parking meter", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 489909, "caption": "itting on the counter is a bowl filled with bananas, a bowl filled with oranges, and a bowl filled with cucumbers.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "orange", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 501080, "caption": "itting into the dining table, the potted plant adds a touch of greenery to the room, while the bowl and vase sit on the dining table, possibly containing flowers or other decorative items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 405183, "caption": "A young woman is sitting on a couch, holding a remote control while watching a TV. Her handbag is nearby on the couch. In the background, there is a potted plant and a book.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "potted plant", "couch", "chair", "book", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.875}}, {"image_id": 409138, "caption": "A green city bus is parked on the side of a street, with a car nearby. A person is standing near the bus, looking at it. This scene could be a typical urban setting, with people going about their daily routines and using public transportation to get around the city.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 411840, "caption": "itting a backpack, a woman and her friend are walking on a green grass field, carrying a tennis ball and tennis racket.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "tennis racket", "chair", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 333712, "caption": "itting into the corner of the dining table, a pizza slice rests on a plate, accompanied by a spoon. This image captures a moment of enjoyment and relaxation, as someone savors a delicious meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "pizza", "spoon"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 148977, "caption": "A small teddy bear sits on a dining table, wearing headphones and listening to music while holding a book. A bottle and a keyboard are nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bear", "bottle", "teddy bear", "book", "keyboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 399058, "caption": "A truck driver rides along the road while a person on a bicycle and an elephant on a trailer pass by. The man on the bicycle and the elephant are likely on a journey together, while the truck driver is navigating the road. The presence of the ele", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "bicycle", "orange", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 183571, "caption": "is a man standing near a boat, and he is holding a dog.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "dog", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 325836, "caption": "itting a beautiful handbag, a sleek motorcycle, and a person, the image captures the essence of a lively and vibrant scene. The motorcycle, with its bright colors and intricate details, stands out against the black and white background, while the person and bicycle create a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 437910, "caption": ", we find a small bathroom with a sink, toilet, and a bottle of hand sanitizer. The sink is surrounded by newspaper clippings, creating a messy and cluttered environment. The image shows that the bathroom is in need of some cleaning and organization to maintain a more", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 155885, "caption": ", I am captivated by the serene scene of a boat, a person, and birds, all floating on the water. The person is likely enjoying a leisurely time by the boat, while the birds are flying gracefully in the sky above. The presence of the boat and the birds adds", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 467887, "caption": "A person is sitting on a couch, using a laptop on a dining table, and wearing glasses. They are using a mouse to navigate on their computer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "mouse", "couch", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 174402, "caption": "The image shows a young man holding a tennis racket and standing on a tennis court.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 352734, "caption": "A young boy is eating a large chocolate cake with a spoon and fork at a dining table, enjoying his dessert.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "fork", "spoon", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346716, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.6666666666666667}}, {"image_id": 451144, "caption": ", two men stand on a snowy slope, with one of them holding a backpack and the other holding skis. They are likely preparing to ski down the hill together, enjoying the winter landscape.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 295656, "caption": "A group of people, including a man wearing a tie, are enjoying a meal together at a restaurant. They are eating pizza and enjoying the food, and the man is holding a fork in his hand. There is a clock on the wall behind them, a TV on the wall, and", "mscoco_hallucinated_words": [["fork", "fork"]], "mscoco_gt_words": ["tv", "tie", "car", "clock", "pizza", "chair", "person"], "hallucination_idxs": [36], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 0.7142857142857143}}, {"image_id": 129886, "caption": ", a young man is skiing down a snowy slope, while a snowboarder and a person are skiing up the slope. The snowboarder appears to be in the middle of a snowy area with a chairlift nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 557998, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.2}}, {"image_id": 401010, "caption": "is a comfortable bedroom with a flat screen TV, a white chair, a sink, and a bed. The room also features a white table next to the bed, providing a comfortable and spacious living space for the occupant. The image captures the essence of a cozy and inviting room,", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["chair", "sink", "tv", "bed"], "hallucination_idxs": [27], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 567562, "caption": "Caption: A family of three, including a mother and two young girls, enjoys a delicious dinner together at the dining table. The mother is eating pizza and using a fork, while the two girls are eating broccoli and using spoons. They are sitting at a wooden table,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bottle", "cup", "fork", "pizza", "chair", "broccoli", "spoon", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5454545454545454}}, {"image_id": 313928, "caption": "A sunny day at the beach, a group of friends enjoying a meal on the shore while the surfboard and bottle of wine add to the relaxed atmosphere. A dining table and chair are set up with a wine glass, offering a comfortable place to sit and enjoy their food and drink", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "surfboard", "wine glass", "bottle", "chair"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 490284, "caption": "A black dog is lying on a couch, resting its head on a person's shoulder, as the person gently pets it. The scene captures a heartwarming moment of affection and companionship between the dog and the person. The image conveys a sense of comfort, love", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 346940, "caption": "In this image, we see a kitchen scene with a knife and a bowl on a countertop. There is also an oven in the kitchen. The image captures a moment where the oven is producing a smoke-like haze, and there are two kettles on the stove.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "oven", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 222782, "caption": "The image depicts a busy street scene with a truck, a car, and a person walking down the street. In the foreground, a person is walking with a cell phone in hand, while in the background, a truck and a car are parked in the street. The presence of", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 255934, "caption": "A person is standing near a cow and an elephant, while a handbag is visible in the image. The presence of these animals in the same area suggests that they are in a zoo or wildlife sanctuary.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "handbag", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278627, "caption": "itting at the dining table, a person is holding a spoon and a fork, and they are enjoying their meal. The table is filled with a variety of food items, including a bottle, a cup, and a potted plant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "cup", "potted plant", "fork", "chair", "spoon", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.875}}, {"image_id": 400851, "caption": "itting on the beach, a group of friends lounge together in the sun, enjoying the warm weather. One person sits on a bench, holding a backpack, while others relax on a chair or the sand. A sports ball is nearby, and a boat is visible in the distance. The group", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "bench", "boat", "sports ball", "chair", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 472610, "caption": ", a woman is standing next to a cute dog, surrounded by cows in a field. The dog is wearing a leash, and the woman is holding a leash. The image shows a strong bond between the dog and the woman, as they are together in the field, with the cows and", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "person", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 78565, "caption": ", a man and a dog are seen enjoying a day out on the water, with a sailboat in the background. They are sitting on the beach with their dog on the sand, looking at the sailboat as it floats on the calm water. This image captures a peaceful and relaxing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "dog", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388237, "caption": "itting on a skateboard, a young man is riding down the street while enjoying a cold bottle of beer.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "skateboard", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 289640, "caption": "A group of friends are gathered around a dining table, enjoying each other's company while playing with a frisbee and drinking wine. The atmosphere is relaxed and fun, as they laugh and share stories while having a good time. The presence of an orange cup adds a pop of color to", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "cup", "chair", "orange", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5714285714285714}}, {"image_id": 122602, "caption": "itting the scissors and bottle on the dining table, a person is preparing for a crafting project. The bottle is filled with water, and the scissors are nearby for use. The presence of the scissors suggests that the person might be cutting a piece of paper, fabric,", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["dining table", "scissors", "bottle"], "hallucination_idxs": [11, 42], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 24104, "caption": "A laptop sits on a dining table, surrounded by a book, cup, and keyboard. A bed is visible in the background, and a TV is mounted on the wall. A mouse and a keyboard are also present on the dining table. The scene suggests that the person might be using the laptop for", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "laptop", "tv", "cup", "mouse", "book", "keyboard", "bed"], "hallucination_idxs": [53], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 1.0}}, {"image_id": 183709, "caption": "evergreen, a man is standing next to a red traffic light in the middle of a busy street. He is looking down and appears to be waiting for the signal to turn green before crossing the street. The man is wearing a black jacket and carrying a handbag, and there is a car nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "handbag", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 59710, "caption": "itting the kitchen, the dining table is occupied by a chair, while a refrigerator and a white appliance are also present. The image features a kitchen scene with a chair and an appliance, making it a functional space for eating, cooking, and enjoying meals.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "chair", "dining table"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 279407, "caption": "A young boy in a green baseball jersey and a baseball cap is tossing a baseball while standing near a bench, with a bottle and a baseball glove nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "sports ball", "baseball glove", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 314729, "caption": "The image features a vintage photograph of a man holding a baseball bat while standing in front of a white background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 527497, "caption": "itting in front of a dining table, a person is typing on a laptop while looking at a television. The table also has a keyboard and a mouse, which are essential tools for the person's work. The presence of a laptop and a television in the same space highlights the convenience of using a computer for", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["dining table", "laptop", "tv", "mouse", "keyboard"], "hallucination_idxs": [9, 37], "metrics": {"CHAIRs": 1, "CHAIRi": 0.18181818181818182, "Recall": 1.0}}, {"image_id": 411564, "caption": "A man is sitting on a chair at a dining table, holding his cell phone and enjoying a bottle of beer. The man is in a room with a refrigerator, and there is a dining table in front of him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "refrigerator", "chair", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 320350, "caption": "ki and the woman are walking on a snow-covered trail, accompanied by their dogs. They are enjoying a winter outing together, and the woman is carrying a ski pole.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401857, "caption": "A baseball player is swinging at a pitch on a baseball field while wearing a baseball glove. The player is positioned at a base, and there is a cup on the bench nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "cup", "sports ball", "baseball glove", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 285302, "caption": "itting on the table, a young boy is using scissors to cut a piece of paper.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "chair", "scissors"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 61959, "caption": "evergreen and colorful, the image shows a bus, a truck, a car, and a red traffic light at an intersection.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "truck", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 229659, "caption": "A person is sitting at a dining table in a restaurant, using a laptop while a clock is displayed on the wall. A bottle and a glass are also present on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "bottle", "clock", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 473118, "caption": "A man is seen riding a bicycle down a ramp, with a skateboard under his foot. This image captures the moment of the man performing a trick on the skateboard while riding the bicycle. It is a unique and thrilling combination of transportation and skate", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 391222, "caption": ":\nA man and a woman are standing next to a map in the snowy environment, looking at a map or a sign. They are likely hikers or skiers who are preparing for their outdoor adventure, as they have backpacks and skis. The presence of the map suggests that", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 416104, "caption": "A group of friends gather at a dining table, enjoying a meal while a person sips wine from a wine glass and another holds a bottle. An umbrella is present to provide shade, creating a pleasant atmosphere for their gathering.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "wine glass", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 511852, "caption": ", two men are skiing down a snow-covered slope, with one of them holding a backpack. They are skiing on a trail in the snow and appear to be enjoying the winter sports.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 59386, "caption": "A group of animals, including a giraffe, a zebra, and a car, are standing in a field. The giraffe is feeding from a tree, while the zebra is nearby. A car is also present, possibly being used as a makeshift shelter or a place", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "car", "giraffe"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 540180, "caption": "A man is walking past a traffic light, while a car is parked on the side of the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 153692, "caption": "itting into the bike lane, a man rides his bicycle while a dog walks by his side. The man wears a backpack, and the dog is on a leash. The image shows the dog and the man in the middle of a busy street, with cars passing by and a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "dog", "bicycle", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 265628, "caption": ", a person is standing in front of a toilet, and there is a bowl or a sink nearby. The person is wearing shoes, and the floor is covered in a wood-like material.", "mscoco_hallucinated_words": [["sink", "sink"]], "mscoco_gt_words": ["bowl", "toilet", "person"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 499594, "caption": "A person is eating a delicious broccoli salad on a dining table, accompanied by a cup of wine and a fork. The dining table has a nice blue tablecloth, adding to the warm and inviting atmosphere. The person is enjoying their meal, and the presence of a", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["dining table", "cup", "bottle", "fork", "broccoli", "knife"], "hallucination_idxs": [1, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2857142857142857, "Recall": 0.6666666666666666}}, {"image_id": 5934, "caption": "ports ball sitting on a bench, wearing a baseball glove, and a baseball is nearby. This scene shows a baseball player who is likely preparing for a game, getting ready to catch a ball, or possibly just relaxing and enjoying the sunny day.", "mscoco_hallucinated_words": [["player", "person"]], "mscoco_gt_words": ["sports ball", "bench", "baseball glove"], "hallucination_idxs": [23], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 47828, "caption": "is a boat floating on the water with a person sitting on it. There is also a person walking in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["boat", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47916, "caption": "itting their frisbee on the lawn, a group of dogs chases a frisbee in the grass. One of the dogs is white, and the other is brown. A truck and a car are parked near the area, and there is a bench nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "car", "frisbee", "truck"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 255483, "caption": "itting on the dining table, a person is reading a book while using scissors to cut out a piece of paper.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "scissors", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 131138, "caption": "A person is sitting at a dining table with a laptop and a spoon, surrounded by various items such as a mouse, keyboard, cup, and a potted plant. This image captures a moment of relaxation and work-life balance, where the person can enjoy a meal while also engaging in", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "cup", "potted plant", "mouse", "spoon", "keyboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8888888888888888}}, {"image_id": 258209, "caption": "itting on the bench, a man is sitting next to a bicycle and reading a book. He is surrounded by bricks and a street lamp, providing a quiet and cozy atmosphere for his reading experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 347453, "caption": ", the city street is bustling with various modes of transportation, including cars, buses, and a person walking down the sidewalk. The image captures the intersection of a busy street with a traffic light, where the person and the cars are waiting for their turns to proceed. The presence of a ben", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "bench", "car", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 285820, "caption": "\"A cozy living room filled with comfortable furniture and decorated with a few personal touches. The television sits in front of the couch, while a bowl, a book, and a cell phone lay on the coffee table. A bench and a dining table are also present in the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bench", "tv", "couch", "cell phone", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 245664, "caption": "The laptop and the TV are placed on a dining table, with a wine glass, mouse, keyboard, and book next to them. The laptop is connected to the TV, and a cup is sitting on the table. The scene suggests that the person is engaging in work or entertainment, possibly watching a", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "laptop", "wine glass", "cup", "tv", "mouse", "book", "keyboard"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.08333333333333333, "Recall": 1.0}}, {"image_id": 453731, "caption": ", a dog is resting on the floor, surrounded by various items like a bottle and a cup. The dog appears to be well-cared for and comfortable in its environment, providing a relaxing scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "cup", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 524601, "caption": "A group of friends are gathered in a living room, playing video games on a couch while one of them is holding a remote. The scene shows a fun and casual atmosphere where they are enjoying themselves and having a good time together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "refrigerator", "couch", "chair", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 253262, "caption": "itting on a tarmac, a person is walking towards a large plane while carrying a suitcase. An airport truck is parked nearby, and a man is standing in front of it. The scene shows a busy and active airport environment where people are preparing for their flights.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "tie", "person", "truck", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 87052, "caption": "A couple is standing in a kitchen, holding a bowl and a bottle, while the man is pouring a liquid from the bottle into the bowl. They are near a refrigerator, oven, and a counter with a cup.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "cup", "refrigerator", "bottle", "vase", "oven", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 357578, "caption": "A man is standing next to a red fire truck on a city street, while a car is stopped at a traffic light. The truck is in the process of stopping at a red light, and there is a person in the passenger seat.", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["traffic light", "truck", "person", "car"], "hallucination_idxs": [46], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 418692, "caption": "itting on the couch with a remote in hand, a person is reading a book while on their cell phone. The couch is situated in front of a window, and there's a chair nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "couch", "chair", "cell phone", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 259345, "caption": "itting a smile and holding an umbrella, a man is sitting in a chair, enjoying the sunny day while dressed in a colorful outfit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 119709, "caption": "itting a plate of freshly baked donuts, a woman is sitting in a chair, and a dining table is in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "chair", "donut"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 443299, "caption": "itting on a bench, a young woman is looking at her cell phone while her companion holds her backpack and handbag. They are both waiting at a train station or subway platform, possibly for a train or a bus. The presence of the bench and the cell phone suggests that they are in a public", "mscoco_hallucinated_words": [["train", "train"], ["train", "train"]], "mscoco_gt_words": ["bench", "cell phone", "backpack", "handbag", "person"], "hallucination_idxs": [29, 38], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 120776, "caption": "A young man is seen standing on a ledge, holding an umbrella and looking at his phone. There is a statue of an animal, possibly a giraffe, in the background. The person is also wearing a tie.", "mscoco_hallucinated_words": [["phone", "cell phone"], ["giraffe", "giraffe"]], "mscoco_gt_words": ["umbrella", "tie", "person"], "hallucination_idxs": [17, 29], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 411765, "caption": "I see a young man holding a tennis racket and preparing to hit a tennis ball, with a net in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 447991, "caption": "Bicycle, car, traffic light, and a person are all present in this image, creating a busy scene on the street. A handbag and cell phone are also visible, adding to the everyday atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "truck", "bicycle", "car", "cell phone", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 276719, "caption": "A group of people and their pets, including a dog, are gathered in front of a truck parked on the side of the road. They are standing around a handbag, possibly waiting for a ride. In the background, there is a car and a motorcycle, and a person is sitting in a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "car", "chair", "handbag", "motorcycle", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 422328, "caption": "itting in the snow, a person is standing next to a horse and a car, while a skier is snowboarding down the mountain. A snow-covered street is visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "car", "skis", "horse", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 354559, "caption": "A young woman wearing a black costume is holding a teddy bear and posing for the camera.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bear", "teddy bear", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 438788, "caption": "itting cake, the person is about to cut a slice of cake on a dining table, with a knife in their hand. The cake is white, and it appears to be decorated with chocolate chips. The scene captures a moment of preparation for sharing the cake with", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "knife", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 468005, "caption": ", a man is walking down a staircase with a backpack, while a big elephant is standing behind him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 545594, "caption": ", a man wearing a backpack and holding onto a bus seat is standing next to a chair on a bus. The man is wearing a tie-dye shirt, and the bus has a colorful design.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "orange", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 219546, "caption": "a person is sitting at a dining table, with a bowl, a spoon, a knife, a fork, and a bottle of ketchup in front of them.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "bowl", "bottle", "fork", "chair", "spoon", "knife"], "hallucination_idxs": [1], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.8571428571428571}}, {"image_id": 386645, "caption": "A bustling street scene unfolds as a truck, person, motorcycle, and car are parked near a building. The truck is parked in front of a restaurant, and the person is walking towards it. The motorcycle is parked nearby, and a car is stopped at a traffic light", "mscoco_hallucinated_words": [["traffic light", "traffic light"]], "mscoco_gt_words": ["motorcycle", "truck", "person", "car"], "hallucination_idxs": [52], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 155131, "caption": "- A woman is standing on a tennis court, ready to hit a tennis ball with her racket. She is wearing a white dress, a tie, and is holding a tennis racket. She is waiting for her turn to play.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "sports ball", "tie", "tennis racket", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 188752, "caption": "A man stands in front of a fruit stand, with a banana hanging from a rope above his head. This man is surrounded by other fruits, including apples, creating an inviting atmosphere. The image captures a moment of the man's interaction with the fruit stand, possibly showc", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 228350, "caption": "In this image, a man is standing next to a dining table with a bowl, a chair, an umbrella, a tie, and a wine glass. A cup is also visible on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "bowl", "wine glass", "cup", "tie", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 437720, "caption": "itting on a red dining table, a woman sits at a desk with a laptop in front of her. She is writing in her journal while enjoying a cup of tea and looking at a vase with flowers on her desk.", "mscoco_hallucinated_words": [["cup", "cup"]], "mscoco_gt_words": ["dining table", "laptop", "vase", "chair", "person"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.8}}, {"image_id": 378831, "caption": ", a woman is walking down the street with her handbag and is stopped by a fire hydrant.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "handbag", "fire hydrant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 375503, "caption": "itting in the room, the person sits at a dining table with a chair, a laptop, and a clock. There is also a TV and a large wooden staircase in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "clock", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 330369, "caption": "A red fishing boat is docked at a dock with a person on board, and a bird is perched on the boat. The person is looking out towards the water, possibly enjoying the view or preparing to depart.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 288576, "caption": "A man is playing a frisbee game with a bicycle in a grassy area, surrounded by trees.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 32190, "caption": "A man is sitting on a bench near a street with a car parked nearby. A horse statue stands in front of a store on the street, and there is a person walking on the sidewalk. This scene captures the contrast between a real horse and a lifelike statue, adding a touch", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["person", "horse", "car"], "hallucination_idxs": [6], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 535403, "caption": "In the image, there is a small, clean bathroom with a clock, potted plant, bottle, sink, hair drier, and toilet. The bathroom has a modern design and is well-maintained, with the clock placed on the wall above the sink and the potted plant", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "hair drier", "potted plant", "clock", "sink", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 449798, "caption": "is a desktop image featuring a laptop, a cell phone, a keyboard, a book, and a mouse.  The desktop setup appears to be cluttered, with several objects placed on the dining table.  There are also several toys and figurines present, adding to the cluttered appearance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "mouse", "cell phone", "book", "keyboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 375769, "caption": "A boy sits at a dining table eating a hot dog while looking at a bicycle parked outside. A cup and an umbrella are also present on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "hot dog", "cup", "bicycle", "car", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 426172, "caption": ", \nA dining table is set with a bottle, a knife, a chair, and a book, creating a cozy and inviting atmosphere. The presence of a book on the table adds a touch of elegance to the setting, which can be used for a meal or a relaxing", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "chair", "book", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 20650, "caption": "itting a fork in the sandwich, a bowl of broccoli, and an orange plate on a dining table, this image captures a moment of enjoying a healthy meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sandwich", "fork", "orange", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 138078, "caption": "A young boy is standing in a baseball field, wearing a red baseball uniform and holding a baseball bat. He is in the middle of a baseball game, with his glove on his left hand and his baseball glove off his right hand. The boy is looking down at the ground, possibly waiting for the", "mscoco_hallucinated_words": [["baseball bat", "baseball bat"]], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [18], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.6666666666666666}}, {"image_id": 152499, "caption": "itting on the edge of the pool, a person is holding a surfboard. In the background, a dining table, chair, and umbrella are visible.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "dining table", "surfboard", "potted plant", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 180764, "caption": "A young man is sitting at a dining table, holding a bottle of beer, and enjoying a delicious donut. He is smiling while taking a bite of the tasty treat.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["person", "microwave", "donut", "bottle"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 0.75}}, {"image_id": 147762, "caption": ", the cake sits on a dining table with a fork placed next to it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "fork", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 113860, "caption": ", I can see a man standing on a small boat in the middle of a lake. The boat is surrounded by tall trees and has a blue umbrella on it. This scene suggests a leisurely and relaxing time spent on the lake, possibly enjoying the beautiful scenery and the cool shade", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 460866, "caption": ", a toothbrush is sitting in a cup next to a bowl on a dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "sink", "toothbrush"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 177015, "caption": "itting on the couch, a man is using his laptop while a cat sits next to him on the couch, and a refrigerator is in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["laptop", "refrigerator", "chair", "couch", "apple", "cat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 333998, "caption": "Rushing to finish a meal preparation, a person is busy with utensils and ingredients scattered around the kitchen. The person is in a hurry to make a meal, and they have a bowl, sink, refrigerator, suitcase, bottle, microwave,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "bottle", "refrigerator", "cup", "microwave", "sink", "suitcase", "oven", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7777777777777778}}, {"image_id": 47131, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 2.3333333333333335}}, {"image_id": 425447, "caption": "The image shows a white truck with a fire hydrant parked on the side of a road. There is also a person present, possibly an emergency responder, in the vicinity. This scene might be indicative of an emergency situation, such as a fire or a spill, where the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "truck", "fire hydrant"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 295105, "caption": "itting in the kitchen, a man is surrounded by various appliances, including a bowl, a bottle, a sink, a refrigerator, and an oven. He is looking at a bottle and appears to be preparing to eat an apple.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "bottle", "refrigerator", "sink", "apple", "oven", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 443259, "caption": "A dining table with a bowl on it is surrounded by a chair, a sink, and a refrigerator. There is also a potted plant, an orange, and a bottle present in the room. The scene is well-lit and shows a clean kitchen, making it a welcoming and", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bottle", "refrigerator", "potted plant", "chair", "sink", "orange", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8888888888888888}}, {"image_id": 537861, "caption": ", a train is pulling into the station, and a person is sitting on a bench near the train. The image captures the moment when the train is arriving at the station, and the person is waiting for the train to stop. The presence of the bench suggests that the train station might be a public", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 145020, "caption": "Mickey Mouse, a cute teddy bear, and a person are posing in front of a Christmas-themed Disney World setting, complete with a Christmas tree. They are all dressed in festive attire, adding to the holiday spirit of the event.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "teddy bear", "person", "mouse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 249969, "caption": "A man sits at a dining table, enjoying a delicious meal with a bird, spoon, broccoli, and various other vegetables. He has a knife and a wine glass next to his plate, and a fork and a carrot are nearby. The presence of a wine glass", "mscoco_hallucinated_words": [["man", "person"]], "mscoco_gt_words": ["dining table", "wine glass", "fork", "bird", "broccoli", "spoon", "knife", "carrot"], "hallucination_idxs": [1], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 146504, "caption": "itting on a dining table, a person is using a laptop, keyboard, and mouse to work while holding a cup of coffee.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "cup", "mouse", "keyboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 40037, "caption": "A person is standing in front of a large clock tower, while a car is parked nearby. The clock tower appears to be a part of a building, and the person is looking up at the clock. The presence of the car suggests that the scene could be a moment of pause or contemplation, as", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 343218, "caption": "In this image, a person is seen standing on the court with a tennis racket, while a car is parked nearby. The person is ready to hit a tennis ball with the racket, possibly preparing for a game or practice session. The presence of the car nearby suggests that the person might be at", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "car", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 301837, "caption": "itting in the kitchen, there are several visible elements that catch the eye. A clock and a sink are on the countertop, while a refrigerator, oven, and stove are lined up against the wall. A white apple sits on the counter, adding a pop of color to the scene", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "clock", "apple", "sink", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 309071, "caption": "itting on the bench, a man is seen with a bicycle and a backpack, likely preparing for a journey. He is wearing a cap and sunglasses, indicating that he is prepared for the sun and outdoor activities. The man is also sitting near a potted plant, adding", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "potted plant", "bicycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 240681, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.25}}, {"image_id": 178753, "caption": "A young girl sits in a red chair, using a cell phone to talk to someone on the phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 278890, "caption": "is taking a selfie with his phone while sitting at a dining table, with his friend or family member nearby. The scene captures the interaction between the two individuals as they engage in conversation and share a moment of connection, possibly related to the conversation or their current situation. The presence of a dining table", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 47781, "caption": ", the image shows a room with a bed, a book, a vase, and a TV. A person is sitting on the bed, watching TV.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "vase", "book", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542792, "caption": ", a man is standing next to a large baseball bat in front of a building.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 293703, "caption": "A group of friends enjoying a meal at a table, with a bowl, dining table, chair, and utensils present.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bottle", "cup", "fork", "pizza", "chair", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 318777, "caption": "ports is a group of players on the baseball field, with a baseball glove on one of their hands, and a baseball in the air.  One person is wearing a baseball cap.  In the background, there is a chair, and a large crowd is watching the game.  The atmosphere seems to be", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "chair", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 14773, "caption": "A person is eating a hot dog from a tray while sitting at a dining table. A spoon, fork, and cup are nearby, and a bottle is on the table. The person is surrounded by a colorful and vibrant atmosphere, with a car and a truck in the background", "mscoco_hallucinated_words": [["truck", "truck"]], "mscoco_gt_words": ["dining table", "hot dog", "cup", "bottle", "fork", "car", "spoon", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 1.0}}, {"image_id": 521863, "caption": ", a person is skateboarding down the street, and a man is riding a bicycle on the sidewalk. This image showcases the vibrant atmosphere of the city, with people engaging in various activities and enjoying their surroundings. The presence of these two individuals, one on", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 250540, "caption": ", I see a group of people on a city street, including a man with a handbag and a woman holding a cell phone. They are standing near a car and a motorcycle, which might be parked or in motion. It appears to be a busy day in the city, as they are engaged in their", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "cell phone", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 211604, "caption": ", a young woman is lying on a wooden deck, wearing a sports hat, and holding a tennis racket.  She is positioned next to a tennis ball, which might indicate that she is about to play a game or has just finished one. The presence of the racket suggests that she might be a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 298067, "caption": "The image depicts a dining table with a bowl of food, a cup, and a bottle of beer. The bowl contains a portion of food, and the cup is filled with beer. The dining table is clean and well-organized, giving the impression of a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 563938, "caption": "itting the couch and enjoying a cold drink, the group of friends is engaging in a casual and relaxed conversation. The man in a tie and the woman in the black dress are seated on the couch, while the other woman is standing nearby with a bottle. The setting appears to be", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bottle", "tie", "couch", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 461311, "caption": "The image captures a group of people enjoying a day outdoors. They are gathered in a park, with one person flying a kite and others watching. A man is standing with a backpack, while a woman is holding a handbag. A car is also visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "car", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 47940, "caption": ", a person is sitting on a bed, surrounded by various books, a laptop, and a stuffed animal. The books are stacked on the bed, and the person is laying down, possibly reading or watching something on their laptop. This scene suggests that the person is engaging in leisure activities,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bed", "laptop", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 261318, "caption": "itting his cell phone, a man sits on a bench in the middle of a street, surrounded by suitcases and a chair.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "chair", "cell phone", "suitcase", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 134768, "caption": "The image shows a laptop, a keyboard, a mouse, and a dining table. The keyboard and mouse are set up on the dining table, while the laptop is placed on a desk. This setup suggests that the person using the computer is working or studying on the table, possibly using the laptop for", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "keyboard", "laptop", "mouse"], "hallucination_idxs": [43], "metrics": {"CHAIRs": 1, "CHAIRi": 0.07692307692307693, "Recall": 1.0}}, {"image_id": 526414, "caption": "A cozy living room scene with a cat, a book, a cup, and a dining table is filled with various items such as a laptop, remote, and a bowl. The cat is sitting on the couch, while the dining table is adorned with a vase.", "mscoco_hallucinated_words": [["vase", "vase"]], "mscoco_gt_words": ["dining table", "bowl", "laptop", "tv", "cup", "remote", "scissors", "couch", "cat", "book"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.09090909090909091, "Recall": 0.8}}, {"image_id": 273132, "caption": "The image shows a woman sitting on a bench, with her handbag on her lap. The woman is wearing a hat and has a backpack nearby.", "mscoco_hallucinated_words": [["backpack", "backpack"]], "mscoco_gt_words": ["bench", "handbag", "person"], "hallucination_idxs": [26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 505213, "caption": "A man is eating a hot dog and sandwich in the street, enjoying his meal as he stands by the curb.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "person", "hot dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 192656, "caption": "I can see a baseball player wearing a black and white baseball glove, standing on a grass field with a white baseball in his hand. He appears to be preparing for a pitch or catching a ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 272728, "caption": ", there is a vase filled with colorful flowers sitting on a dining table. A chair is placed next to the table, and there is a potted plant nearby. The image captures a warm and cozy atmosphere, creating a sense of comfort and relaxation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 509811, "caption": ", a young woman is sitting at a dining table with a laptop and a book. She is working on a project while enjoying a snack of an apple. The room has several chairs and a few computers set up around the dining table, giving it an organized and functional feel. This image conve", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "chair", "apple", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 412407, "caption": "A woman is holding her cell phone, while a clock is visible on her wrist. This scene might be a scene of a person taking a photo of their wristwatch or checking the time on her cell phone. The presence of the clock and the cell phone together in this image can suggest that the person is", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 353658, "caption": "A woman with a handbag and a suitcase is walking down a crowded street, holding an umbrella over her head to protect herself from the rain. She is surrounded by other people, a clock, and various items for sale, such as a carrot and a handbag.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "clock", "person", "handbag", "suitcase", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 441363, "caption": "- A young man in red tennis attire is swinging a tennis racket as he prepares to hit the ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 331084, "caption": ", \"A group of friends gather around the dining table to enjoy a delicious meal of pizza and salad, with a spoon, knife, and bowl present on the table. The person in the image is using a spoon to eat their meal.\"", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "pizza", "spoon", "knife", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 282553, "caption": "A man is walking down a dirt road, while holding a handbag and using his cell phone. There is a person in the background, walking as well. The scene is set against a large stone structure with a tree in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 328818, "caption": "itting her bicycle on a bench, a young woman is sitting and looking at her bottle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "bench", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 3103, "caption": ", the skateboarder is seen jumping off a ramp with his skateboard in the air. The person is wearing a yellow helmet and is performing a trick on the skateboard, which is likely to be a skateboard trick.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 188136, "caption": "itting in the kitchen, a person is washing dishes in a sink while a bowl of oranges and a carrot sit on the dining table. This scene showcases the everyday activity of cleaning up after a meal, and the person is using the sink to ensure the dishes are thoroughly", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "sink", "orange", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 449675, "caption": "A group of friends are enjoying a delicious meal together at a dining table. A sandwich, a fork, a spoon, and a knife are laid out on the table, accompanied by a bottle of water and a glass of juice. A carrot and a cup of soup are", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "sandwich", "bottle", "cup", "fork", "spoon", "knife", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 191691, "caption": "In the image, a person is sitting in front of a sink, using a toothbrush to brush their teeth. There is a bottle on the counter next to the sink, and they are using the sink to wash their face. This scene could be interpreted as a person preparing for a", "mscoco_hallucinated_words": [["sink", "sink"], ["sink", "sink"], ["sink", "sink"]], "mscoco_gt_words": ["toothbrush", "person", "bottle"], "hallucination_idxs": [12, 32, 39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.42857142857142855, "Recall": 1.0}}, {"image_id": 89378, "caption": "I see a man standing next to a motorcycle and a bicycle. He's wearing a red helmet, and there is a backpack nearby. This image captures the essence of a motorcyclist who is about to embark on a journey, perhaps a road trip or a bi", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 99874, "caption": "itting on a dining table, a person holds a cell phone, wearing a handbag, and is surrounded by a cat and a bottle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "cat", "cell phone", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 437895, "caption": "is a beautiful bedroom filled with colorful elements, including a vase, a potted plant, and a bed. The bed features a unique and intricate bed frame, showcasing the room's focus on creativity and aesthetics. The presence of the vase and potted plant adds", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "vase", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 557447, "caption": "itting on a dining table, a man is holding a spoon and fork while wearing a tie. He is looking down, possibly contemplating a meal or a conversation. In the background, there is a knife and a fork, indicating that the dining table has been set for a meal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "tie", "fork", "spoon", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 159979, "caption": ", a group of people, including a woman, are enjoying a meal on a boat while a man holds an umbrella over their heads to protect them from the rain. The boat is parked near a dock with various items such as a bowl, a table, and a boat in the background,", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["umbrella", "bowl", "boat", "person"], "hallucination_idxs": [49], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 357, "caption": "- A baseball player is holding a baseball bat, and there is a baseball glove on the ground.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 538891, "caption": "a woman with a tennis racket is standing at the net, with a tennis ball on the ground.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 561967, "caption": "itting the traffic light, a train and a car are all present on the tracks, creating a busy scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 565778, "caption": "ever the train and car pass by the busy train station, a person stands next to the train, and a traffic light is present, signaling for the train and cars to wait.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 7125, "caption": "A man is riding his bicycle down a dirt path while carrying his backpack, accompanied by his dog and holding a water bottle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "bottle", "bicycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 221350, "caption": "A man is sitting at a dining table, wearing a tie, and holding a handbag. The handbag has red hearts on it.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "tie", "chair", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 280930, "caption": "A young girl is standing in front of a refrigerator, smiling and holding a bottle of soda while standing near a counter. A bottle is also on the counter.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "oven", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 151081, "caption": "itting along a dirt road, a truck driver is looking up at a small airplane flying overhead while another person is present in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 122232, "caption": "itting a tennis ball back and forth, a young girl is engaging in a fun game of tennis with a tennis racket and a green tennis court in the background. The image captures the spirit of playfulness and physical activity, highlighting the importance of outdoor sports for exercise and mental well-being", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 134223, "caption": "A delicious hot dog and broccoli sit on a white plate atop a dining table, surrounded by a bowl and other condiments.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["hot dog", "dining table", "bowl", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 198163, "caption": "itting the caption for this image, I can describe the scene as a person is sitting at a dining table, surrounded by various bowls filled with colorful paint. The person is engaged in painting, possibly creating artwork or preparing a project, while the bowls serve as a storage container for the paint", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 3742, "caption": "A beautiful bowl of apples sits on a dining table, next to a vase containing a single red rose. This scene creates a serene and inviting atmosphere, perfect for a warm and cozy gathering.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "apple", "bowl", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 217593, "caption": "A young boy is standing in a grassy area, holding a baseball bat, and wearing a baseball glove. He is in the process of getting ready to throw the ball. A handbag is nearby, and there are several chairs scattered around the area.", "mscoco_hallucinated_words": [["ball", "sports ball"]], "mscoco_gt_words": ["baseball glove", "chair", "baseball bat", "handbag", "person"], "hallucination_idxs": [32], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 279078, "caption": "ittingly, a young woman is posing in front of a refrigerator full of bottles, likely containing various types of beverages. She is smiling, showcasing her personality and sense of fun.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "person", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 572081, "caption": "A man is playing tennis on a court, while a couch and a chair are nearby. He is wearing a tennis racket and appears to be in the middle of a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "tennis racket", "couch", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 477016, "caption": "A woman and her daughter are having a fun day at home, enjoying each other's company while admiring the decorations of their kitchen. The daughter is holding a potted plant, while the woman is looking at a sink, oven, and possibly a microwave. They are both smiling,", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["potted plant", "sink", "backpack", "oven", "handbag", "person"], "hallucination_idxs": [48], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.6666666666666666}}, {"image_id": 301266, "caption": "A group of people are gathered around a dining table, using various items such as a spoon, fork, and a cup. They are in a kitchen setting, with a sink and a book present. This scene could be described as a casual gathering of friends or family members enjoying a meal together", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "fork", "sink", "spoon", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 475813, "caption": "itting her baby's hand in a handbag, a young woman is holding a remote control. She is standing in front of a television and smiling.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["person", "handbag", "remote"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 558784, "caption": "is a man playing with a frisbee in a green grassy area, possibly on a golf course. He's standing near a water body, possibly a pond or lake, and is holding a backpack. The presence of a backpack suggests that he may be on an outdoor adventure or", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 308543, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "person", "spoon"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 359164, "caption": ", a dog is playing with a ball in the grass, while a person is sitting in a chair next to a dining table.", "mscoco_hallucinated_words": [["ball", "sports ball"], ["person", "person"]], "mscoco_gt_words": ["dining table", "dog", "chair", "car"], "hallucination_idxs": [7, 14], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 0.75}}, {"image_id": 539951, "caption": "In this image, a brick-built kitchen features a refrigerator, sink, clock, and oven. The kitchen is also equipped with a microwave. The brick walls, white countertops, and black and white checkered floor give it a unique and colorful appearance, making it", "mscoco_hallucinated_words": [["microwave", "microwave"]], "mscoco_gt_words": ["refrigerator", "oven", "sink", "clock"], "hallucination_idxs": [25], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 143479, "caption": "A man is holding a tray of hot dogs, and a chair is nearby. This scene could be a casual gathering or a small party where people are sharing food and socializing. The man's focus on the tray of hot dogs suggests that he is either serving them to others or enjo", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "hot dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 475024, "caption": ", \nThe image shows a dining table with a vase, a potted plant, and an orange flower in it. It is placed on the table next to a mirror, creating a visually pleasing and well-balanced arrangement. This arrangement brings warmth and a touch of nature into the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "orange", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 238065, "caption": "In this image, a young man is sitting on a wooden chair, riding a skateboard, and enjoying the sunshine. The chair is placed on a wooden rail or ledge, which might be part of a skateboard ramp. The man is performing a trick or stunt,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 314016, "caption": "In this image, we see a sink, a bottle, a book, and a toilet in a small bathroom. The sink is likely filled with water, and the book is placed on the counter next to it. The bottle is also sitting on the counter, while the toilet", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sink", "toilet", "book", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 322163, "caption": "A person is cooking in a dining room, preparing pizza in a microwave oven. The image shows a person working on the pizza while a remote is being used on a dining table. A bowl, a bottle, and a potted plant are also present in the room", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "remote", "bottle", "potted plant", "microwave", "pizza", "oven", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 230983, "caption": "A young man is shown in this image, riding a skateboard down a sidewalk. In the background, a car is parked, and the street has a streetlight nearby. This scene portrays a person enjoying their passion for skateboarding, which is an action-packed", "mscoco_hallucinated_words": [["streetlight", "traffic light"]], "mscoco_gt_words": ["skateboard", "person", "car"], "hallucination_idxs": [30], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 564629, "caption": "This image features a kitchen with a sink, refrigerator, microwave, oven, and a stove. The kitchen has a wooden cabinets and is equipped with modern appliances. The sink is likely a convenient and essential component of the kitchen, providing a clean and hygien", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "oven", "sink", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 53431, "caption": "A man is sitting in a chair, using a laptop. He is in front of a brick wall, and there is a white chair next to him.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 441840, "caption": "A baseball game is taking place in the background of this image, with a person, a baseball bat, and a baseball glove visible. A truck and a car can also be seen in the image, adding to the setting.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["baseball glove", "car", "baseball bat", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 53465, "caption": "itting on the dining table, a person holds a spoon and a bowl while eating cake with a fork. They have a cup of coffee and tea nearby. This scene suggests that the person is enjoying a sweet treat and a warm beverage, perhaps as part of a celebration or simply as", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "fork", "spoon", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 381031, "caption": "A man is enjoying a snack on a dining table with a spoon, a bottle of soda, and a cup of coffee. The man has a donut in his hand and a plate with an apple, and there is a microwave in the background.", "mscoco_hallucinated_words": [["man", "person"], ["man", "person"]], "mscoco_gt_words": ["dining table", "cup", "bottle", "microwave", "donut", "apple", "spoon"], "hallucination_idxs": [1, 26], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 530520, "caption": "is sitting on the grass, with a backpack and a laptop on their lap, while another person is looking at a clock. They are likely to be waiting for a motorcycle to arrive, as they are in a parked motorcycle in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["laptop", "clock", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 539555, "caption": "A person is performing a skateboard trick, while simultaneously holding a frisbee in their hand.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 279521, "caption": "itting on the street, two men, one with a backpack, the other with a handbag, are waiting for a bus or train.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "car", "backpack", "handbag", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 560439, "caption": "ix, a group of friends is skiing down a snowy slope, and one person is carrying a backpack, a handbag, and a snowboard.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "skis", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 248353, "caption": ", a group of friends can be seen skiing down a snow-covered mountain, with one person in a yellow jacket and another in a red jacket, and a snowboarder wearing a black jacket. They are all enjoying their time on the slopes, which showcases the fun", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.3333333333333333}}, {"image_id": 9628, "caption": "A man is riding a surfboard on the water, while a boat is behind him. The man is balancing and maneuvering the surfboard, and the boat is following him. This image captures a moment of skill and precision, as the man expertly navigates the sur", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "boat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 330050, "caption": "A man and a woman are posing for a picture while holding tennis rackets. They are sitting on a bench and smiling, indicating that they are enjoying the activity.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 254161, "caption": "A person is walking through a city park, holding a handbag and a kite.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["kite", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 275749, "caption": "A group of friends are hanging out in a room with a dining table and chairs, enjoying a bottle of beer and a cup of coffee. One person is using a laptop on the dining table, while another person is sitting on a chair. There is a TV nearby, and a mouse", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "bottle", "cup", "mouse", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 163518, "caption": "itting in a large dining room, a woman is carefully arranging salad bowls on a table. She is using a chair to reach the top shelf, where she has placed a large bowl of salad. The bowl is filled with various vegetables, such as lettuce, carrots", "mscoco_hallucinated_words": [["carrot", "carrot"]], "mscoco_gt_words": ["dining table", "bowl", "chair", "oven", "person"], "hallucination_idxs": [51], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 0.8}}, {"image_id": 156696, "caption": ", a man is walking down the train platform while carrying a handbag, while the train is parked at the station.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 525202, "caption": "A group of friends are sitting at a dining table, enjoying a delicious pizza together. One person is using a spoon to eat their pizza, while another person is holding a knife and fork. A bottle of drink is placed on the table, and a cup is also present. The", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "cup", "fork", "pizza", "chair", "spoon", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8888888888888888}}, {"image_id": 423984, "caption": ", a bowl of delicious pasta is placed on a dining table, accompanied by a carrot and broccoli. The meal is likely a healthy and balanced option for a nutritious lunch or dinner.", "mscoco_hallucinated_words": [["bowl", "bowl"]], "mscoco_gt_words": ["dining table", "broccoli", "carrot"], "hallucination_idxs": [2], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 499486, "caption": "The image features a bed with two cats, a book, and a messy bedroom. The cats are sitting on the bed, while the book is lying on the floor. The scene suggests that the bedroom is in disarray, and the cats may be the reason for the mess. The", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 118124, "caption": "is a young man sitting in a car, wearing a tie.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tie", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 288729, "caption": "itting in front of the cake is a smiling person, accompanied by a stuffed teddy bear and an elephant. The cake is shaped like a circus tent, featuring a blue and yellow color scheme.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "bear", "teddy bear", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 177721, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "skateboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.3333333333333333}}, {"image_id": 372220, "caption": ", a man in a black hat and long beard is standing next to a white horse and a woman with a handbag. He is holding a cell phone and looking down at it, possibly checking his phone for a message or browsing the internet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "handbag", "person", "horse"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 252625, "caption": "itting the street with the bicycle and the person, a traffic light is located at a busy intersection. A potted plant is placed on the sidewalk, and a person is walking down the sidewalk with a bicycle.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "potted plant", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 493610, "caption": "A truck and a car are parked on a dirt road, with a person standing next to them. A bus is also present, and a person is walking in the distance. The image captures the daily life and traffic scene in a small town or rural area, with people going about their daily routines", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 302789, "caption": "A busy traveler's bedroom features a collection of luggage and personal items, such as a backpack, handbag, and bottle, arranged on the bed. A suitcase and a refrigerator are also present, adding to the cluttered atmosphere. The bed is occupied by a", "mscoco_hallucinated_words": [["traveler", "person"]], "mscoco_gt_words": ["bottle", "refrigerator", "sink", "backpack", "handbag", "suitcase", "bed"], "hallucination_idxs": [2], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 0.8571428571428571}}, {"image_id": 30667, "caption": ", a brown dog is laying on a boat, accompanied by a person.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "dog", "boat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 114108, "caption": "In the image, a person is sitting at a dining table with a spoon and a fork, enjoying a slice of pizza with a knife. The pizza is placed on a plate, and there is a cup next to it. This caption captures the essence of the image:", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "fork", "pizza", "chair", "spoon", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.875}}, {"image_id": 328110, "caption": "itting on a bench, a young woman wears a backpack and holds a baby in her arms. She is sitting down with her handbag and a small handbag nearby, seemingly taking a break from her travels.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "handbag", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 366111, "caption": "evergreen parked cars and a bicycle are seen in the image. A person is walking down the street, and a traffic light is visible.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bicycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 532481, "caption": "A man is flying a kite on the beach while a car is parked nearby. In the foreground, there are surfboards on the sand, and a person is riding the waves. This scene suggests that the location is a beach, and it appears to be a sunny day. The k", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "kite", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 291634, "caption": "is an image of a group of people on a busy street, with a motorcycle parked nearby, and a bicycle on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 463678, "caption": "I am a tennis player, I stand in front of a tennis net, holding a tennis racket. I am about to serve a tennis ball.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 70815, "caption": "itting the pavement, a dog sits next to a bicycle while a man walks past with a truck. The dog appears to be well-behaved and enjoying the outdoor space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "truck", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 367872, "caption": "itting a baseball with a bat, a person holding a baseball glove, and an umbrella in the background, this image captures a moment of a baseball game in progress. The umbrella is likely there to protect the players from the rain, adding an interesting twist to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "sports ball", "baseball glove", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 353898, "caption": "itting in the park, a man is sitting on a bench and reading his cell phone while surrounded by a group of birds. He is dressed in casual attire and has a backpack next to him. The park is filled with white birds, creating a serene and peaceful atmosphere.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "bird", "cell phone", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 88269, "caption": ", \nA bowl filled with soup sits on a plate, and next to it is a sandwich. The bowl is covered in a red sauce, and it appears to be a tomato-based soup. The presence of a spoon indicates that the person is about to consume the soup.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["sandwich", "bowl", "spoon"], "hallucination_idxs": [45], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 463211, "caption": "A young girl and a boy are walking alongside a school bus, each wearing backpacks. They are walking in a parking lot, possibly near a school or park. The girl has a ponytail and is carrying a backpack, while the boy is holding onto his backpack. The scene seems", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 396269, "caption": "A beautiful floral arrangement sits on a dining table in a luxurious room, surrounded by elegant chairs and a vase. This stunning centerpiece adds a touch of elegance and warmth to the space, creating a welcoming atmosphere for guests to enjoy.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 98979, "caption": ", the image features a microwave, oven, and a knife on a countertop. The microwave is turned off, and the oven is open, revealing a cooking surface inside. The knife is placed next to the appliances, adding to the overall kitchen setup.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "oven", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 244455, "caption": "everett and his friend are riding their motorcycles down the street, with a bicycle and a car following behind them. A truck is parked at the side of the road, and a person is walking in the direction of the motorcyclists. The scene appears to be a casual", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "car", "motorcycle", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 379443, "caption": "itting the orange fabric on a dining table, a pair of scissors is busy cutting the material. The scissors are positioned on top of the fabric, ready to make a precise cut. The presence of the orange fabric indicates that the scissors are being used to create a unique design or pattern", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "orange", "scissors"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 536154, "caption": ", a young girl is sitting on a bed surrounded by a pile of stuffed animals, including a teddy bear, a cat, and numerous other toys. The girl is smiling and looking happy, and the stuffed animals appear to be her friends. This image captures a moment of innocence and", "mscoco_hallucinated_words": [["bed", "bed"]], "mscoco_gt_words": ["cat", "teddy bear", "person"], "hallucination_idxs": [8], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 374061, "caption": "itting on a couch, a cat is resting on a bed next to a teddy bear and a stuffed bear. The cat seems to be enjoying a cozy and comfortable resting spot, while the stuffed animals provide a soft and cuddly companion for the cat to snuggle up", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bear", "couch", "cat", "teddy bear", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 560018, "caption": ", a person is seen riding a surfboard in the ocean, while a kite is flying in the sky.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "kite"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 353320, "caption": "itting a suitcase, a handbag, and a person sitting on a bench, the scene appears to be a busy and active place, possibly at a train station or airport. The person may be waiting for a train or flight, while the suitcase and handbag suggest that they are preparing for their", "mscoco_hallucinated_words": [["train", "train"], ["train", "train"]], "mscoco_gt_words": ["person", "bench", "handbag", "suitcase"], "hallucination_idxs": [29, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 256628, "caption": ", a golden dog lies on the wooden deck, surrounded by a bowl and a chair. The dog's bright yellow fur and the wooden deck make for a beautiful contrast. The bowl and the chair create a cozy atmosphere, and the dog's presence adds warmth to the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bowl", "dog"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 158806, "caption": "The image shows a dog sitting on the ground, looking at a sandwich, while a person is eating the same sandwich from a plate. The dog is likely waiting patiently for its turn to eat the sandwich, while the person is enjoying their meal. The presence of the blue plate,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sandwich", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 424270, "caption": "A small child is sitting on a bed in a room with a TV, possibly in a hotel or a home. The room is empty, and the child is watching a show on the TV. The image captures a moment of the child's engagement with the screen, and the viewer can infer that", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "tv", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 368335, "caption": "In the image, we see a horse standing next to a car and a truck, as well as other vehicles parked in the area. This scene is a common sight in many parking lots, where people park their cars and trucks, and sometimes other vehicles, like horses, may also be present", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["truck", "horse", "car"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 483227, "caption": "A group of children are gathered around a tennis court, participating in a tennis lesson or playing a game. One young boy is holding a tennis racket, and another boy is holding a sports ball. They appear to be having fun and engaging in a physical activity, possibly practicing their tennis skills.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 228306, "caption": ", a person is walking down a city street, with a car passing by and a bus parked in the background. The bus is a red double-decker, which is a unique feature in this image. The scene captures the essence of city life, where people, vehicles, and public transportation co", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 416384, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "cell phone", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 364783, "caption": "Caption: A cozy bathroom with a potted plant, a chair, a sink, a toilet, a cup, and a toothbrush, offering a welcoming and comfortable space for personal grooming and relaxation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "cup", "potted plant", "chair", "sink", "toilet", "toothbrush"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 335041, "caption": "A man is walking down the street with his skis on, wearing a red jacket and a hat. There is a car parked on the street next to him, and he is walking through the snow-covered sidewalk. The man appears to be enjoying the winter weather as he goes about", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 531327, "caption": "A spoon, broccoli, wine glass, fork, dining table, knife, carrot, and other dining utensils are arranged on a table, making for a well-set dining scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "fork", "broccoli", "spoon", "knife", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451228, "caption": "ix and a man are seen walking in the city on a bike, while a bus and a red double-decker bus pass by them. There is also a traffic light on the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "bicycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 256365, "caption": "A delicious cupcake rests on a plate, sitting next to a dining table. The cupcake is a blue and white striped one, topped with frosting. It is surrounded by other cupcakes, creating a visually appealing and inviting presentation.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cake", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 502936, "caption": ", a young man is practicing his tennis skills on a court. He is holding a tennis racket and is about to hit the ball. The court is well-maintained, and there is a net separating the playing area from the spectators. The scene is vibrant and engaging, show", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451449, "caption": "is a cozy scene featuring a dog lying in a bed, surrounded by books and a radio. The dog is likely a small breed, as it is described as being \"a little dog.\" This scene showcases the dog's comfort and relaxation as it lounges in the cozy bed,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["book", "dog", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 51054, "caption": "itting the phone, an elderly woman is sitting on her bed, surrounded by personal belongings. The room is cluttered with a bottle, cup, and a phone.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cup", "bottle", "cell phone", "person", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 1268, "caption": "itting on a bench near the river, a young woman is capturing a moment with her cell phone while a white bird flies by. She is wearing a backpack and a handbag, and there is a boat in the background.", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["boat", "bird", "cell phone", "backpack", "handbag", "person"], "hallucination_idxs": [3], "metrics": {"CHAIRs": 1, "CHAIRi": 0.14285714285714285, "Recall": 1.0}}, {"image_id": 62692, "caption": "- A young woman is playing tennis in a court, using her tennis racket and a ball to hit the ball back and forth.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542771, "caption": "A cozy dining room scene is captured, featuring a person sitting at a dining table surrounded by a potted plant, vase, book, and chair.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "chair", "book", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 575406, "caption": "itting along a sidewalk, a person on a motorcycle is passing a bus while riding on a red and blue-colored scooter. In the background, there is a potted plant and a store. The scene is lively and vibrant, capturing the essence of urban life.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "potted plant", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 244834, "caption": "A couple walks hand in hand, with the man holding an umbrella over the woman, as they stroll along the sidewalk past a street clock. The woman carries a handbag, and they both wear white shirts. In the background, there is a bicycle parked next to", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "potted plant", "bicycle", "car", "clock", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 151159, "caption": "A man is riding a skateboard, while another man is standing on a bench and watching.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 277073, "caption": "itting through the busy city street, a man rides his motorcycle while wearing a backpack and holding onto a helmet. A bicycle and car are also present on the road, demonstrating the diversity of transportation methods used in the urban environment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "car", "motorcycle", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 377879, "caption": ", a large, old-fashioned train and a truck carrying a tank are parked next to each other in a loading area. The tank is being transported on a large flatbed trailer. This image suggests that the train and the tank are part of a military operation, and the tank might be", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "bed", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 436814, "caption": "itting a computer mouse and sitting in a chair, there is a dining table with a laptop and a chair in a small room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "chair", "mouse", "laptop"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 63934, "caption": "itting a skateboard on a sidewalk, a person is riding on the sidewalk while passing by a car and a truck, and there is a man walking in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 275611, "caption": "itting a pizza on a dining table, a person is holding a knife and slicing the pizza.", "mscoco_hallucinated_words": [["person", "person"]], "mscoco_gt_words": ["dining table", "pizza", "knife"], "hallucination_idxs": [9], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 516248, "caption": "A person is sitting at a dining table, using a laptop and keyboard while watching a TV. They have a mouse on the table and a keyboard on their lap. This scene shows a common activity where people use technology while enjoying their free time, such as watching TV or browsing the internet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "mouse", "keyboard", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 478380, "caption": "itting sheep in a field, a person is driving a car down a country road, surrounded by a large group of sheep. The sheep are closely bunched together, and the car has a white passenger seat. The scene may be taken from a bus or a tourist vehicle, providing a unique view for the passengers.", "mscoco_hallucinated_words": [["seat", "chair"]], "mscoco_gt_words": ["sheep", "person", "car"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 422326, "caption": "In this image, a group of people can be seen enjoying a day outdoors. A truck is parked in the background, while a person is riding a bicycle. To the left, a car is parked next to a motorcycle. This diverse group of vehicles and activities highlights", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "car", "motorcycle", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 110449, "caption": "A person sits at a dining table, with a laptop and a remote control in front of them. A TV is in the background, and there is a music player on the table as well. The scene suggests a comfortable and cozy setting for relaxing and enjoying some entertainment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "remote", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 45550, "caption": "itting his mouth with a sandwich, the man in the image is captured with a bowl and a clock in the background. This scene might evoke a sense of nostalgia, as the man is enjoying his lunch in a casual setting, with a simple meal and a timepiece nearby", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "sandwich", "person", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 444913, "caption": "itting a frisbee on the grass near a parked car, a man stands in the background, capturing a moment of fun and leisure in the outdoors.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "car", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 82709, "caption": "A dog is flying in the air with a frisbee in its mouth, while a person watches from the sidelines. The person is holding a backpack and a bottle, and there is a cup on the ground. This scene suggests that the person is enjoying a game of frisbe", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "bottle", "cup", "backpack", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 132796, "caption": "The person is sitting on a bench while an elephant is riding on a chair, wearing a backpack.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "bench", "chair", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 303099, "caption": "itting on the toilet, two cats sit next to a bottle and a cat sits on top of the toilet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "toilet", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 271565, "caption": "everal motorcyclists, a truck, and a car are all gathered on a city street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 205193, "caption": "The image features a person, toaster, microwave, and oven, all of which are present in the kitchen. The person is standing near the stove, and the toaster is on the counter.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "toaster", "person", "microwave"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 272566, "caption": "itting on a bench, a man is enjoying his snacks, consisting of bananas and apples. He is surrounded by baskets filled with fresh fruits, and a cup is nearby, possibly containing drinks. The setting seems to be a peaceful and comfortable spot where he can relax and indul", "mscoco_hallucinated_words": [["bench", "bench"]], "mscoco_gt_words": ["apple", "person", "banana", "cup"], "hallucination_idxs": [3], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 1.0}}, {"image_id": 170099, "caption": "itting in front of a couch, a man wearing a tie and orange clothing is engaged in a conversation with an unknown person.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["couch", "orange", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 14940, "caption": "A cute orange cat cuddles up with a stuffed animal dog on a couch, creating a cozy and adorable scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "couch", "chair", "orange", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 408449, "caption": "A young man is standing in the middle of a city street, wearing a backpack and skateboarding. He is surrounded by a truck and a car, and is about to cross the street. A person is walking by with a handbag, and a bus is also visible in the scene.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "truck", "skateboard", "car", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7142857142857143}}, {"image_id": 140702, "caption": "An elephant, a small baby elephant, and a person are walking together in a grassy field. They are in close proximity to each other, with the elephant and the person holding hands. The baby elephant is adorable, and the scene is heartwarming.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bird", "elephant", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 354540, "caption": "A man is sitting on a chair, with his laptop on a table, next to a toaster. He is holding a cell phone and has a microwave and an oven nearby.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["laptop", "microwave", "chair", "cell phone", "oven", "person", "toaster"], "hallucination_idxs": [13], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 19763, "caption": "A person is sitting at a dining table, enjoying a delicious slice of pizza and a cup of beer. The pizza is cut into a slice and placed on a plate, with a fork and knife nearby. A person is present, adding a human element to the scene. This image capt", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "fork", "pizza", "chair", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 191301, "caption": "itting with a spoon, a person is enjoying a delicious cake on a dining table, accompanied by a cup of coffee. The cake is served on a white plate, and the cup is filled with a tasty beverage. This scene suggests a cozy and satisfying dining experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "spoon", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 124236, "caption": "itting in the kitchen, we see a variety of dining utensils, including a spoon, bowl, oven, dining table, and toaster. These items are strategically placed in the room, likely for easy access and organization. The presence of a dining table suggests that the kitchen may", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "spoon", "oven", "toaster"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 344930, "caption": "In the image, we can see an orange fire hydrant, a car, and a person. The fire hydrant is located on the side of a road, and the car is parked next to it. The person is standing near the hydrant, looking at it. The presence of a fire hydrant", "mscoco_hallucinated_words": [["person", "person"], ["person", "person"]], "mscoco_gt_words": ["orange", "fire hydrant", "car"], "hallucination_idxs": [17, 41], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2222222222222222, "Recall": 1.0}}, {"image_id": 14285, "caption": "itting on the bed, a cat is lounging in the bedroom, surrounded by a TV, a remote, a chair, and a book. The cozy setting indicates a relaxing and comfortable environment, where the cat and its owner can spend time together, possibly watching TV or reading a book. The", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["remote", "tv", "chair", "cat", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 82488, "caption": "itting, there is a dining table, a bed, and a clock in the room. The orange color adds a pop of vibrancy to the space. Additionally, there is a remote control and a book present. This cozy room appears to be a comfortable and inviting space where one can relax,", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "remote", "clock", "orange", "book", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 348251, "caption": "is a man wearing a backpack and skiing down a snowy hill. He is holding a ski pole in one hand and appears to be enjoying his skiing experience.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 150703, "caption": "itting at the dining table, there is a delicious sandwich and a hot dog in front of a cup. The table is filled with food and drinks, making it a perfect setting for a satisfying meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "sandwich", "hot dog", "cup"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 381615, "caption": "itting in the living room, there is a vase, a chair, a book, a cup, and a potted plant. The potted plant is placed on a wooden table, adding a touch of greenery to the room. The vase and the chair are likely used for display purposes, while the book", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["cup", "potted plant", "vase", "chair", "book"], "hallucination_idxs": [33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1, "Recall": 1.0}}, {"image_id": 352703, "caption": "A woman is cooking at a dining table, holding a spoon and using a bowl to prepare a meal. She is also carrying a handbag and a bottle. The oven is nearby, and there are cups on the table. The scene is set in a kitchen, where the woman", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bottle", "cup", "spoon", "oven", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 111109, "caption": "A young boy is sitting on a chair in front of a TV, wearing a tie and holding a motorcycle. This image captures a unique combination of two seemingly unrelated objects: a TV and a motorcycle. The boy's presence in the scene adds a personal touch to the image, making", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["tv", "tie", "chair", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 226802, "caption": ", we see a person riding a motorcycle, a car, and a bus on the street. The scene appears to be busy and active, with the presence of multiple vehicles and people. The person riding the motorcycle is likely enjoying the freedom and thrill of traveling on their own, while the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 49005, "caption": "portsman in the foreground is holding a baseball bat and wearing a baseball glove, while another person is at the back of the image. They are possibly playing baseball on a field with a baseball diamond, with a baseball field backdrop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 410114, "caption": "ever seen a bizarre sight on the road? A truck, a person, and a group of cows and horses are walking down the street together. The sight is quite unexpected, but it could be a unique experience for the person walking down the street and the animals accompanying them. The presence of", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "person", "horse", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 100274, "caption": "itting the image, we can see a truck, a train, and a car parked in a large area.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["truck", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 317254, "caption": "I'm on a busy city street, riding my motorcycle while weaving in and out of traffic, I see a car approaching from behind. I'm also wearing a helmet to ensure safety.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["motorcycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 425569, "caption": "itting the sink, bowl, and oven, the kitchen is well-equipped with various utensils and tools for cooking and preparing meals. The sink, bowl, and oven serve different functions, allowing the user to efficiently wash, prepare, and cook their food. The presence", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["oven", "bowl", "sink"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 28842, "caption": "itting on a skateboard, a person is surrounded by a green grass field with a dog nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 319607, "caption": "A person rides a bicycle while wearing a backpack, passing a traffic light near a bus stop in a busy city street. The image captures a moment of everyday life in a bustling urban environment, where people are on the move, either on foot or by bike, and", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "bus", "bicycle", "car", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 210372, "caption": "itting in the kitchen, I see a sink, refrigerator, and a bottle on the counter. The sink is dirty, indicating that it may have been used recently. The refrigerator is also empty, with no visible food or drinks inside. The bottle on the counter could be a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "sink", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 156915, "caption": "The vase is filled with a red rose, which is placed on a dining table. A potted plant is also present, adding some greenery and color to the scene. The image captures a warm and inviting atmosphere, with the red rose and potted plant complementing each other well.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 349642, "caption": "itting on the bench, a dog sits next to a person. The bench is made of wood, and it is located on a cobblestone street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 423008, "caption": "A woman is standing by a river, holding a handbag, while a boat is tied to a dock. A suitcase and an umbrella are also nearby. The woman seems to be enjoying the view and possibly planning a trip, as she is holding a handbag.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "boat", "person", "handbag", "suitcase"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 45435, "caption": "itting her cell phone, a woman holds up a white cup, possibly filled with a hot beverage, while sitting at a desk in an office.", "mscoco_hallucinated_words": [["desk", "dining table"]], "mscoco_gt_words": ["cell phone", "person", "cup"], "hallucination_idxs": [24], "metrics": {"CHAIRs": 1, "CHAIRi": 0.25, "Recall": 1.0}}, {"image_id": 35827, "caption": "In this image, we see a person sitting at a dining table, surrounded by various items such as a laptop, a cup, and a bottle. The person is using a laptop, and the bottle and cup suggest that they might be taking a break or enjoying a drink. The presence of a chair", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "bottle", "cup", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 72656, "caption": "A group of young boys are sitting around a small dining table, eating pizza while enjoying their time together. They are using a colorful chair and a wooden table, and there is a bowl filled with pizza on the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "pizza", "chair", "suitcase", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 372087, "caption": "itting a suitcase on a bench, a person is waiting at the train station, and a train is visible in the background.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "bench", "suitcase", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 336209, "caption": "itting on a skateboard, a young man is jumping over a bench in an old, abandoned building.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["skateboard", "person", "bench"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 46031, "caption": "A modern living room features a dining table, a laptop, a cell phone, and a mouse on top of the table, while a TV and a keyboard are nearby. The room is well-equipped for both work and leisure activities, providing ample space for relaxation and entertainment.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "tv", "mouse", "cell phone", "keyboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 283118, "caption": "Caption: A family of three enjoying a meal together at a dining table, with a backpack and handbag on the floor. The father and two children are sitting at the table, while a car is parked outside. A bottle and pizza are also present on the table. The father", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "car", "pizza", "chair", "cell phone", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.7777777777777778}}, {"image_id": 429598, "caption": "ink, a refrigerator, a bottle, and an oven are all visible in this kitchen. The sink and stove are also present, adding to the functional kitchen space.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "oven", "sink", "bottle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 333365, "caption": ", I see a truck, a red airplane, a car, and a person all parked at the airport. The person could be a traveler or an airport worker, while the truck and the airplane are likely involved in transporting passengers or cargo. The presence of a car indicates that", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 424174, "caption": "itting on the dining table, the bowl is filled with fresh fruits, including bananas, apples, oranges, and other fruits. The bowl sits on a wooden dining table, which is covered with a tablecloth.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "banana", "apple", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 3035, "caption": "A large, white airplane sits on the tarmac, surrounded by various vehicles, including a truck, a car, and a person. The image captures the bustling airport scene with the airplane waiting for its next flight, while the vehicles and the person provide a sense of movement", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["airplane", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 177213, "caption": "itting through the dining table, the person holds a pizza with a fork and a knife, while a cup of drink is also present on the table. This image captures a moment of enjoying a delicious meal with friends or family, as they share a pizza and a cup of beverage", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "cup", "fork", "pizza", "knife", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 438535, "caption": "A man is walking down the sidewalk, carrying a backpack while looking at a clock. There is a car parked on the street and a building with a clock tower visible in the background. The man seems to be engrossed in his surroundings, possibly contemplating his next move or decision", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["backpack", "clock", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 490711, "caption": ", a white refrigerator is standing in a kitchen with a sink, a counter, and a banana on it. The refrigerator has a door open, revealing its contents, and there is a sink nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["refrigerator", "sink", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 163728, "caption": ", the cat is laying on the dining table, and there is a cup on the table as well. The cat is using a mouse as a pillow, and a laptop is nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "cup", "mouse", "cat"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 456756, "caption": "A giraffe stands tall and proud in the zoo, while a person is standing next to it, admiring its magnificent height. The giraffe is in a fenced enclosure, which is surrounded by other animals, including zebras. This scene captures the awe and wonder of the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["zebra", "person", "giraffe"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 281028, "caption": ", a person is sitting in a train car, possibly enjoying the scenery outside the window. The train is traveling down a track, and a car is parked nearby. The presence of a chair and the view from the window suggest that the person is taking a leisurely ride or a short break", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "car", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 343937, "caption": "In the image, there is a person wearing a ski mask and a helmet riding a snowboard down a snowy slope. The person is in a helmet and ski gear, indicating that they are skiing or snowboarding. The presence of skis in the image further suggests that the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["snowboard", "person", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 277051, "caption": "itting on the table, the two birds sit on the dining table, looking at the knife and the bottle of wine. They seem to be enjoying their meal, and the presence of the knife and bottle indicates that the table might be a special occasion, such as a dinner party or a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bottle", "bird", "chair", "knife"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 145666, "caption": "itting the bowl with broccoli, carrot, and other vegetables, the dining table is filled with a delicious and healthy meal.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 304984, "caption": "itting on a table, the plate is filled with a delicious sandwich, a glass of wine, and a bowl of cake. The presence of an orange and a fork suggests that the meal is well-balanced and satisfying.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["sandwich", "bowl", "wine glass", "fork", "orange", "cake"], "hallucination_idxs": [3], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 0.8333333333333334}}, {"image_id": 122467, "caption": "is holding a Nintendo Game Boy in his hand, with a remote control in the other hand, and a couch behind him. He is seated on a dining table, possibly enjoying a gaming session with a friend or family member.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "couch", "remote", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 518287, "caption": "A toothbrush, a cup, and an orange are displayed on a dining table, creating a visually appealing arrangement.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "orange", "cup", "toothbrush"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 145448, "caption": "A person sits at a dining table with a cat on the back of their motorcycle, and a man and another person are present at the table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "car", "chair", "cat", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 352900, "caption": "itting through a bowl of steaming hot pasta, a knife is placed on top of a fork, and a plate is filled with a generous portion of broccoli.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["fork", "knife", "bowl", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 153607, "caption": "The image features a blue vase, a bowl, an orange flower, and a dining table. The blue vase adds a pop of color to the room, and the bowl provides a unique and visually appealing touch to the dining table. The orange flower adds a splash of warmth", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "vase", "orange"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 157581, "caption": "A person is sitting at a dining table with a wine glass, a bottle, and a cup. They are enjoying a glass of wine, possibly from the bottle next to them.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "cup", "bottle", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 227438, "caption": "itting through the bedroom door, you can see a bed, a couch, and a clock.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "chair", "couch", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 116037, "caption": "itting the baseball glove, a person is preparing to catch a baseball as another person is about to swing a bat.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "baseball bat", "baseball glove"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.5}}, {"image_id": 202658, "caption": "itting a toilet, a pink Hello Kitty toilet seat is seen in a bathroom setting. Next to the toilet, a bowl sits on a chair, and there is a toilet paper holder on the wall. The overall atmosphere of the bathroom is clean and", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "bowl", "toilet"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 18150, "caption": "A young boy sits on a couch, enjoying a slice of pizza while his dad holds a pizza box and a bottle. The boy is wearing a blue hoodie and eating the pizza with a big smile, while the dad looks happy to be sharing this meal", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "couch", "pizza", "backpack", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 418711, "caption": "A man rides his bicycle while carrying a large bag of bananas on the back, while a truck passes by on the road.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["banana", "bicycle", "car", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8}}, {"image_id": 92107, "caption": "A spoonful of creamy broccoli soup sits in a bowl, accompanied by a green spoon on a white dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "broccoli", "spoon"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 491000, "caption": "A man is walking his dog on the sidewalk, while a person is carrying a handbag. In the background, there is a car and a motorcycle parked on the street.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "car", "motorcycle", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401163, "caption": "itting in the park, a person and a dog sit on a bench, enjoying the beautiful green surroundings. The park is well-maintained, with neatly trimmed grass and trees, creating a serene environment for relaxation and bonding between the human and their pet.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 39663, "caption": "A student is sitting on the floor surrounded by various items such as a backpack, scissors, laptop, cell phone, and a book. This image shows that the student is likely preparing for school or studying, as they have all the necessary tools for their academic activities. The backpack suggests that the student", "mscoco_hallucinated_words": [["student", "person"], ["student", "person"], ["student", "person"]], "mscoco_gt_words": ["laptop", "scissors", "cell phone", "backpack", "book"], "hallucination_idxs": [1, 32, 58], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 268293, "caption": "ports player is holding a tennis racket and swinging it while preparing to hit the ball during a game.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 474499, "caption": "A spoon rests on a countertop next to a vase, while a clock is hanging from the wall above. The kitchen sink is filled with water, and a bottle is placed on top of the sink. In the background, an oven is visible, and an apple is sitting on a kitchen", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "vase", "clock", "apple", "sink", "spoon", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 358086, "caption": ", a group of friends are gathered around a frisbee on the field, with one woman playing it while another friend watches from the chair. The woman wearing a backpack and a green shirt is also present. A bottle is visible on the ground, and a person is seen standing nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bottle", "chair", "backpack", "frisbee", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 89541, "caption": "A colorful and delicious spread of food is set on a dining table. A bowl full of bananas, a cake, and a dessert plate with a variety of sweet treats like pies and cookies are placed on the table. This meal offers a diverse selection of flavors and text", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "banana", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 239445, "caption": ", the young man is holding a frisbee while smiling. He appears to be enjoying the outdoor activity and is excited about playing the sport. The presence of the sports ball in his hand suggests that he is ready to play or engage in a game of frisbee.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sports ball", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 105782, "caption": "The image shows a dog and a cat in a field, with a car parked nearby. A kite is flying in the sky, and a car is parked in the distance.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cat", "dog", "kite", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 487990, "caption": "The image features a group of people enjoying a meal, including a woman cutting a cake with a knife, while a man is holding a spoon and a fork. They are seated at a dining table surrounded by various items such as a bowl, bottle, cup, and a don", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "bottle", "cup", "fork", "donut", "spoon", "knife", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.9}}, {"image_id": 138834, "caption": "", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "skis", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 3.0}}, {"image_id": 492057, "caption": "A man is sitting in a bathroom, looking at his cell phone while wearing a tie.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["cell phone", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 388215, "caption": "In the image, a young man is playing tennis on a court, with a tennis racket and a ball. He is swinging the racket to hit the ball, and the image is taken from a bird's eye view.", "mscoco_hallucinated_words": [["bird", "bird"]], "mscoco_gt_words": ["sports ball", "person", "tennis racket"], "hallucination_idxs": [39], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 111409, "caption": ", a man rides his motorcycle down a long, winding road, while a bicycle is parked next to him. The man is wearing a helmet to ensure his safety, and the motorcycle's color adds a touch of style to the image. The presence of the bicycle", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bicycle", "motorcycle", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 451391, "caption": "The person holding a bunch of broccoli is posing in front of a dining table, giving the impression that they are proud of their fresh produce.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 92462, "caption": "A group of people are walking down a street in an African village, and a car is parked on the side. In the middle of the street, there is a person carrying a backpack and a cow. The scene shows the contrast between the urban and rural environments, as the person with the backpack is", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "person", "backpack", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 397268, "caption": "A bicycle, a chair, a potted plant, a bench, and a green couch are sitting outside a window display, showcasing a cozy and inviting atmosphere. The green couch is particularly eye-catching, adding a pop of color to the scene. The presence of", "mscoco_hallucinated_words": [["couch", "couch"], ["couch", "couch"]], "mscoco_gt_words": ["chair", "potted plant", "bench", "bicycle"], "hallucination_idxs": [16, 33], "metrics": {"CHAIRs": 1, "CHAIRi": 0.3333333333333333, "Recall": 1.0}}, {"image_id": 127659, "caption": "A person is sitting on a bench, holding a leash while a dog is standing next to them. There is a train passing by, and a woman is walking in the background. This scene is calm and peaceful, showcasing the bond between the dog and the person as they enjoy the scenery", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "dog", "person", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 251857, "caption": ", a person is skateboarding in the middle of a city street, surrounded by a busy traffic light and cars. The skateboarder's movements are captured in a moment of excitement and skill as he maneuvers through the urban environment, demonstrating the thrill and challenge of skateboarding", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "skateboard", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 243212, "caption": ", a young boy is seen holding a frisbee while standing in a grassy area. A truck and a car are parked nearby, and a dog is also present. The image captures a moment of fun and playfulness as the boy enjoys playing with the frisbee in a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dog", "car", "frisbee", "truck", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 444304, "caption": "A person is sitting on a bed, watching a TV. They are holding a remote control and appear to be engaged in the activity. The television is on, and the scene is set in a room with a bed and a television.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "remote", "tv", "bed"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 305000, "caption": ", a woman stands behind a table displaying various fruits, including bananas, apples, and vegetables. She is wearing a black and white striped shirt and a black hat. The table is likely a market or a fruit stand, showcasing a variety of fresh produce for customers to choose from", "mscoco_hallucinated_words": [["table", "dining table"], ["table", "dining table"]], "mscoco_gt_words": ["apple", "person", "banana"], "hallucination_idxs": [6, 34], "metrics": {"CHAIRs": 1, "CHAIRi": 0.4, "Recall": 1.0}}, {"image_id": 44136, "caption": "itting a toast, a man in a black and white tuxedo, and a person in a suit sit at a dining table. They are enjoying a glass of wine and a bottle of wine, respectively.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "wine glass", "bottle", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6}}, {"image_id": 154846, "caption": ", there is a pizza on a dining table, being eaten by a person. The pizza appears to be a delicious combination of various toppings, including cheese and pepperoni. The person is enjoying their meal, and the image captures the moment of satisfaction and enjoyment of", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "pizza"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 567254, "caption": "itting my hair, a toddler sits in front of the TV while holding a vase, watching a person on TV. The room has a nice bottle on the table.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["vase", "person", "tv", "bottle"], "hallucination_idxs": [31], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 79261, "caption": "A man is walking in a fountain with a backpack on his back, holding a handbag with one hand and a clock in the other.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["clock", "handbag", "person", "backpack"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 301402, "caption": "A young man sits at a dining table, surrounded by a potted plant and a laptop, while a bottle and a chair are also present. This scene suggests a comfortable and peaceful environment, where the person can relax and enjoy their time spent online or engaging in other activities.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "bottle", "potted plant", "chair", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 415828, "caption": "A woman, possibly a tourist, is standing in front of a red traffic light near a car, while holding a handbag and wearing a backpack. The woman is waiting for the traffic light to turn green before crossing the street, accompanied by a person with an umbrella. The scene is set in", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "traffic light", "stop sign", "car", "backpack", "handbag", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 543231, "caption": "itting in front of a television, a woman holds a remote control and focuses on a bowl of food on a bed, while she reads a book.", "mscoco_hallucinated_words": [["television", "tv"]], "mscoco_gt_words": ["bowl", "remote", "book", "person", "bed"], "hallucination_idxs": [5], "metrics": {"CHAIRs": 1, "CHAIRi": 0.16666666666666666, "Recall": 1.0}}, {"image_id": 495357, "caption": "itting her handbag, a woman is walking down a city street with her small dog. Behind her, a man rides a motorcycle, and they are both looking at the woman's handbag.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["handbag", "motorcycle", "dog", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 523175, "caption": "itting a spoon into a bowl filled with carrots, on a dining table.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "spoon", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 184945, "caption": "itting the city street, a motorcycle and several bicycles are parked next to a potted plant. This scene captures the contrast between the urban landscape and the natural elements, creating a unique and visually appealing image.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "motorcycle", "bicycle"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 49942, "caption": ", a young man is holding an umbrella over his head, wearing a tie and suit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["umbrella", "tie", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 223019, "caption": "As a young woman, she navigates through a busy train station with her handbag, suitcase, and backpack while wearing a stylish outfit.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["car", "person", "backpack", "handbag", "suitcase", "train"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8333333333333334}}, {"image_id": 42889, "caption": "itting teddy bear sits atop a keyboard in a blue room.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bear", "keyboard", "teddy bear"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 22112, "caption": "In this image, a blue and white ceramic vase sits on top of a dining table, surrounded by a potted plant and a bowl. The presence of these items in the same space creates a visually appealing and harmonious atmosphere, with the vase and bowl adding a", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "potted plant", "vase", "bowl"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 247587, "caption": "A man is riding a surfboard in the sandy beach area while a car is parked nearby.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["surfboard", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 171962, "caption": "A man is walking down the sidewalk while a bicycle and a car are passing by him. A bus is parked in the distance, and a person is waiting at a bus stop.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bus", "bicycle", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.75}}, {"image_id": 545056, "caption": ", a group of animals is gathered together in a field. A black and white dog is standing in the middle of the field, while a car is parked nearby. In the distance, there are cows, sheep, and other animals grazing in the lush green grass. The scene captures the beauty of", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["sheep", "dog", "car", "cow"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 401411, "caption": "A chef stands in front of a dining table, holding a bowl and a carrot while surrounded by various ingredients and utensils. The image suggests that the chef is preparing a meal or demonstrating cooking techniques. The presence of a book and a cup adds an interesting element to the", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "cup", "book", "oven", "person", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.8571428571428571}}, {"image_id": 2867, "caption": ", a group of people is gathered around a backpack, possibly in a snowy environment, with skis and skiing gear nearby. They appear to be preparing for an outdoor adventure, possibly in a ski resort or a snow-covered mountain.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "backpack", "skis"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 246963, "caption": ": A group of motorcyclists rides down the road, passing by a stop sign, while a person is walking on the sidewalk.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["person", "motorcycle", "stop sign"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 498225, "caption": "itting her handbag, a woman is talking on her cell phone while standing in front of a bakery shop. Her friend is looking at her handbag, and the woman has a donut and a cake in her hands. The scene suggests that they might be discussing a recent purchase or deciding what", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["donut", "cell phone", "handbag", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 124601, "caption": "Coffee and laptop in the kitchen.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "laptop", "wine glass", "bottle", "cup", "microwave", "sink", "keyboard"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.125}}, {"image_id": 581749, "caption": "Broccoli and orange are sitting on a dining table, surrounded by other items.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "orange", "broccoli"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 227960, "caption": "itting on the table, a bowl of steaming vegetables, a spoon, a wine glass, and a cup are present. The vegetables include carrots, which are a nutritious and delicious addition to a meal. The presence of the spoon and the cup suggests that the food", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "wine glass", "cup", "spoon", "carrot"], "hallucination_idxs": [3], "metrics": {"CHAIRs": 1, "CHAIRi": 0.125, "Recall": 1.0}}, {"image_id": 286503, "caption": ", a person is standing near a sports ball in front of a large elephant. The elephant is standing in a pen or enclosure, which may be part of a zoo or wildlife park. The person is looking at the elephant and possibly interacting with it, while the sports ball is", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["elephant", "sports ball", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 315740, "caption": "In this image, a living room is filled with various elements, including a bowl, chair, sink, TV, couch, potted plant, microwave, oven, and other appliances. The room is well-equipped and inviting, with the bowl and chair providing a co", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bowl", "tv", "potted plant", "microwave", "couch", "chair", "sink", "oven"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 142108, "caption": "A group of people is sitting around a dining table, enjoying a meal of bananas. One person is eating a banana while another is peeling it. The scene appears to be casual and relaxed, suggesting that the group is enjoying a tasty and healthy snack", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "person", "banana"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 542933, "caption": "itting baseball, a player is standing on a bench, with a baseball glove on their hand and a baseball bat in their hand. They are ready to hit a baseball, and the crowd is watching from the stands.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["bench", "sports ball", "baseball glove", "chair", "baseball bat", "person"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 475043, "caption": "itting on the couch, a person is enjoying a snack from a bowl while using a laptop on the table. A spoon is placed next to the bowl, and a bottle of beer is on the couch.", "mscoco_hallucinated_words": [["table", "dining table"]], "mscoco_gt_words": ["bowl", "laptop", "bottle", "couch", "spoon", "person"], "hallucination_idxs": [20], "metrics": {"CHAIRs": 1, "CHAIRi": 0.1111111111111111, "Recall": 1.0}}, {"image_id": 456784, "caption": ", the man is cutting a cake with a knife.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["knife", "person", "cake"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 33753, "caption": ", a man sits on a bench in the park, holding a remote and watching TV on the bench.", "mscoco_hallucinated_words": [["man", "person"]], "mscoco_gt_words": ["chair", "bench", "remote", "tv"], "hallucination_idxs": [2], "metrics": {"CHAIRs": 1, "CHAIRi": 0.2, "Recall": 0.75}}, {"image_id": 448694, "caption": "itting a frisbee, a group of people are enjoying a fun outdoor activity together.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["chair", "person", "frisbee"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 0.6666666666666666}}, {"image_id": 99053, "caption": "A bowl of delicious broccoli, carrot, and other vegetables sits on a dining table, accompanied by a fork.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["dining table", "bowl", "fork", "broccoli", "carrot"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 503210, "caption": "A cozy living room with a clock, couch, potted plant, chair, vase, orange, book, and other decorative elements. The room has a warm and inviting atmosphere, showcasing a comfortable living space where one can relax and enjoy their time.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["potted plant", "vase", "clock", "couch", "chair", "orange", "book"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}, {"image_id": 142257, "caption": "ever-present traffic, a red truck, a car, and a person at a traffic light make for a busy intersection.", "mscoco_hallucinated_words": [], "mscoco_gt_words": ["traffic light", "truck", "person", "car"], "hallucination_idxs": [], "metrics": {"CHAIRs": 0, "CHAIRi": 0.0, "Recall": 1.0}}], "overall_metrics": {"CHAIRs": 0.1557377049180328, "CHAIRi": 0.03623718887262079, "Recall": 0.9012059369202227, "num_hallucinated_caps": 76.0, "num_caps": 488.0, "hallucinated_word_count": 99.0, "coco_word_count": 2732.0, "length_response": 42.42827868852459, "hallucinated_caps_ls": [95908, 401027, 253630, 161227, 343534, 2154, 104326, 213905, 344942, 498747, 261097, 348654, 482432, 454161, 129707, 451943, 37907, 506401, 368367, 295656, 401010, 122602, 24104, 527497, 265628, 499594, 5934, 245664, 357578, 443299, 120776, 219546, 386645, 437720, 32190, 138078, 180764, 249969, 14773, 134768, 526414, 273132, 191691, 159979, 217593, 477016, 475813, 359164, 539951, 230983, 381031, 163518, 423984, 302789, 88269, 536154, 353320, 368335, 1268, 275611, 478380, 272566, 354540, 344930, 381615, 45435, 304984, 39663, 388215, 397268, 305000, 567254, 543231, 227960, 475043, 33753]}}}