Configuration 1: xlnet on glove with cosine distance
Configuration 2: bert on glove with cosine distance
Testing on: hard queries for the flight_delay schema
Entries below have greater num_guesses with a threshold of 3
--------------------------------------------------
--------------------------------------------------

Query (#1): predict the average delay caused by weather for each airline where flight will start in next two days
Ground Truth (attribute): WEATHER DELAY

Annotation 1: [{'text': 'delay caused by weather', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'weather', 'confidence': 0.9999998211860657}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#2): predict the average delay caused by weather for each flight of Emirates airline where flight will start in next two days
Ground Truth (attribute): WEATHER DELAY

Annotation 1: [{'text': 'average delay caused by', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999994039535522}, {'text': 'weather', 'confidence': 0.9999996423721313}, {'text': 'of', 'confidence': 0.9767034649848938}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#3): predict the average delay caused by the tornedo for each flight of Quatar airline where flight will start in next two days
Ground Truth (attribute): WEATHER DELAY

Annotation 1: [{'text': 'average delay', 'confidence': 0.9999999403953552}, {'text': 'the torned', 'confidence': 0.9999994436899821}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999995231628418}, {'text': 'tornedo', 'confidence': 0.9754388531049093}]

Configuration 1 took 4 attempt(s) to get the correct answer
Configuration 2 took 3 attempt(s) to get the correct answer

--------------------------------------------------

Query (#4): predict the average delay due to bad weather for each flight of Quatar Airlines where flight will start in next two days
Ground Truth (attribute): WEATHER DELAY

Annotation 1: [{'text': 'average delay due to bad', 'confidence': 0.9999999403953552}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999995231628418}, {'text': 'weather', 'confidence': 0.9999999403953552}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#5): I want to know the average delay for aircraft for the flights of Air Emirates which will start next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'average delay', 'confidence': 0.999999612569809}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'aircraft', 'confidence': 0.6063679456710815}]

Configuration 1 took 4 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#6): can you tell me the average aircraft delay for Quatar Airlines where tail number starts from 4500 and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999996721744537}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#7): Predict the average elapsed time for all flights which will start from Dallas Airport and expected elapsed time is less than six hours
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'elapsed time', 'confidence': 1.0}]
Annotation 2: [{'text': 'elapsed time', 'confidence': 0.9134460836648941}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#8): I want to predict the expected elapsed time of all Air Emirates Airlines flights where flight number is in between 3400 and 3500 and they will start tomorrow
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'elapsed time of all Air Emirates Airlines flights', 'confidence': 0.9999997073953802}]
Annotation 2: [{'text': 'elapsed time of all', 'confidence': 0.9958430429299673}]

Configuration 1 took 12 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#9): predict how many flights will get cancelled which was supposed to start from New York for next week
Ground Truth (attribute): CANCELLED

Annotation 1: [{'text': 'to start', 'confidence': 0.42805470526218414}, {'text': 'New York', 'confidence': 0.9338990747928619}]
Annotation 2: [{'text': 'flights', 'confidence': 0.999997615814209}, {'text': 'cancelled', 'confidence': 0.6879050731658936}]

Configuration 1 took 12 attempt(s) to get the correct answer
Configuration 2 took 3 attempt(s) to get the correct answer

--------------------------------------------------

Query (#10): predict how many flights of Quatar Airlines will get cancelled for covid-19 for next month
Ground Truth (attribute): CANCELLED

Annotation 1: [{'text': 'many flights of Quatar Airlines will get cancelled', 'confidence': 0.9999999271498786}]
Annotation 2: [{'text': 'flights of Quatar', 'confidence': 0.9142798185348511}, {'text': 'cancelled', 'confidence': 0.29966166615486145}]

Configuration 1 took 3 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#11): I wanna know how many flights will be cancelled for each airline for the tornedo that is coming within tomorrow
Ground Truth (attribute): CANCELLED

Annotation 1: [{'text': 'many flights will be cancelled', 'confidence': 0.9999985218048095}]
Annotation 2: [{'text': 'flights', 'confidence': 0.9999998807907104}, {'text': 'cancelled', 'confidence': 0.9999679327011108}]

Configuration 1 took 3 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#12): predict the average change is duration of flights for the Quatar Airlines which will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.9996417164802551}, {'text': 'duration of flights', 'confidence': 0.9999998013178507}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#13): predict the average change is duration of flights for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.993069589138031}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#14): predict the average departure delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#15): predict the average arrival delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9964969158172607}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#16): predict the average air system delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9979952573776245}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#17): predict the average security delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#18): predict the average airline delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#19): predict the average late aircraft delay for the Hainan Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999879002571106}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#20): predict the average change is duration of flights for the Quatar Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.9985328316688538}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 15 attempt(s) to get the correct answer
Configuration 2 took 15 attempt(s) to get the correct answer

--------------------------------------------------

Query (#21): predict the average departure delay for the Quatar Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'Q', 'confidence': 0.932946503162384}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#22): predict the average arrival delay for the British Airways which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.8950702846050262}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#23): predict the average air system delay for the Singapore Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9924404621124268}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#24): predict the average security delay for the Emirates Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#25): predict the average airline delay for the Emirates Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#26): predict the average late aircraft delay for the Emirates Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999963939189911}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#27): predict the average change is duration of flights for the Quatar Airlines which will have scheduled time is in between 7 Am and 11 Am in central time and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9985582679510117}]
Annotation 2: [{'text': 'change', 'confidence': 0.9962142705917358}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#28): predict the average departure delay for the Quatar Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9971211850643158}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998211860657}, {'text': 'Q', 'confidence': 0.9909868836402893}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#29): predict the average arrival delay for the British Airways which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9988486170768738}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9908137023448944}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#30): predict the average air system delay for the Singapore Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9980480372905731}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9988801181316376}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#31): predict the average security delay for the Emirates Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9985719621181488}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#32): predict the average airline delay for the Emirates Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9982976913452148}]
Annotation 2: [{'text': 'delay', 'confidence': 0.999999463558197}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#33): predict the average late aircraft delay for the Emirates Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9948686808347702}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999980330467224}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#34): predict the average change is duration of flights for the Quatar Airlines which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.992922306060791}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#35): predict the average departure delay for the Quatar Airlines which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998509883881}, {'text': 'Q', 'confidence': 0.7371076345443726}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#36): predict the average arrival delay for the British Airways which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.994181901216507}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#37): predict the average air system delay for the Singapore Airlines which will have flight duration more than four hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9940408766269684}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#38): predict the average security delay for the Emirates Airlines which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#39): predict the average airline delay for the Emirates Airlines which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999992847442627}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#40): predict the average late aircraft delay for the Emirates Airlines which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999909996986389}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#41): predict the average change is duration of flights for the Quatar Airlines with security delay more than five minutes for next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.997955322265625}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#42): predict the average departure delay for the Quatar Airlines which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'Q', 'confidence': 0.557557225227356}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#43): predict the average arrival delay for the British Airways which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9994996190071106}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#44): predict the average air system delay for the Singapore Airlines which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9956287145614624}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#45): predict the average security delay for the Emirates Airlines which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#46): predict the average airline delay for the Emirates Airlines which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999991059303284}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#47): predict the average late aircraft delay for the Emirates Airlines which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.999988466501236}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#48): predict the average length of the flights which will start from Atlanta International Airport with scheduled departure time between 6 AM and 3 PM for next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'length of the flights which', 'confidence': 0.9999393939971923}, {'text': 'start', 'confidence': 0.9857444167137146}, {'text': 'International Airport with scheduled departure time', 'confidence': 0.999814381202062}]
Annotation 2: [{'text': 'length of the flights', 'confidence': 0.9999998807907104}, {'text': 'start', 'confidence': 0.5589728355407715}]

Configuration 1 took 3 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#49): predict the average change is duration of flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 0.9999999403953552}, {'text': 'duration of flights which', 'confidence': 0.9999996721744537}, {'text': 'start', 'confidence': 0.9999886155128479}]
Annotation 2: [{'text': 'change', 'confidence': 0.986327052116394}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}, {'text': 'start', 'confidence': 0.4462527930736542}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#50): predict the average departure delay for flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9993968605995178}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999990463256836}, {'text': 'start', 'confidence': 0.38901740312576294}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#51): predict the average arrival delay for flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.999915599822998}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9995099902153015}, {'text': 'flights', 'confidence': 0.9999993443489075}, {'text': 'start', 'confidence': 0.4176228642463684}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#52): predict the average air system delay for flights which will start from Atlanta International Airport which will have flight duration more than four hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9992895722389221}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9998815655708313}, {'text': 'flights', 'confidence': 0.9999984502792358}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#53): predict the average security delay for flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9999130368232727}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999990463256836}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#54): predict the average airline delay for flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9998470544815063}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.999998927116394}, {'text': 'start', 'confidence': 0.38126206398010254}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#55): predict the average late aircraft delay for flights which will start from Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9997398853302002}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999992847442627}, {'text': 'flights', 'confidence': 0.9999992847442627}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#56): predict the average change is duration of flights which will start from Atlanta International Airport with security delay more than five minutes for next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 0.9999999403953552}, {'text': 'duration of flights which', 'confidence': 0.9999994486570358}, {'text': 'start', 'confidence': 0.999992311000824}]
Annotation 2: [{'text': 'change', 'confidence': 0.9975414276123047}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#57): predict the average departure delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9975658655166626}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992251396179}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#58): predict the average arrival delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.99964439868927}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9964981973171234}, {'text': 'flights', 'confidence': 0.9999993443489075}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#59): predict the average air system delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9994192123413086}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9999071955680847}, {'text': 'flights', 'confidence': 0.9999978542327881}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#60): predict the average security delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9996314644813538}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999991059303284}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#61): predict the average airline delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9993733167648315}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999989867210388}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#62): predict the average late aircraft delay for flights which will start from Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9991527199745178}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999991357326508}, {'text': 'flights', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#63): predict the average change is duration of flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 0.9999999403953552}, {'text': 'duration of flights which', 'confidence': 0.9999995529651642}, {'text': 'start', 'confidence': 0.9999759197235107}]
Annotation 2: [{'text': 'change', 'confidence': 0.9804689884185791}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#64): predict the average departure delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9960705637931824}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999994039535522}, {'text': 'start', 'confidence': 0.3651830554008484}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#65): predict the average arrival delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9995027184486389}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9646424949169159}, {'text': 'flights', 'confidence': 0.9999994039535522}, {'text': 'start', 'confidence': 0.3860728442668915}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#66): predict the average air system delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.998691201210022}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9999334812164307}, {'text': 'flights', 'confidence': 0.9999982714653015}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#67): predict the average security delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9994216561317444}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992251396179}, {'text': 'start', 'confidence': 0.3184075653553009}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#68): predict the average airline delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9987378716468811}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999991655349731}, {'text': 'start', 'confidence': 0.34830331802368164}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#69): predict the average late aircraft delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9982425570487976}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999993145465851}, {'text': 'flights', 'confidence': 0.999998927116394}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#70): predict the average change is duration of flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 0.9999999403953552}, {'text': 'duration of flights which', 'confidence': 0.9999994039535522}, {'text': 'start', 'confidence': 0.9996415376663208}]
Annotation 2: [{'text': 'change', 'confidence': 0.9899966716766357}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}, {'text': 'start', 'confidence': 0.35735347867012024}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#71): predict the average departure delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9559800028800964}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999994039535522}, {'text': 'start', 'confidence': 0.544145405292511}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#72): predict the average arrival delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9927011728286743}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.998760461807251}, {'text': 'flights', 'confidence': 0.999999463558197}, {'text': 'start', 'confidence': 0.5631474256515503}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#73): predict the average air system delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9880433678627014}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9990746676921844}, {'text': 'flights', 'confidence': 0.9999979138374329}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#74): predict the average security delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9928694367408752}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992847442627}, {'text': 'start', 'confidence': 0.47307291626930237}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#75): predict the average airline delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9860687255859375}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999992251396179}, {'text': 'start', 'confidence': 0.506251335144043}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#76): predict the average late aircraft delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9861502051353455}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.999998927116394}, {'text': 'flights', 'confidence': 0.9999988079071045}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#77): predict the average change is duration of flights for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9922425150871277}, {'text': 'Am and 11 Am', 'confidence': 0.9993589520454407}]
Annotation 2: [{'text': 'change', 'confidence': 0.9960870146751404}, {'text': 'duration of flights', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999993443489075}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#78): predict the average departure delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9964398145675659}, {'text': 'Am and 11 Am', 'confidence': 0.9996474534273148}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992251396179}, {'text': 'start', 'confidence': 0.5813219547271729}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#79): predict the average arrival delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9991777539253235}, {'text': 'Am and 11 Am', 'confidence': 0.9997491985559464}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9999812245368958}, {'text': 'flights', 'confidence': 0.999999463558197}, {'text': 'start', 'confidence': 0.598198413848877}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#80): predict the average air system delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9982247948646545}, {'text': 'Am and 11 Am', 'confidence': 0.9996801614761353}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9997187852859497}, {'text': 'flights', 'confidence': 0.9999979734420776}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#81): predict the average security delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9990405440330505}, {'text': 'Am and 11 Am', 'confidence': 0.9998077750205994}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999991655349731}, {'text': 'start', 'confidence': 0.5218973755836487}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#82): predict the average airline delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.997963547706604}, {'text': 'Am and 11 Am', 'confidence': 0.9998129308223724}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999989867210388}, {'text': 'start', 'confidence': 0.5502379536628723}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#83): predict the average late aircraft delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9975235462188721}, {'text': 'Am and 11 Am', 'confidence': 0.9996020942926407}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999987483024597}, {'text': 'flights', 'confidence': 0.9999987483024597}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#84): predict the average change is duration of flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights which will land', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.9858617186546326}, {'text': 'duration of flights', 'confidence': 0.9999997814496359}]

Configuration 1 took 4 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#85): predict the average departure delay for flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.8389137387275696}, {'text': 'land', 'confidence': 0.9932240843772888}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#86): predict the average arrival delay for flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5368631482124329}, {'text': 'land', 'confidence': 0.9948998093605042}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9998648464679718}, {'text': 'flights', 'confidence': 0.999999463558197}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#87): predict the average air system delay for flights which will land in Atlanta International Airport which will have flight duration more than four hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9891254901885986}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9999580383300781}, {'text': 'flights', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#88): predict the average security delay for flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.6092509031295776}, {'text': 'land', 'confidence': 0.9970665574073792}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#89): predict the average airline delay for flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5597218871116638}, {'text': 'land', 'confidence': 0.9974995851516724}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#90): predict the average late aircraft delay for flights which will land in Atlanta International Airport which will have elapsed time more than four hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9959267973899841}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999995529651642}, {'text': 'flights', 'confidence': 0.9999993443489075}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#91): predict the average change is duration of flights which will land in Atlanta International Airport with security delay more than five minutes for next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights which will land', 'confidence': 1.0}]
Annotation 2: [{'text': 'change', 'confidence': 0.9975382089614868}, {'text': 'duration of flights', 'confidence': 0.9999997417132059}]

Configuration 1 took 11 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#92): predict the average departure delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.7966871857643127}, {'text': 'land', 'confidence': 0.9242881536483765}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#93): predict the average arrival delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9204458594322205}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9990013539791107}, {'text': 'flights', 'confidence': 0.999999463558197}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#94): predict the average air system delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9831230640411377}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9999665021896362}, {'text': 'flights', 'confidence': 0.9999982118606567}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#95): predict the average security delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9576001763343811}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#96): predict the average airline delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.972100555896759}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#97): predict the average late aircraft delay for flights which will land in Atlanta International Airport which will have security delay more than four minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9714077711105347}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999993741512299}, {'text': 'flights', 'confidence': 0.9999988675117493}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#98): predict the average change is duration of flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights which will land', 'confidence': 0.9999999900658926}]
Annotation 2: [{'text': 'change', 'confidence': 0.9798771142959595}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#99): predict the average departure delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.8611701726913452}, {'text': 'land', 'confidence': 0.9858734011650085}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#100): predict the average arrival delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.986422598361969}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9876114726066589}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#101): predict the average air system delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9959697127342224}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.999976247549057}, {'text': 'flights', 'confidence': 0.9999986290931702}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#102): predict the average security delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5193527340888977}, {'text': 'land', 'confidence': 0.9929842948913574}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#103): predict the average airline delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.995339035987854}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#104): predict the average late aircraft delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9938094615936279}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999994933605194}, {'text': 'flights', 'confidence': 0.9999991655349731}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#105): predict the average change is duration of flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 1.0}, {'text': 'duration of flights which will land', 'confidence': 0.9999999403953552}]
Annotation 2: [{'text': 'change', 'confidence': 0.9900271892547607}, {'text': 'duration of flights', 'confidence': 0.9999997615814209}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#106): predict the average departure delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.7390733361244202}, {'text': 'land', 'confidence': 0.9634757041931152}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999996423721313}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#107): predict the average arrival delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9624029994010925}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9995910227298737}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#108): predict the average air system delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9822694659233093}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9996344745159149}, {'text': 'flights', 'confidence': 0.999998152256012}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#109): predict the average security delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9783904552459717}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#110): predict the average airline delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9833988547325134}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#111): predict the average late aircraft delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9768901467323303}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999992251396179}, {'text': 'flights', 'confidence': 0.999998927116394}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#112): predict the average change is duration of flights for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time and will start within next week
Ground Truth (attribute): ELAPSED TIME

Annotation 1: [{'text': 'change', 'confidence': 0.9999999403953552}, {'text': 'duration of flights', 'confidence': 0.9999999602635702}, {'text': 'flights', 'confidence': 0.567602276802063}, {'text': 'land', 'confidence': 0.9998387098312378}, {'text': 'Am and 11 Am', 'confidence': 0.9986797869205475}]
Annotation 2: [{'text': 'change', 'confidence': 0.9957687258720398}, {'text': 'duration of flights', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999994039535522}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#113): predict the average departure delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.5738309025764465}, {'text': 'land', 'confidence': 0.9743744730949402}, {'text': 'Am and 11 Am', 'confidence': 0.9994527846574783}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#114): predict the average arrival delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): ARRIVAL DELAY

Annotation 1: [{'text': 'arrival delay', 'confidence': 0.9999999403953552}, {'text': 'land', 'confidence': 0.9563851356506348}, {'text': 'Am and 11 Am', 'confidence': 0.9995807856321335}]
Annotation 2: [{'text': 'arrival delay', 'confidence': 0.9999947547912598}, {'text': 'flights', 'confidence': 0.9999996423721313}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#115): predict the average air system delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.975321888923645}, {'text': 'Am and 11 Am', 'confidence': 0.9994402974843979}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9998917877674103}, {'text': 'flights', 'confidence': 0.9999983310699463}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#116): predict the average security delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 0.9999999403953552}, {'text': 'land', 'confidence': 0.9717190861701965}, {'text': 'Am and 11 Am', 'confidence': 0.9996894299983978}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#117): predict the average airline delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 0.9999999403953552}, {'text': 'land', 'confidence': 0.9765461683273315}, {'text': 'Am and 11 Am', 'confidence': 0.9996944069862366}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#118): predict the average late aircraft delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9661012887954712}, {'text': 'Am and 11 Am', 'confidence': 0.9992844611406326}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999991953372955}, {'text': 'flights', 'confidence': 0.999998927116394}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#119): predict the total security delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#120): predict the total air system delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9859592616558075}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#121): predict the total delay due to bad weather for each flight of Emirates Airlines where flight will start next week
Ground Truth (attribute): WEATHER DELAY

Annotation 1: [{'text': 'total delay due to bad', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'weather', 'confidence': 0.9999999403953552}]

Configuration 1 took 4 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#122): predict total cancelled flights for each airline for next week
Ground Truth (attribute): CANCELLED

Annotation 1: [{'text': 'cancelled flights', 'confidence': 1.0}]
Annotation 2: [{'text': 'flights', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#123): predict the total departure delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#124): predict the total airline delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#125): predict the total late aircraft delay for the Emirates Airlines which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999515414237976}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#126): predict the total departure delay for the Emirates Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#127): predict the total air system delay for the Quatar Airlines which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9483138620853424}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#128): predict the total security delay for the Cathay Pacific Airways which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#129): predict the total airline delay for the Cathay Pacific Airways which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999993443489075}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#130): predict the total late aircraft delay for the Cathay Pacific Airways which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.999971330165863}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#131): predict the total departure delay for the Emirates Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9974885880947113}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#132): predict the total air system delay for the Quatar Airlines which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9956394135951996}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9685396254062653}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#133): predict the total security delay for the Cathay Pacific Airways which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9934006780385971}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#134): predict the total airline delay for the Cathay Pacific Airways which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.9922943711280823}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999992251396179}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#135): predict the total late aircraft delay for the Cathay Pacific Airways which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'Am and 11 Am', 'confidence': 0.981622502207756}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999768733978271}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#136): predict the total departure delay for the Emirates Airlines which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#137): predict the total air system delay for the Quatar Airlines which will have flight duration more than seven hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9506065845489502}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#138): predict the total security delay for the Cathay Pacific Airways which will have elapsed time more than eight hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#139): predict the total airline delay for the Cathay Pacific Airways which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999986290931702}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#140): predict the total late aircraft delay for the Cathay Pacific Airways which will have flight duration more than eleven hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999846816062927}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#141): predict the total departure delay for the Emirates Airlines security delay more than five minutes for next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#142): predict the total air system delay for the Quatar Airlines which will have security delay more than seven minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.954426497220993}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#143): predict the total security delay for the Cathay Pacific Airways which will have security delay more than eight minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 4 attempt(s) to get the correct answer

--------------------------------------------------

Query (#144): predict the total airline delay for the Cathay Pacific Airways with security delay more than five minutes for next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999972581863403}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#145): predict the total late aircraft delay for the Cathay Pacific Airways which will have security delay more than eleven minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999169707298279}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#146): predict the total departure delay for flights which will start from Atlanta International Airport which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9980935454368591}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999994039535522}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#147): predict the total air system delay for flights which will start from Atlanta International Airport will have flight duration more than seven hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9665886163711548}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.997965544462204}, {'text': 'flights', 'confidence': 0.9999963641166687}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#148): predict the total security delay for flights which will start from Atlanta International Airport which will have elapsed time more than eight hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9999127984046936}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999989867210388}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#149): predict the total airline delay for flights which will start from Atlanta International Airport which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9992340207099915}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999991655349731}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#150): predict the total late aircraft delay for flights which will start from Atlanta International Airport which will have flight duration more than eleven hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9984339475631714}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999991059303284}, {'text': 'flights', 'confidence': 0.999998927116394}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#151): predict the total departure delay for flights which will start from Atlanta International Airport security delay more than five minutes for next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9967443943023682}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998509883881}, {'text': 'flights', 'confidence': 0.9999977946281433}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#152): predict the total air system delay for flights which will start from Atlanta International Airport which will have security delay more than seven minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9990497827529907}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9990991353988647}, {'text': 'flights', 'confidence': 0.9999971389770508}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#153): predict the total security delay for flights which will start from Atlanta International Airport which will have security delay more than eight minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9996053576469421}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999990463256836}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#154): predict the total airline delay for flights which will start from Atlanta International Airport with security delay more than five minutes for next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9994135499000549}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999985694885254}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#155): predict the total late aircraft delay for flights which will start from Atlanta International Airport which will have security delay more than eleven minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9987706542015076}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999984204769135}, {'text': 'flights', 'confidence': 0.9999985098838806}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#156): predict the total departure delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999701976776}, {'text': 'start', 'confidence': 0.9967368841171265}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999993443489075}, {'text': 'start', 'confidence': 0.3340062201023102}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#157): predict the total air system delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9982988834381104}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9993135333061218}, {'text': 'flights', 'confidence': 0.9999977350234985}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#158): predict the total security delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9993963837623596}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999991655349731}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#159): predict the total airline delay for flights which will start from Atlanta International Airport swhich will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9997724890708923}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999988675117493}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#160): predict the total late aircraft delay for flights which will start from Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9978659152984619}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999987185001373}, {'text': 'flights', 'confidence': 0.9999987483024597}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#161): predict the total departure delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9655441641807556}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999994039535522}, {'text': 'start', 'confidence': 0.5010397434234619}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#162): predict the total air system delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9866108894348145}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9930895268917084}, {'text': 'flights', 'confidence': 0.9999974370002747}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#163): predict the total security delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9926908612251282}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992847442627}, {'text': 'start', 'confidence': 0.43442443013191223}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#164): predict the total airline delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9851679801940918}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999992251396179}, {'text': 'start', 'confidence': 0.460251122713089}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#165): predict the total late aircraft delay for flights which will start from Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9856559038162231}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.999997466802597}, {'text': 'flights', 'confidence': 0.9999985098838806}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#166): predict the total departure delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9971979260444641}, {'text': 'Am and 11 Am', 'confidence': 0.9995010197162628}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999992251396179}, {'text': 'start', 'confidence': 0.5438085794448853}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#167): predict the total air system delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9979910850524902}, {'text': 'Am and 11 Am', 'confidence': 0.9995234459638596}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9966970980167389}, {'text': 'flights', 'confidence': 0.9999974370002747}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#168): predict the total security delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9990046620368958}, {'text': 'Am and 11 Am', 'confidence': 0.9996719062328339}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999991059303284}, {'text': 'start', 'confidence': 0.48972734808921814}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#169): predict the total airline delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 0.9999999403953552}, {'text': 'start', 'confidence': 0.9976869225502014}, {'text': 'Am and 11 Am', 'confidence': 0.9997096359729767}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999989867210388}, {'text': 'start', 'confidence': 0.5113261938095093}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#170): predict the total late aircraft delay for flights which will start from Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'start', 'confidence': 0.9971894025802612}, {'text': 'Am and 11 Am', 'confidence': 0.9993861019611359}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999972581863403}, {'text': 'flights', 'confidence': 0.9999985098838806}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#171): predict the total departure delay for flights which will land in Atlanta International Airport which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999701976776}, {'text': 'flights', 'confidence': 0.9701439142227173}, {'text': 'land', 'confidence': 0.9967702031135559}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#172): predict the total air system delay for flights which will land in Atlanta International Airport will have flight duration more than seven hours and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9509688019752502}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9990923702716827}, {'text': 'flights', 'confidence': 0.9999967217445374}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#173): predict the total security delay for flights which will land in Atlanta International Airport which will have elapsed time more than eight hours and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.7164409756660461}, {'text': 'land', 'confidence': 0.9988755583763123}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#174): predict the total airline delay for flights which will land in Atlanta International Airport which will have flight duration more than five hours and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5499510169029236}, {'text': 'land', 'confidence': 0.9949965476989746}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#175): predict the total late aircraft delay for flights which will land in Atlanta International Airport which will have flight duration more than eleven hours and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9957319498062134}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999994039535522}, {'text': 'flights', 'confidence': 0.9999992251396179}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#176): predict the total departure delay for flights which will land in Atlanta International Airport security delay more than five minutes for next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9994790554046631}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999998807907104}, {'text': 'flights', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#177): predict the total air system delay for flights which will land in Atlanta International Airport which will have security delay more than seven minutes and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9935658574104309}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9996724724769592}, {'text': 'flights', 'confidence': 0.9999975562095642}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#178): predict the total security delay for flights which will land in Atlanta International Airport which will have security delay more than eight minutes and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5339817404747009}, {'text': 'land', 'confidence': 0.9869622588157654}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#179): predict the total airline delay for flights which will land in Atlanta International Airport with security delay more than five minutes for next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9998469948768616}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999992251396179}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#180): predict the total late aircraft delay for flights which will land in Atlanta International Airport which will have security delay more than eleven minutes and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9929283261299133}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999989867210388}, {'text': 'flights', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#181): predict the total departure delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999701976776}, {'text': 'flights', 'confidence': 0.971741259098053}, {'text': 'land', 'confidence': 0.998638391494751}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#182): predict the total air system delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9985363483428955}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9997597932815552}, {'text': 'flights', 'confidence': 0.9999979138374329}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#183): predict the total security delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.6344344019889832}, {'text': 'land', 'confidence': 0.998045802116394}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#184): predict the total airline delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5794891715049744}, {'text': 'land', 'confidence': 0.9983870983123779}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#185): predict the total late aircraft delay for flights which will land in Atlanta International Airport which will have tail number greater than 1k and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9982144832611084}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999991357326508}, {'text': 'flights', 'confidence': 0.999998927116394}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#186): predict the total departure delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.9425095915794373}, {'text': 'land', 'confidence': 0.9945606589317322}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.9999996423721313}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#187): predict the total air system delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9929593801498413}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9967872202396393}, {'text': 'flights', 'confidence': 0.9999976754188538}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#188): predict the total security delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.6000588536262512}, {'text': 'land', 'confidence': 0.9918705821037292}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#189): predict the total airline delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 1.0}, {'text': 'flights', 'confidence': 0.5574034452438354}, {'text': 'land', 'confidence': 0.9927233457565308}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#190): predict the total late aircraft delay for flights which will land in Atlanta International Airport which will have scheduled departure hour after 4 PM is central time and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9924381971359253}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999984204769135}, {'text': 'flights', 'confidence': 0.9999986886978149}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#191): predict the total departure delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): DEPARTURE DELAY

Annotation 1: [{'text': 'departure delay', 'confidence': 0.9999999105930328}, {'text': 'flights', 'confidence': 0.8707659840583801}, {'text': 'land', 'confidence': 0.9972479343414307}, {'text': 'Am and 11 Am', 'confidence': 0.9991730749607086}]
Annotation 2: [{'text': 'departure delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995827674866}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#192): predict the total air system delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIR SYSTEM DELAY

Annotation 1: [{'text': 'air system delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9938637614250183}, {'text': 'Am and 11 Am', 'confidence': 0.9989833831787109}]
Annotation 2: [{'text': 'system delay', 'confidence': 0.9985577762126923}, {'text': 'flights', 'confidence': 0.9999977350234985}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#193): predict the total security delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): SECURITY DELAY

Annotation 1: [{'text': 'security delay', 'confidence': 0.9999999403953552}, {'text': 'land', 'confidence': 0.9935497045516968}, {'text': 'Am and 11 Am', 'confidence': 0.9994068592786789}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999999403953552}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 5 attempt(s) to get the correct answer

--------------------------------------------------

Query (#194): predict the total airline delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): AIRLINE DELAY

Annotation 1: [{'text': 'airline delay', 'confidence': 0.9999999403953552}, {'text': 'land', 'confidence': 0.994027853012085}, {'text': 'Am and 11 Am', 'confidence': 0.999492883682251}]
Annotation 2: [{'text': 'delay', 'confidence': 0.9999998211860657}, {'text': 'flights', 'confidence': 0.9999995231628418}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 6 attempt(s) to get the correct answer

--------------------------------------------------

Query (#195): predict the total late aircraft delay for flights which will land in Atlanta International Airport which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (attribute): LATE AIRCRAFT DELAY

Annotation 1: [{'text': 'late aircraft delay', 'confidence': 1.0}, {'text': 'land', 'confidence': 0.9927448630332947}, {'text': 'Am and 11 Am', 'confidence': 0.998561829328537}]
Annotation 2: [{'text': 'aircraft delay', 'confidence': 0.9999983310699463}, {'text': 'flights', 'confidence': 0.9999987483024597}]

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

