Configuration 1: bert on use with cosine distance
Configuration 2: bert on fasttext with cosine distance
Testing on: combined queries for the flight_delay schema
Entries below have greater num_guesses with a threshold of 3
--------------------------------------------------
--------------------------------------------------

Query (#1): Predict the average departure delay for each airline tomorrow
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#2): Predict the average airline delay for each airline tomorrow
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#3): Predict the average departure delay for each airline where elapsed time is more than five hours for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#4): Predict the number of airline that will not have a flight tomorrow
Ground Truth (filter_operation): NONE

Annotation 1: [{'text': 'have', 'confidence': 0.9954626560211182}]
Annotation 2: [{'text': 'have', 'confidence': 0.9954626560211182}]

Configuration 1 took 9 attempt(s) to get the correct answer
Configuration 2 took 10 attempt(s) to get the correct answer

--------------------------------------------------

Query (#5): Predict the average arrival delay for each airline tomorrow
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#6): Predict the total departure delay for each airline that will be delayed more than five minutes by air system next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#7): Predict the average weather delay for each airline where id of aircraft is 5 for next week
Ground Truth (filter_operation): IS EQUAL TO

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#8): Predict the average weather delay for each airline where departure delay is less than 5 munites for next week
Ground Truth (filter_operation): IS LESS THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#9): Predict the average security delay for each airline where departure delay is more than ten munites tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#10): Predict the average delay caused by late aircraft for each airline where elapsed time is more than seven hours for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#11): Predict total number of airports with cancelled flight count more than five for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#12): Predict total departure delay for each origin airports with security delay more than five minutes for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#13): Predict average departure delay for each origin airports with security delay less than five minutes for tomorrow
Ground Truth (filter_operation): IS LESS THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#14): Predict average air system delay for each origin airports for all Emirates airways flights next week
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#15): Predict total weather delay for each origin airports with aircraft delay more than ten minutes for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#16): Predict average departure delay for each origin airports with delay caused by aircraft is less than five minutes for next week
Ground Truth (filter_operation): IS LESS THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#17): Predict total aircraft delay for each origin airports where scheduled departure time is after 13:00 for tomorrow
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#18): Predict average elapsed time for each origin airports with security delay more than five minutes for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#19): Predict total departure delay for each origin airports with flights starting next Monday
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#20): Predict average elapsed time for each origin airports with delay caused by weather is more than five minutes for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#21): Predict total arrival delay for each destination airports with security delay more than five minutes for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#22): Predict average arrival delay for each destination airports with weather delay less than five minutes for tomorrow
Ground Truth (filter_operation): IS LESS THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#23): Predict average air system delay for each destination airports for all Emirates airways flights next week
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#24): Predict total weather delay for each destination airports with aircraft delay more than ten minutes for tomorrow
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#25): Predict average arrival delay for each destination airports with delay caused by aircraft is less than five minutes for next week
Ground Truth (filter_operation): IS LESS THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#26): Predict total aircraft delay for each destination airports where scheduled departure time is after 13:00 for tomorrow
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#27): Predict average elapsed time for each destination airports with security delay more than five minutes for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#28): Predict total arrival delay for each destination airports with flights starting next Sunday
Ground Truth (filter_operation): NONE

Annotation 1: []
Annotation 2: []

Configuration 1 took 1 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#29): predict the average change in elapsed time for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#30): predict the total departure delay for flights for each destination airports which will have flight duration more than five hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9598866701126099}]
Annotation 2: [{'text': 'have', 'confidence': 0.9598866701126099}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#31): predict the average departure delay for flights for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9970839619636536}]
Annotation 2: [{'text': 'have', 'confidence': 0.9970839619636536}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#32): predict the average arrival delay for flights for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9967408180236816}]
Annotation 2: [{'text': 'have', 'confidence': 0.9967408180236816}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#33): predict the average air system delay for flights for each destination airports which will have flight duration more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#34): predict the average security delay for flights for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9957042932510376}]
Annotation 2: [{'text': 'have', 'confidence': 0.9957042932510376}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#35): predict the average airline delay for flights for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.998702347278595}]
Annotation 2: [{'text': 'have', 'confidence': 0.998702347278595}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#36): predict the average late aircraft delay for flights for each destination airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#37): predict the average change in elapsed time for each destination airports with security delay more than five minutes for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#38): predict the average departure delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#39): predict the average arrival delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#40): predict the average air system delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#41): predict the average security delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#42): predict the average airline delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#43): predict the average late aircraft delay for flights for each destination airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#44): predict the average change in elapsed time for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#45): predict the average departure delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#46): predict the average arrival delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#47): predict the average air system delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#48): predict the average security delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#49): predict the average airline delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#50): predict the average late aircraft delay for flights for each destination airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#51): predict the average change in elapsed time for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#52): predict the average departure delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#53): predict the average arrival delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#54): predict the average air system delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#55): predict the average security delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#56): predict the average airline delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#57): predict the total air system delay for flights for each destination airports will have flight duration more than seven hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.961803674697876}]
Annotation 2: [{'text': 'have', 'confidence': 0.961803674697876}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#58): predict the average late aircraft delay for flights for each destination airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#59): predict the average change in elapsed time for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#60): predict the average departure delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#61): predict the average arrival delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#62): predict the average air system delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: [{'text': 'in', 'confidence': 0.5228738188743591}]
Annotation 2: [{'text': 'in', 'confidence': 0.5228738188743591}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#63): predict the average security delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#64): predict the average airline delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#65): predict the average late aircraft delay for flights for each destination airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: [{'text': 'in', 'confidence': 0.5054484009742737}]
Annotation 2: [{'text': 'in', 'confidence': 0.5054484009742737}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#66): predict the average change is elapsed time for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#67): predict the average departure delay for flights for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9946513772010803}]
Annotation 2: [{'text': 'have', 'confidence': 0.9946513772010803}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#68): predict the average arrival delay for flights for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9946299195289612}]
Annotation 2: [{'text': 'have', 'confidence': 0.9946299195289612}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#69): predict the average air system delay for flights for each origin airports which will have flight duration more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#70): predict the average security delay for flights for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9923433661460876}]
Annotation 2: [{'text': 'have', 'confidence': 0.9923433661460876}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#71): predict the average airline delay for flights for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: [{'text': 'have', 'confidence': 0.9976810812950134}]
Annotation 2: [{'text': 'have', 'confidence': 0.9976810812950134}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#72): predict the average late aircraft delay for flights for each origin airports which will have elapsed time more than four hours and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#73): predict the average change is elapsed time for each origin airports with security delay more than five minutes for next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#74): predict the average departure delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#75): predict the average arrival delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#76): predict the average air system delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#77): predict the average security delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#78): predict the average airline delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#79): predict the average late aircraft delay for flights for each origin airports which will have security delay more than four minutes and will start within next week
Ground Truth (filter_operation): IS MORE THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#80): predict the average change is elapsed time for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#81): predict the average departure delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#82): predict the average arrival delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#83): predict the average air system delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#84): predict the average security delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#85): predict the average airline delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#86): predict the average late aircraft delay for flights for each origin airports which will have tail number greater than 1k and will start within next week
Ground Truth (filter_operation): IS GREATER THAN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#87): predict the average change is elapsed time for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#88): predict the average departure delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#89): predict the average arrival delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#90): predict the average air system delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#91): predict the average security delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#92): predict the average airline delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#93): predict the average late aircraft delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#94): predict the average change is elapsed time for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#95): predict the average departure delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#96): predict the average arrival delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#97): predict the average air system delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: [{'text': 'in', 'confidence': 0.4896983802318573}]
Annotation 2: [{'text': 'in', 'confidence': 0.4896983802318573}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#98): predict the average security delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#99): predict the average airline delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#100): predict the average late aircraft delay for flights for each origin airports which will have scheduled time is in between 7 Am and 11 Am in central time  and will start within next week
Ground Truth (filter_operation): IS BETWEEN

Annotation 1: [{'text': 'in', 'confidence': 0.48064476251602173}]
Annotation 2: [{'text': 'in', 'confidence': 0.48064476251602173}]

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 1 attempt(s) to get the correct answer

--------------------------------------------------

Query (#101): predict the total departure delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

Query (#102): predict the total air system delay for flights for each origin airports which will have scheduled departure hour after 4 PM in central time and will start within next week
Ground Truth (filter_operation): IS AFTER

Annotation 1: []
Annotation 2: []

Configuration 1 took 2 attempt(s) to get the correct answer
Configuration 2 took 2 attempt(s) to get the correct answer

--------------------------------------------------

