[
  {
    "context": [
      "to_entries[] | select((.value | length) > 0) | { (.key) : .value }",
      "current output\n\n    {\"full1\":\"test\"}\n    {\"full2\":{\"a\":1,\"b\":2}}",
      "desired output\n\n    {\n      \"full1\": \"test\",\n      \"full2\": {\n        \"a\": 1,\n        \"b\": 2\n      }\n    }",
      "jq 'with_entries(select((.value | length) > 0))' json",
      "For your question, from_entries does the opposite conversion of to_entries. And using with_entries(foo) is a shorthand for to_entries | map(foo) | from_entries syntax."
    ],
    "utterance": "Keep only properties whose value's length is greater than 0 and reconstruct the original object with those properties.",
    "expressions": [
      "with_entries(select((.value | length) > 0))",
      "to_entries | map(select((.value | length) > 0)) | from_entries"
    ],
    "data": [
      {
        "input": {
          "empty1": [],
          "empty2": [],
          "full1": "test",
          "full2": {
            "a": 1,
            "b": 2
          }
        },
        "output": {
          "full1": "test",
          "full2": {
            "a": 1,
            "b": 2
          }
        }
      }
    ],
    "identifier": 53739817,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a json object with numeric keys in an `example.json` file:\r\n\r\n    {\r\n      \"1\": \"foo\",\r\n      \"2\": \"bar\"\r\n    }\r\n",
      "I want to get its properties by key via `jq`",
      "The result should be\r\n\r\n    \"foo\"",
      "Use quotes:\r\n\r\n    $ jq '.\"1\"' example.json\r\n    \"foo\"",
      "You can also put the key name into square brackets (as you have already tried) but it doesn't add any improvement, it's the same program only bloated.\r\n\r\n    jq '.[\"1\"]' example.json"
    ],
    "utterance": "Retrieve the value associated with the key \"1\" in an object where the keys are numbers as strings and the value should be \"foo\".",
    "expressions": [
      ".\"1\"",
      ".[\"1\"]"
    ],
    "data": [
      {
        "input": {
          "1": "foo",
          "2": "bar"
        },
        "output": "foo"
      }
    ],
    "identifier": 53742316,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "The first command gives the values I'm looking for using jq:",
      "aws ec2 describe-volumes | jq -r '.Volumes[] | .VolumeId, .AvailabilityZone, .Attachments[].InstanceId, .Attachments[].State, (.Tags // [] | from_entries.Name)'",
      "However, if I put it into CSV format so I can output the variable to a spread sheet, the command blows up and doesn't work:",
      "aws ec2 describe-volumes | jq -r '.Volumes[] | .VolumeId, .AvailabilityZone, .Attachments[].InstanceId, .Attachments[].State, (.Tags // [] | from_entries.Name)| @csv'",
      "jq: error (at <stdin>:4418): string (\"vol-743d1234\") cannot be csv-formatted, only array",
      "You can only apply `@csv` over an array content, just enclose your filter within a `[..]` as below",
      "jq -r '[.Volumes[] | .VolumeId, .AvailabilityZone, .Attachments[].InstanceId, .Attachments[].State, (.Tags // [] | from_entries.Name)]|@csv'",
      "Using the above might still retain the quotes, so using `join()` would also be appropriate here",
      "jq -r '[.Volumes[] | .VolumeId, .AvailabilityZone, .Attachments[].InstanceId, .Attachments[].State, (.Tags // [] | from_entries.Name)] | join(\",\")'"
    ],
    "utterance": "Output for each EBS volume the fields VolumeId, AvailabilityZone, InstanceId, State, and the Name tag as a single comma-separated CSV row.",
    "expressions": [
      ".Volumes[] | [.VolumeId, .AvailabilityZone, (.Attachments[]?.InstanceId // null), (.Attachments[]?.State // null), ((.Tags // [] | from_entries.Name) // null)] | @csv",
      ".Volumes[] | [.VolumeId, .AvailabilityZone, (.Attachments[]?.InstanceId // null), (.Attachments[]?.State // null), ((.Tags // [] | from_entries.Name) // null)] | join(\",\")"
    ],
    "identifier": 53754702,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input JSON structure (e.g., is it the output of describe-volumes? How are tags represented?), so it's unclear how to extract the Name tag or the other fields.",
      "other_environment": null
    }
  },
  {
    "context": [
      "If I run this command it gives me the time the volume was created in this format:\r\n\r\n    aws ec2 describe-volumes --volume-ids vol-066e24cb8d2294605 | jq -r  '.Volumes[].CreateTime'\r\n\r\n    2018-12-11T18:54:26.110Z",
      "I'm trying to format the time using strftime, but it doesn't work. What I get is this error:\r\n\r\n    aws ec2 describe-volumes --volume-ids vol-066e24cb8d2294605 | jq -r  '.Volumes[] | .CreateTime | strftime (\"%Y-%m-%d %H:%M:%S\")'\r\n    \r\n    jq: error (at <stdin>:39): strftime/1 requires parsed datetime inputs",
      "You will first have to parse the string. You could use `strptime`.  Here is an illustration of how to do so: \r\n\r\n    <<< '\"2018-12-11T18:54.11Z\"' jq -c '\r\n    strptime(\"%Y-%m-%dT%H:%M.%SZ\")' \r\n    [2018,11,11,18,54,11,2,344]",
      "So it looks like you might have to munge the incoming date-time string, e.g.:\r\n\r\n    if test(\"\\\\....Z$\") then sub(\".Z$\"; \"Z\") else . end\r\n    | strptime(\"%Y-%m-%dT%H:%M.%SZ\")\r\n    | strftime(\"%Y-%m-%d %H:%M:%S\")"
    ],
    "utterance": "Format ISO-style timestamp strings like '2018-12-11T18:54:26.110Z' into a more readable 'YYYY-MM-DD HH:MM:SS' format.",
    "expressions": [
      "if test(\"\\\\.\\d+Z$\") then sub(\"\\.\\d+Z$\"; \"Z\") else . end | strptime(\"%Y-%m-%dT%H:%M:%SZ\") | strftime(\"%Y-%m-%d %H:%M:%S\")"
    ],
    "data": [
      {
        "input": {
          "Volumes": [
            {
              "CreateTime": "2018-12-11T18:54:26.110Z"
            }
          ]
        },
        "output": "2018-12-11 18:54:26"
      }
    ],
    "identifier": 53766924,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Is there a way to do this in using jq, so I get all types of accounts and costs associated with them?",
      "With these, the following invocations:",
      "    group(\"account\"),",
      "    group(\"anotheraccount\")",
      "yield:",
      "[{\"account\":\"1\",\"cost\":17.009999999999998},{\"account\":\"2\",\"cost\":2.25}]",
      "[{\"anotheraccount\":\"a\",\"cost\":2}]"
    ],
    "utterance": "Sum the 'totalcost' values grouped by each distinct 'account' and 'anotheraccount', resulting in the total cost per account type.",
    "expressions": [
      "def sigma( f ): reduce .[] as $o (null; . + ($o | f )) ;\ndef group( keyname ):\n  map(select(has(keyname)))\n  | group_by( .[keyname] )\n  | map({(keyname) : .[0][keyname], cost: sigma(.cost[].totalcost | tonumber) })\n;\ngroup(\"account\"), group(\"anotheraccount\")"
    ],
    "data": [
      {
        "input": [
          {
            "account": "1",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2.01"
              }
            ]
          },
          {
            "account": "2",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2.25"
              }
            ]
          },
          {
            "account": "1",
            "cost": [
              {
                "usage": "low",
                "totalcost": "15"
              }
            ]
          },
          {
            "anotheraccount": "a",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2"
              }
            ]
          }
        ],
        "output": [
          [
            {
              "account": "1",
              "cost": 17.01
            },
            {
              "account": "2",
              "cost": 2.25
            }
          ],
          [
            {
              "anotheraccount": "a",
              "cost": 2
            }
          ]
        ]
      }
    ],
    "identifier": 53768947,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Specifically, I'd like to remove all underscores (\"_\") in all field names except for \"_id\".",
      "I'd like to descend, to any depth, all the paths in a MongoDB JSON doc and possibly modify each field name using jq",
      "walk( if type == \"object\"\n      then with_entries( if .key != \"_id\" then .key |= gsub(\"_\";\"\") else . end ) \n      else . end)"
    ],
    "utterance": "For every key in all nested objects except for '_id', remove all underscores from the key name, preserving other characters and values.",
    "expressions": [
      "walk( if type == \"object\" then with_entries( if .key != \"_id\" then .key |= gsub(\"_\";\"\") else . end ) else . end)"
    ],
    "data": [
      {
        "input": {
          "_id": "doc1",
          "field_11": "value_11",
          "field_12": {
            "field_121": {
              "field_1211": "value_1211",
              "field_1212": "value_1212"
            },
            "field_122": {
              "field_1221": "value_1221",
              "field_1222": "value_1222"
            }
          },
          "field_13": [
            {
              "field_131": {
                "field_1311": "value_1311",
                "field_1312": "value_1312"
              },
              "field_132": {
                "field_1321": "value_1321",
                "field_1322": "value_1322"
              }
            }
          ],
          "field_one_four": "value_one_four",
          "Field_One_Five": "Value_One_Five"
        },
        "output": {
          "_id": "doc1",
          "field11": "value_11",
          "field12": {
            "field121": {
              "field1211": "value_1211",
              "field1212": "value_1212"
            },
            "field122": {
              "field1221": "value_1221",
              "field1222": "value_1222"
            }
          },
          "field13": [
            {
              "field131": {
                "field1311": "value_1311",
                "field1312": "value_1312"
              },
              "field132": {
                "field1321": "value_1321",
                "field1322": "value_1322"
              }
            }
          ],
          "fieldonefour": "value_one_four",
          "FieldOneFive": "Value_One_Five"
        }
      }
    ],
    "identifier": 53769485,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Given the kubectl command below, and its response, I want to yank out the Service object.",
      "`kubectl get -f quotem_v2.yaml -o json | jq '.items[] | select(.kind==\"Service\")'`",
      "Then it uses the jq command line json parser to get the items array, and from the items array, select the elements whose 'kind' attribute matches 'Service'"
    ],
    "utterance": "Select all objects where kind equals \"Service\" from the items array.",
    "expressions": [
      ".items[] | select(.kind==\"Service\")"
    ],
    "data": [
      {
        "input": {
          "apiVersion": "v1",
          "items": [
            {
              "apiVersion": "v1",
              "kind": "Service",
              "metadata": {
                "name": "qotm-v2"
              }
            },
            {
              "apiVersion": "v1",
              "kind": "Deployment",
              "metadata": {
                "name": "qotm-deployment"
              }
            }
          ]
        },
        "output": {
          "apiVersion": "v1",
          "kind": "Service",
          "metadata": {
            "name": "qotm-v2"
          }
        }
      }
    ],
    "identifier": 53771929,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to transform it to\n\n    ID , name:value,name:value,name:value\n    ID , name:value,name:value,name:value\n    ID , name:value,name:value,name:value\n\nOne line per json entry in the file.",
      "You can use string interpolation with `map`:\n\n    .id + \" , \" + (.records | map(\"\\(.name):\\(.value)\") | join(\",\"))",
      "jq -nr 'inputs\n        | .id + \" , \" + \n        ( [ .records[] | .name + \":\" + (.value | tostring)  ] | join(\",\") )'"
    ],
    "utterance": "For each object, output a single line combining its id, then each of its record name:value pairs, separated by commas, in the format 'id , name1:value1,name2:value2,...'.",
    "expressions": [
      ".id + \" , \" + (.records | map(\"\\(.name):\\(.value)\") | join(\",\"))",
      "\"\\(.id) , \\(.records | map(\\\"\\(.name):\\(.value)\\\") | join(\",\"))\"",
      "[ .id, ( .records[] | .name + \":\" + (.value | tostring) ) ] | join(\",\")",
      ".id + \" , \" + ( [ .records[] | .name + \":\" + (.value | tostring)  ] | join(\",\") )"
    ],
    "data": [
      {
        "input": {
          "id": "tjh",
          "records": [
            {
              "name": "number_of_logins",
              "shortname": "nol",
              "value": 3
            },
            {
              "name": "failed_attempts",
              "shortname": "fa",
              "value": 4
            },
            {
              "name": "locations",
              "shortname": "loc",
              "value": 5
            }
          ]
        },
        "output": "tjh , number_of_logins:3,failed_attempts:4,locations:5"
      }
    ],
    "identifier": 53772040,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to be able to check this config file for the existence of the mongodb_url.",
      "if exists, use that url that contains the username and password to pass into the mongodump script.",
      "Use [tag:jq] to parse JSON.",
      "jq -r '.mongodb_url' ~/.config/server_config.json",
      "You can use `jq` bash command to parse the JSON file."
    ],
    "utterance": "Check if the key 'mongodb_url' exists and retrieve its value from the file ~/.config/server_config.json.",
    "expressions": [
      "jq -r '.mongodb_url' ~/.config/server_config.json"
    ],
    "data": [
      {
        "input": {
          "server_name": "server1",
          "mongodb_url": "mongodb://user:password@localhost:27017/dbname",
          "other stuff": "stuff2"
        },
        "output": "mongodb://user:password@localhost:27017/dbname"
      }
    ],
    "identifier": 53774996,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "At the bottom is some example json data which I&#39;d like to add timestamps to. i.e `&quot;time&quot;: &quot;1544785866.176123088&quot;` The timestamps need to be unique on each iteration and need to occur on the line above &quot;bladesetName&quot;.",
      "Here\u2019s one approach, which uses [tag:jq]:",
      "def addnow: \n  . as $in\n  | to_entries\n  | (map(.key)|index(\"bladesetName\")) as $i\n  | if $i \n    then .[0:$i]+[{key:\"time\",value:now}]+.[$i:] | from_entries\n    else $in \n    end ;",
      "walk(if type==\"object\" then addnow else . end)",
      "If you really want the time as a string, simply add `tostring`, as in `(now|tostring)`"
    ],
    "utterance": "Insert a unique timestamp value as a new key 'time' immediately before each 'bladesetName' key in every object.",
    "expressions": [
      "def addnow: \n  . as $in\n  | to_entries\n  | (map(.key)|index(\"bladesetName\")) as $i\n  | if $i \n    then .[0:$i]+[{key:\"time\",value:(now|tostring)}]+.[$i:] | from_entries\n    else $in \n    end ;\nwalk(if type==\"object\" then addnow else . end)"
    ],
    "data": [
      {
        "input": {
          "pasxml": {
            "@version": "6.0.0",
            "system": {
              "name": "example1",
              "IPV4": "0.0.0.0",
              "alertLevel": "critical",
              "state": "online"
            },
            "volumes": {
              "volume": [
                {
                  "@id": "1",
                  "name": "/",
                  "bladesetName": {
                    "@id": "1",
                    "#text": "Set-1"
                  },
                  "state": "Online",
                  "raid": "Object RAID6+",
                  "director": "Shelf-001,1",
                  "volservice": "0x0400000000000004(FM)",
                  "objectId": "I-xD0200000000000004-xG7ee84b0c-xU00004a75726a656e",
                  "recoveryPriority": "1",
                  "efsaMode": "retry",
                  "spaceUsedGB": "0",
                  "spaceAvailableGB": "693903.38",
                  "hardQuotaGB": "0.52",
                  "softQuotaGB": "0.52",
                  "userQuotaPolicy": {
                    "@inherit": "0",
                    "#text": "disabled"
                  },
                  "stats": null
                }
              ]
            }
          }
        }
      }
    ],
    "identifier": 53779150,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to achieve following output: \r\n\r\n    {\r\n    \t\u201caccount\u201d: \u201c1\u201d\r\n    \t\u201ccost\u201d: \u201c17.01\u201d\r\n    }\r\n    {\r\n    \t\u201caccount\u201d: \u201c2\u201d\r\n    \t\u201ccost\u201d: \u201c2.25\u201d\r\n    }\r\n",
      "I have following json: \r\n\r\n     [\r\n      {\r\n      \"account\": \"1\",\r\n      \"cost\": [\r\n          {\r\n             \"usage\":\"low\",\r\n             \"totalcost\": \"2.01\"\r\n          }\r\n       ]\r\n      },\r\n      {\r\n      \"account\": \"2\",\r\n      \"cost\": [\r\n          {\r\n             \"usage\":\"low\",\r\n             \"totalcost\": \"2.25\"\r\n          }\r\n       ]\r\n      },\r\n      {\r\n      \"account\": \"1\",\r\n      \"cost\": [\r\n          {\r\n             \"usage\":\"low\",\r\n             \"totalcost\": \"15\"\r\n          }\r\n       ]\r\n      }\r\n    ]"
    ],
    "utterance": "Aggregate the total cost for each account, summing the totalcost fields by account number.",
    "expressions": [
      "[ .[] | select(.account) | {account, cost: (.cost | map(.totalcost | tonumber) | add)} ]"
    ],
    "data": [
      {
        "input": [
          {
            "account": "1",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2.01"
              }
            ]
          },
          {
            "account": "2",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2.25"
              }
            ]
          },
          {
            "account": "1",
            "cost": [
              {
                "usage": "low",
                "totalcost": "15"
              }
            ]
          },
          {
            "anotheraccount": "a",
            "cost": [
              {
                "usage": "low",
                "totalcost": "2"
              }
            ]
          }
        ],
        "output": [
          {
            "account": "1",
            "cost": 17.01
          },
          {
            "account": "2",
            "cost": 2.25
          }
        ]
      }
    ],
    "identifier": 53781422,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "If a volume is not attached, it has an empty value for \"attach time\". I wold like to replace empty values for attach time with an empty array.",
      "If I try to substitute an empty array, I get this error:",
      "cat nonprod-ebs-sanitized.json | jq -r '.Volumes[]  | [ .VolumeId, .Size, (.Attachments[].AttachTime  // [] )] | @csv'",
      "jq: error (at <stdin>:4041): array ([]) is not valid in a csv row",
      "it would make sense to use \"[]\" instead of [] in the expression // []"
    ],
    "utterance": "Replace missing AttachTime values with the string \"[]\" instead of an actual array when generating CSV rows that include VolumeId, Size, and AttachTime.",
    "expressions": [
      ".Volumes[] | [ .VolumeId, .Size, (.Attachments[].AttachTime // \"[]\") ] | @csv"
    ],
    "identifier": 53782274,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "If I try this command:\r\n\r\n    cat ab-prod-images.json |  jq -r '.Images[] | [.ImageId, ([.BlockDeviceMappings[].Ebs.VolumeSize])] | @csv'",
      "jq: error (at <stdin>:53471): array ([100,5,10,2...) is not valid in a csv row",
      "What I want is for a comma separated list of volume sizes to appear in the same cell in the spread sheet.",
      "`@csv`'s input must be a flat array, but your filter (`.Images[] | [.ImageId, ([.BlockDeviceMappings[].Ebs.VolumeSize])]`) produces a nested array because you put volume sizes into a subarray (**[**.BlockDeviceMappings[].Ebs.VolumeSize**]**):",
      "So, use following command instead.",
      "jq -r '.Images[] | [ .ImageId, .BlockDeviceMappings[].Ebs.VolumeSize ] | @csv' ab-prod-images.json",
      "which yields:\n\n    \"ami-000bd263d51e443f2\",100,5,10,20,40,30,5"
    ],
    "utterance": "For each image, output its image id followed by all attached volume sizes as a single CSV row.",
    "expressions": [
      ".Images[] | [ .ImageId, .BlockDeviceMappings[].Ebs.VolumeSize ] | @csv"
    ],
    "data": [
      {
        "input": {
          "Images": [
            {
              "VirtualizationType": "hvm",
              "Description": "TRAN05-12-13-2018",
              "Hypervisor": "xen",
              "ImageId": "ami-000bd263d51e443f2",
              "State": "available",
              "BlockDeviceMappings": [
                {
                  "DeviceName": "/dev/sda1",
                  "Ebs": {
                    "SnapshotId": "snap-0f1076dce3103833f",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 100,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdi",
                  "Ebs": {
                    "SnapshotId": "snap-0c7d71a9d3887a048",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 5,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdg",
                  "Ebs": {
                    "SnapshotId": "snap-00402196b9e023a72",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 10,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdk",
                  "Ebs": {
                    "SnapshotId": "snap-05f65a62735de2846",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 20,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdj",
                  "Ebs": {
                    "SnapshotId": "snap-0d350170b9d5c62bb",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 40,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdh",
                  "Ebs": {
                    "SnapshotId": "snap-04679d09d89609e79",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 30,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdf",
                  "Ebs": {
                    "SnapshotId": "snap-0b5f861fd93da6b20",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 5,
                    "Encrypted": false
                  }
                }
              ],
              "Architecture": "x86_64",
              "ImageLocation": "155775729998/AB-P-RIDD-TRAN05-12-13-2018",
              "RootDeviceType": "ebs",
              "OwnerId": "123456789101",
              "RootDeviceName": "/dev/sda1",
              "CreationDate": "2018-12-14T01:39:31.000Z",
              "Public": false,
              "ImageType": "machine",
              "Name": "TRAN05-12-13-2018"
            }
          ]
        },
        "output": "\"ami-000bd263d51e443f2\",100,5,10,20,40,30,5"
      }
    ],
    "identifier": 53786403,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "But the next value I need to add to the list is a true/false value and jq chokes at the next command:",
      "The goal is to output the csv to a spread sheet and have all the DeleteOnTermination values separated by a comma on one line in a cell for each image.",
      "How can get the DeleteOnTermination value to appear in one csv cell, separated by commas with one DeleteOnTermination value for each volume?",
      "You need to use `tostring`.",
      "jq -r '.Images[] | [.ImageId, ( [.BlockDeviceMappings[].Ebs.DeleteOnTermination | tostring] | join(\",\") )] | @csv' file",
      "\"ami-000bd263d51e443f2\",\"true,true,true,true,true,true,true\""
    ],
    "utterance": "For each image, output the ImageId and all BlockDeviceMappings[].Ebs.DeleteOnTermination values as comma-separated booleans in a single csv cell.",
    "expressions": [
      ".Images[] | [.ImageId, ([.BlockDeviceMappings[].Ebs.DeleteOnTermination | tostring] | join(\",\"))] | @csv"
    ],
    "data": [
      {
        "input": {
          "Images": [
            {
              "VirtualizationType": "hvm",
              "Description": "AB-P-RIDD-TRAN05-12-13-2018",
              "Hypervisor": "xen",
              "ImageId": "ami-000bd263d51e443f2",
              "State": "available",
              "BlockDeviceMappings": [
                {
                  "DeviceName": "/dev/sda1",
                  "Ebs": {
                    "SnapshotId": "snap-0f1076dce3103833f",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 100,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdi",
                  "Ebs": {
                    "SnapshotId": "snap-0c7d71a9d3887a048",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 5,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdg",
                  "Ebs": {
                    "SnapshotId": "snap-00402196b9e023a72",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 10,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdk",
                  "Ebs": {
                    "SnapshotId": "snap-05f65a62735de2846",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 20,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdj",
                  "Ebs": {
                    "SnapshotId": "snap-0d350170b9d5c62bb",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 40,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdh",
                  "Ebs": {
                    "SnapshotId": "snap-04679d09d89609e79",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 30,
                    "Encrypted": false
                  }
                },
                {
                  "DeviceName": "/dev/sdf",
                  "Ebs": {
                    "SnapshotId": "snap-0b5f861fd93da6b20",
                    "DeleteOnTermination": true,
                    "VolumeType": "gp2",
                    "VolumeSize": 5,
                    "Encrypted": false
                  }
                }
              ],
              "Architecture": "x86_64",
              "ImageLocation": "123456789191/AB-P-RIDD-TRAN05-12-13-2018",
              "RootDeviceType": "ebs",
              "OwnerId": "123456789191",
              "RootDeviceName": "/dev/sda1",
              "CreationDate": "2018-12-14T01:39:31.000Z",
              "Public": false,
              "ImageType": "machine",
              "Name": "AB-P-RIDD-TRAN05-12-13-2018"
            }
          ]
        },
        "output": "\"ami-000bd263d51e443f2\",\"true,true,true,true,true,true,true\""
      }
    ],
    "identifier": 53787854,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Basically I'd just like the key of the inner-most object along with its value for `hw_system_serial`, returned on the same line. Something like this:",
      "hostname01-con: <SERIAL_NUMBER_1>",
      "hostname02-con: <SERIAL_NUMBER_2>",
      "etc",
      "jq -r '.plays[].tasks[].hosts | to_entries[] | \"\\(.key +\": \"+ .value.hw_system_serial)\"' input.json",
      "The output:",
      "hostname01-con: <SERIALNUMBER>",
      "hostname02-con: <SERIALNUMBER>",
      "hostname03-con: <SERIALNUMBER>",
      "hostname04-con: <SERIALNUMBER>",
      "hostname05-con: <SERIALNUMBER>",
      "hostname06-con: <SERIALNUMBER>",
      "hostname07-con: <SERIALNUMBER>"
    ],
    "utterance": "Return each host name key and its corresponding hw_system_serial value as lines in the format 'hostname: SERIALNUMBER'.",
    "expressions": [
      ".plays[].tasks[].hosts | to_entries[] | \"\b(.key): \b(.value.hw_system_serial)\""
    ],
    "data": [
      {
        "input": {
          "custom_stats": {},
          "plays": [
            {
              "play": {
                "id": "001dd8b7-1cca-07c6-2280-00000000043a",
                "name": "Ansible Ad-Hoc"
              },
              "tasks": [
                {
                  "hosts": {
                    "hostname01-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname02-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname03-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname04-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname05-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname06-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    },
                    "hostname07-con": {
                      "_ansible_no_log": false,
                      "_ansible_verbose_always": true,
                      "action": "debug",
                      "changed": false,
                      "hw_system_serial": "<SERIALNUMBER>"
                    }
                  },
                  "task": {
                    "id": "001dd8b7-1cca-07c6-2280-00000000043c",
                    "name": "debug"
                  }
                }
              ]
            }
          ],
          "stats": {}
        },
        "output": [
          "hostname01-con: <SERIALNUMBER>",
          "hostname02-con: <SERIALNUMBER>",
          "hostname03-con: <SERIALNUMBER>",
          "hostname04-con: <SERIALNUMBER>",
          "hostname05-con: <SERIALNUMBER>",
          "hostname06-con: <SERIALNUMBER>",
          "hostname07-con: <SERIALNUMBER>"
        ]
      }
    ],
    "identifier": 53795759,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "but how do i now apply this simple logic:\n\n    intersection = unique((A+B) - (A U B))\ni'm used to one liners, and I'd like this snippet to be readable and elegant for future use.\nso how do i implement this in jq style?",
      "jq -cn '[\"a\",\"b\",\"e\",\"c\"] as $A | [\"g\",\"a\",\"t\",\"c\"] as $B | $A - ($A - $B)'",
      "echo '{ \"group_a\" : [\"a\",\"b\",\"e\",\"c\"], \"group_b\" : [\"g\",\"a\",\"t\",\"c\"] }' |\n jq -c '.group_a - (.group_a - .group_b)'",
      "They both yield:\n[\"a\",\"c\"]"
    ],
    "utterance": "Find the intersection of the arrays in the keys group_a and group_b, returning only elements present in both.",
    "expressions": [
      ".group_a - (.group_a - .group_b)",
      "[\"a\",\"b\",\"e\",\"c\"] as $A | [\"g\",\"a\",\"t\",\"c\"] as $B | $A - ($A - $B)"
    ],
    "data": [
      {
        "input": {
          "group_a": [
            "a",
            "b",
            "e",
            "c"
          ],
          "group_b": [
            "g",
            "a",
            "t",
            "c"
          ]
        },
        "output": [
          "a",
          "c"
        ]
      }
    ],
    "identifier": 53810833,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am able to get up to the '.DBInstances[].DBSubnetGroup[]' level in the [ORIGINAL JSON][1] file. With this command:",
      "jq -r '.DBInstances[].DBSubnetGroup[]' ab-prod-rds.json",
      "But I am unable to get at the information in the '.DBInstances[].DBSubnetGroup[].Subnets.SubnetStatus' level of the same.",
      "When I give this command: jq -r '.DBInstances[].DBSubnetGroup[].Subnets.SubnetStatus' ab-prod-rds.json",
      "The error I get is: jq: error (at ab-prod-rds.json:618): Cannot index array with string \"Subnets\"",
      "If you just want to output all the subnet statuses, your query should be this:",
      ".DBInstances[].DBSubnetGroup.Subnets[].SubnetStatus"
    ],
    "utterance": "Retrieve all subnet statuses within each instance's subnet group.",
    "expressions": [
      ".DBInstances[].DBSubnetGroup.Subnets[].SubnetStatus"
    ],
    "identifier": 53822333,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input JSON structure or provide an example, so it is unclear how to extract 'subnet statuses' or what the expected output should be.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to export them to JSON grouped by the username like the following",
      "If the CSV can be accurately parsed by simply splitting on commas, then the desired transformation can be accomplished using the following jq filter:",
      "aggregate_by(inputs | split(\",\"); .[0]; .[1])",
      "This assumes jq is invoked with the -R (raw) and -n options.",
      "With the given CSV input, the output would be:",
      "{\n  \"b00nw33\": [\n    \"harrypotter788.flv\",\n    \"harrypotter788.mov\",\n    \"levitation271.avi\"\n  ],\n  \"b01spider\": [\n    \"schimbvalutar109.avi\"\n  ]\n}"
    ],
    "utterance": "Group rows by the first column value so that each username maps to a list of their associated file names.",
    "expressions": [
      "aggregate_by(inputs | split(\",\"); .[0]; .[1])"
    ],
    "data": [
      {
        "input": [
          "b00nw33,harrypotter788.flv",
          "b00nw33,harrypotter788.mov",
          "b00nw33,levitation271.avi",
          "b01spider,schimbvalutar109.avi"
        ],
        "output": {
          "b00nw33": [
            "harrypotter788.flv",
            "harrypotter788.mov",
            "levitation271.avi"
          ],
          "b01spider": [
            "schimbvalutar109.avi"
          ]
        }
      }
    ],
    "identifier": 53825395,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have 2 hosted zone with the same name. I want to get the hostedZoneId of a Hostedzone used for us-west-2 region.",
      "I dont have any uniqueness now to identify the HostedZone used for Us other than the Comment.",
      "It is a simple filter on jq to use endswith or test to match us-west-2 on the .Config.Comment field value.",
      "jq '.HostedZones[] | select( .Config.Comment | test(\"us-west-2$\") ).Id'"
    ],
    "utterance": "Select the Id field of the HostedZone whose Config.Comment ends with 'us-west-2'.",
    "expressions": [
      ".HostedZones[] | select(.Config.Comment | test(\"us-west-2$\")) .Id",
      ".HostedZones[] | select(.Config.Comment | endswith(\"us-west-2\")) .Id"
    ],
    "data": [
      {
        "input": {
          "HostedZones": [
            {
              "ResourceRecordSetCount": 3,
              "CallerReference": "2018-08-07T14:02:30.733383821+05:30",
              "Config": {
                "Comment": "Private Hosted Zone for tenant:us-west-2",
                "PrivateZone": true
              },
              "Id": "/hostedzone/D2JGX0PDINSIDA",
              "Name": "domainname."
            },
            {
              "ResourceRecordSetCount": 3,
              "CallerReference": "2018-08-16T16:38:29.821900042+05:30",
              "Config": {
                "Comment": "Private Hosted Zone for tenant:eu-west-1",
                "PrivateZone": true
              },
              "Id": "/hostedzone/Q1HEEHGD5JH3G3",
              "Name": "domainname."
            }
          ],
          "DNSName": "domainname",
          "IsTruncated": false,
          "MaxItems": "100"
        },
        "output": "/hostedzone/D2JGX0PDINSIDA"
      }
    ],
    "identifier": 53828173,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to discard when \"bar\"=\"cccc\" without altering the structure.",
      "something like this does the trick but it alters the JSON structure.",
      "needed result is:\r\n\r\n    {\r\n      \"foofoo\": {\r\n        \"barbar\": [\r\n        {\r\n          \"foo\": \"0000\",\r\n          \"bar\": \"aaaa\"\r\n        }\r\n        {\r\n          \"foo\": \"1111\",\r\n          \"bar\": \"bbbb\"\r\n        }\r\n        ]\r\n      }\r\n    }",
      "What you want instead is to reset it:",
      ".foofoo.barbar |= map(select(.bar !=\"cccc\"))"
    ],
    "utterance": "Remove elements from the 'barbar' array where the 'bar' property is equal to 'cccc', retaining the same structure.",
    "expressions": [
      ".foofoo.barbar |= map(select(.bar !=\"cccc\"))"
    ],
    "data": [
      {
        "input": {
          "foofoo": {
            "barbar": [
              {
                "foo": "0000",
                "bar": "aaaa"
              },
              {
                "foo": "1111",
                "bar": "bbbb"
              },
              {
                "foo": "2222",
                "bar": "cccc"
              }
            ]
          }
        },
        "output": {
          "foofoo": {
            "barbar": [
              {
                "foo": "0000",
                "bar": "aaaa"
              },
              {
                "foo": "1111",
                "bar": "bbbb"
              }
            ]
          }
        }
      }
    ],
    "identifier": 53855828,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Gathering all root to leaf values along every path in a tree-like json object with JQ.",
      "I have this",
      "{\n     \"cond1\":[\"a\",\"b\",\"c\"],\n     \"cond2\":[\"a1\",\"b2\",\"c\"],\n     \"cond2\":[\"a\",\"b\",\"c3\"]    \n }",
      "I want this",
      "[\"cond1\",\"a\"]\n[\"cond1\",\"b\"]\n[\"cond1\",\"c\"]\n[\"cond2\",\"a1\"]\n[\"cond2\",\"b2\"]\n[\"cond2\",\"c\"]\n[\"cond2\":\"a\"]\n[\"cond2\":\"b\"]\n[\"cond2\":\"c3\"]",
      "The key to answering both questions is `paths`.",
      "For example, the following filter produces the desired output in the first case:",
      "paths as $p\n| getpath($p)\n| select(scalars)\n| ($p | map(select(type==\"string\"))) + [.]"
    ],
    "utterance": "List all root-to-leaf paths as arrays where the last element is a leaf value in the tree, including arrays and nested arrays as children.",
    "expressions": [
      "paths as $p | getpath($p) | select(scalars) | ($p | map(select(type==\"string\"))) + [.]"
    ],
    "data": [
      {
        "input": {
          "cond1": [
            "a",
            "b",
            "c"
          ],
          "cond2": [
            "a1",
            "b2",
            "c"
          ]
        },
        "output": [
          [
            "cond1",
            "a"
          ],
          [
            "cond1",
            "b"
          ],
          [
            "cond1",
            "c"
          ],
          [
            "cond2",
            "a1"
          ],
          [
            "cond2",
            "b2"
          ],
          [
            "cond2",
            "c"
          ]
        ]
      },
      {
        "input": {
          "cond1": [
            "a",
            "b",
            "c"
          ],
          "cond2": [
            [
              "x2",
              "x3"
            ],
            [
              "x1",
              "x2"
            ]
          ]
        },
        "output": [
          [
            "cond1",
            "a"
          ],
          [
            "cond1",
            "b"
          ],
          [
            "cond1",
            "c"
          ],
          [
            "cond2",
            "x2"
          ],
          [
            "cond2",
            "x3"
          ],
          [
            "cond2",
            "x1"
          ],
          [
            "cond2",
            "x2"
          ]
        ]
      }
    ],
    "identifier": 53864041,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "When modifying objects in jq, all it takes really is:",
      "# the environment object with the name \"JAVA_OPTS\"",
      ".environment[] | select(.name == \"JAVA_OPTS\")",
      "# update the value property using (= or |=)",
      ".environment[] | select(.name == \"JAVA_OPTS\").value = ...",
      "sub(\"(?<k>-Dspring\\\\.cloud\\\\.config\\\\.label)=<some value>\", \"\\(.k)=${Config_Label}\")",
      "Putting that all together:",
      ".environment[] | select(.name == \"JAVA_OPTS\").value",
      "|= sub(\"(?<k>-Dspring\\\\.cloud\\\\.config\\\\.label)=<some value>\", \"\\(.k)=${Config_Label}\")"
    ],
    "utterance": "Replace the value of the parameter -Dspring.cloud.config.label in the environment entry where name is JAVA_OPTS with ${Config_Label}.",
    "expressions": [
      ".environment[] | select(.name == \"JAVA_OPTS\").value |= sub(\"(?<k>-Dspring\\\\.cloud\\\\.config\\\\.label)=<some value>\", \"\\(.k)=${Config_Label}\")"
    ],
    "data": [
      {
        "input": {
          "environment": [
            {
              "name": "DD_AGENT_HOST",
              "value": "somevalue.com"
            },
            {
              "name": "DD_AGENT_PORT",
              "value": "1234"
            },
            {
              "name": "DD_TRACE_SPAN_TAGS",
              "value": "env:test"
            },
            {
              "name": "JAVA_OPTS",
              "value": "-javaagent:/dd-java-agent/dd-java-agent.jar -Xms512m -Xmx2548m -XX:MaxPermSize=512m -Dspring.profiles.active=testapp -Dspring.cloud.config.label=<some value> -Duser.timezone=America/New_York"
            }
          ]
        },
        "output": {
          "environment": [
            {
              "name": "DD_AGENT_HOST",
              "value": "somevalue.com"
            },
            {
              "name": "DD_AGENT_PORT",
              "value": "1234"
            },
            {
              "name": "DD_TRACE_SPAN_TAGS",
              "value": "env:test"
            },
            {
              "name": "JAVA_OPTS",
              "value": "-javaagent:/dd-java-agent/dd-java-agent.jar -Xms512m -Xmx2548m -XX:MaxPermSize=512m -Dspring.profiles.active=testapp -Dspring.cloud.config.label=${Config_Label} -Duser.timezone=America/New_York"
            }
          ]
        }
      }
    ],
    "identifier": 53872349,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am using jq and want to get the \"Id\" value of the object where 'Name' is 'internal.mydomain.com.'.",
      "Use `select()` filter on the name and get the id value with the following filter",
      "jq '.HostedZones[] | select (.Name==\"internal.mydomain.com.\").Id'"
    ],
    "utterance": "Get the Id of the object in HostedZones where Name is \"internal.mydomain.com.\"",
    "expressions": [
      ".HostedZones[] | select(.Name == \"internal.mydomain.com.\").Id"
    ],
    "data": [
      {
        "input": {
          "HostedZones": [
            {
              "Id": "Z3JX6LQV6IJO3E",
              "Name": "sandbox.mydomain.com."
            },
            {
              "Id": "Z13M9NFG2E4J5N",
              "Name": "internal.mydomain.com."
            }
          ]
        },
        "output": "Z13M9NFG2E4J5N"
      }
    ],
    "identifier": 53873159,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to match `.Rules[].Filter.Prefix==\"to_me/\"` and then change the associated `\"Status\": \"Enabled\"` to `\"Disabled\"`.",
      "Doing in-place updates can be done with `|=`, and deciding whether to modify content in-place can be done with if/then/else. Thus:",
      "jq '.Rules[] |= (if .Filter.Prefix == \"to_me/\" then .Status=\"Disabled\" else . end)'",
      "I prefer the idiom `ARRAY |= map(...)` over `ARRAY[] |= ...`, mainly because the former can be used reliably whether or not any of the substitutions evaluate to `empty`:",
      "jq '.Rules |= map(if .Filter.Prefix == \"to_me/\" \n                      then .Status=\"Disabled\" else . end)'"
    ],
    "utterance": "Change the Status value to Disabled for each object in Rules whose Filter.Prefix equals to_me/.",
    "expressions": [
      ".Rules[] |= (if .Filter.Prefix == \"to_me/\" then .Status=\"Disabled\" else . end)",
      ".Rules |= map(if .Filter.Prefix == \"to_me/\" then .Status=\"Disabled\" else . end)"
    ],
    "data": [
      {
        "input": {
          "Rules": [
            {
              "Filter": {
                "Prefix": "to_me/"
              },
              "Status": "Enabled"
            },
            {
              "Filter": {
                "Prefix": "from_me/"
              },
              "Status": "Enabled"
            },
            {
              "Filter": {
                "Prefix": "__bg/"
              },
              "Status": "Enabled"
            }
          ]
        },
        "output": {
          "Rules": [
            {
              "Filter": {
                "Prefix": "to_me/"
              },
              "Status": "Disabled"
            },
            {
              "Filter": {
                "Prefix": "from_me/"
              },
              "Status": "Enabled"
            },
            {
              "Filter": {
                "Prefix": "__bg/"
              },
              "Status": "Enabled"
            }
          ]
        }
      }
    ],
    "identifier": 53877887,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How do I write a query that returns 'https://www.example.org/simplecategory?_format=hal_json'?",
      "I've then tried various combinations to get to the value of  `https://www.example.org/rest/relation/node/recording/field_category`.",
      "If you want to specify the top-level key explicitly, the follow-on query would be:",
      ".[\"https://www.example.org/rest/relation/node/recording/revision_uid\"][]\n| ._links.self.href",
      "That is, the entire query would be:\n\n    ._embedded\n    | .[\"https://www.example.org/rest/relation/node/recording/revision_uid\"][]\n    | ._links.self.href",
      "An alternative to specifying the top-level key explicitly might be to select the href of interest from the array of all of them:",
      "._embedded\n | [.[][]._links.self.href]"
    ],
    "utterance": "Extract the 'self.href' value from the _links object inside the https://www.example.org/rest/relation/node/recording/field_category key under _embedded.",
    "expressions": [
      "._embedded[\"https://www.example.org/rest/relation/node/recording/field_category\"][] | ._links.self.href"
    ],
    "data": [
      {
        "input": {
          "https://www.example.org/rest/relation/node/recording/revision_uid": [
            {
              "_links": {
                "self": {
                  "href": "https://www.example.org/user/37?_format=hal_json"
                },
                "type": {
                  "href": "https://www.example.org/rest/type/user/user"
                }
              },
              "uuid": [
                {
                  "value": "d40684cf-2321-42d2-9194"
                }
              ]
            }
          ],
          "https://www.example.org/rest/relation/node/recording/uid": [
            {
              "_links": {
                "self": {
                  "href": "https://www.example.org/user/37?_format=hal_json"
                },
                "type": {
                  "href": "https://www.example.org/rest/type/user/user"
                }
              },
              "uuid": [
                {
                  "value": "d40684cf-2321-42d2-9194"
                }
              ],
              "lang": "en"
            }
          ],
          "https://www.example.org/rest/relation/node/recording/field_category": [
            {
              "_links": {
                "self": {
                  "href": "https://www.example.org/simplecategory?_format=hal_json"
                },
                "type": {
                  "href": "https://www.example.org/rest/type/taxonomy_term/tags"
                }
              },
              "uuid": [
                {
                  "value": "3fef93d5-926a-41aa-95cb"
                }
              ]
            }
          ],
          "https://www.example.org/rest/relation/node/recording/field_part1_speaker": [
            {
              "_links": {
                "self": {
                  "href": "https://www.example.org/by/speakername?_format=hal_json"
                },
                "type": {
                  "href": "https://www.example.org/rest/type/taxonomy_term/author"
                }
              },
              "uuid": [
                {
                  "value": "fb6c806f-fa6a-4aa0-89ef"
                }
              ]
            }
          ]
        },
        "output": "https://www.example.org/simplecategory?_format=hal_json"
      }
    ],
    "identifier": 53893946,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Is it possible to filter an entire array of items in JQ in only one pass?",
      "It keeps looping through the array, removing the unneeded entries and storing same file again by using sponge.",
      "any ideas how to achieve a similar behavior with a lighter code?",
      "arr=(\"1111\" \"2222\")",
      "jq '($p | split(\" \") | map(tonumber)) as $exclude \n    | .foofoo.barbar \n      |= map(select(.bar as $b \n                    | any($exclude[]; . == $b) | not))' \\\n    --arg p \"${arr[*]}\" file.json | sponge file.json",
      "The output:\n\n    {\n      \"foofoo\": {\n        \"barbar\": [\n          {\n            \"foo\": \"aaaa\",\n            \"bar\": 0\n          }\n        ]\n      }\n    }"
    ],
    "utterance": "Remove all objects in the 'foofoo.barbar' array whose 'bar' value matches any value in the list 1111 or 2222.",
    "expressions": [
      "($p | split(\" \") | map(tonumber)) as $exclude | .foofoo.barbar |= map(select(.bar as $b | any($exclude[]; . == $b) | not))",
      "INDEX($p | split(\" \")[]; .) as $dict | .foofoo.barbar |= map(select($dict[.bar|tostring] | not))"
    ],
    "data": [
      {
        "input": {
          "foofoo": {
            "barbar": [
              {
                "foo": "aaaa",
                "bar": 0
              },
              {
                "foo": "bbbb",
                "bar": 1111
              },
              {
                "foo": "cccc",
                "bar": 2222
              }
            ]
          }
        },
        "output": {
          "foofoo": {
            "barbar": [
              {
                "foo": "aaaa",
                "bar": 0
              }
            ]
          }
        }
      }
    ],
    "identifier": 53896658,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to convert it to an array of strings by flattening the structure\n\n    [\n      \"key1.value1\",\n      \"key2.value2\",\n      \"key3.subKey1.value3\",\n      \"key3.subKey2.value4\",\n    ]",
      "Short alternative:\n\n    jq '[paths(scalars) as $p | $p + [getpath($p)] | join(\".\")]' f.json\n\nThe expected output:\n\n    [\n      \"key1.value1\",\n      \"key2.value2\",\n      \"key3.subKey1.value3\",\n      \"key3.subKey2.value4\"\n    ]"
    ],
    "utterance": "Flatten a nested object so each leaf value is represented as a string in the format 'key1.key2.value' and collect all such strings in an array.",
    "expressions": [
      "[paths(scalars) as $p | $p + [getpath($p)] | join(\".\")]"
    ],
    "data": [
      {
        "input": {
          "key1": "value1",
          "key2": "value2",
          "key3": {
            "subKey1": "value3",
            "subKey2": "value4"
          }
        },
        "output": [
          "key1.value1",
          "key2.value2",
          "key3.subKey1.value3",
          "key3.subKey2.value4"
        ]
      }
    ],
    "identifier": 53899837,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'm trying to format the output as:\n\n    {\n      \"foo\": {\n        \"bar\": \"0.00000072\"\n      }\n    }",
      "You could use this generic function:\n\ndef to_decimal:\n  def rpad(n): if (n > length) then . + ((n - length) * \"0\") else . end;\n  def lpad(n): if (n > length) then ((n - length) * \"0\") + . else . end;\n  tostring\n  | . as $s\n  | capture( \"(?<sgn>[+-]?)(?<left>[0-9]*)(?<p>\\.?)(?<right>[0-9]*)(?<e>[Ee]?)(?<exp>[+-]?[0-9]+)\" )\n  | if .e == \"\" then $s\n    else (.left|length) as $len\n    | (.exp | tonumber) as $exp\n    | if $exp < 0 then .sgn + \"0.\" + (.left | lpad(1 - $exp - $len)) + .right\n      else .sgn + (.left | rpad($exp - $len)) + \".\" + .right\n      end\n    end ;\n\nExample:\n\n    \"7.2e-07\"|to_decimal\n\nyields:\n\n    \"0.00000072\""
    ],
    "utterance": "Format a small floating-point value so that the output uses standard decimal notation with leading zeros instead of scientific notation, e.g., '0.00000072'",
    "expressions": [
      "def to_decimal:\n  def rpad(n): if (n > length) then . + ((n - length) * \"0\") else . end;\n  def lpad(n): if (n > length) then ((n - length) * \"0\") + . else . end;\n  tostring\n  | . as $s\n  | capture( \"(?<sgn>[+-]?)(?<left>[0-9]*)(?<p>\\.?)(?<right>[0-9]*)(?<e>[Ee]?)(?<exp>[+-]?[0-9]+)\" )\n  | if .e == \"\" then $s\n    else (.left|length) as $len\n    | (.exp | tonumber) as $exp\n    | if $exp < 0 then .sgn + \"0.\" + (.left | lpad(1 - $exp - $len)) + .right\n      else .sgn + (.left | rpad($exp - $len)) + \".\" + .right\n      end\n    end ;"
    ],
    "data": [
      {
        "input": {
          "foo": {
            "bar": 7.2e-07
          }
        },
        "output": {
          "foo": {
            "bar": "0.00000072"
          }
        }
      }
    ],
    "identifier": 53915068,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am looking to validate that the sort order in which these records are stored are sorted by memberId ASC, memberCity ASC, lastTransaction ASC",
      "Is there a way via jq that I can assert (true/false) the sort order, taking multiple fields into account?",
      "jq -s 'map([.memberId, .memberCity, .lastTransaction]) | . == sort'",
      "jq 'all(foreach [inputs[]] as $i ([[.[]]]; [.[-1],$i]); .[0]<=.[1])'"
    ],
    "utterance": "Determine if the records are sorted in ascending order by memberId, then memberCity, then lastTransaction, returning true or false.",
    "expressions": [
      "jq -s 'map([.memberId, .memberCity, .lastTransaction]) | . == sort'",
      "jq 'all(foreach [inputs[]] as $i ([[.[]]]; [.[-1],$i]); .[0]<=.[1])'"
    ],
    "data": [
      {
        "input": [
          {
            "memberId": 123,
            "memberCity": "Chicago",
            "lastTransaction": 1504155600000
          },
          {
            "memberId": 123,
            "memberCity": "Chigago",
            "lastTransaction": 150175600000
          },
          {
            "memberId": 123,
            "memberCity": "New York",
            "lastTransaction": 150195600000
          }
        ]
      }
    ],
    "identifier": 53936117,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "jq reads by default from a hidden folder in your root directory files marked with .jq file extension: ~/.jq ([\"~/.jq\",  \"$ORIGIN/../lib/jq\",  \"$ORIGIN/../lib\"])",
      "To reference the module you can use the import function, then follow your normal jq command after a semicolon. The \"as lib\" below allows you to change the name of the namespace as well:",
      "jq 'import \"urldecode\" as lib; .http.referrer | lib::url_decode' file.json",
      "In theory, with your setup, you should be able to invoke jq along the lines of",
      "jq 'import \"urldecode\" as url_decode;\n      .http.referrer | url_decode::url_decode' file.json",
      "or more simply:",
      "jq 'include \"urldecode\";\n      .http.referrer | url_decode' file.json"
    ],
    "utterance": "Extract the '.http.referrer' field from the input and apply the custom 'url_decode' function as defined in the 'urldecode' module.",
    "expressions": [
      "import \"urldecode\" as lib; .http.referrer | lib::url_decode",
      "import \"urldecode\" as url_decode; .http.referrer | url_decode::url_decode",
      "include \"urldecode\"; .http.referrer | url_decode"
    ],
    "identifier": 53937411,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": "Requires the 'urldecode' module to be available in the jq environment."
    }
  },
  {
    "context": [
      "I would like to sum the Amount from the stream of JSON transactions. Each object contains data for 1 transaction.",
      "Is there a way to make JQ sum all numbers from a field in a stream, am I doing something wrong? Is there an easier method to sum all numbers for a field in JSON stream where I am taking the hard route?",
      "The problem is the way you're using `inputs`. According to the documentation, `inputs` \"outputs all remaining inputs, one by one\". Fine, but note the \"remaining\" part. By the time your code is run, the *first* input has already been read into `.`, so `inputs` returns all of the records *except* that one. The simple fix is to use the `-n` option to jq, which runs the filter once, with the `null` value in `.`. This leaves *all* of the records available for reading by `inputs`, which makes your code work correctly."
    ],
    "utterance": "Sum all values of the Amount field from a stream of transaction objects.",
    "expressions": [
      "jq -n 'reduce inputs.Amount as $a (0; . + $a)'",
      "jq -n 'inputs | .Amount' | jq -s 'add'"
    ],
    "data": [
      {
        "input": [
          {
            "Originating Account Number": "####-####-####-0000",
            "Cardholder Name": "NAME ON CARD",
            "Merchant Name": "VENDOR NAME",
            "Amount": 342.75,
            "MCC Description": "Computer Software"
          },
          {
            "Originating Account Number": "####-####-####-0000",
            "Cardholder Name": "NAME ON CARD",
            "Merchant Name": "VENDOR NAME",
            "Amount": 492.71,
            "MCC Description": "Computer Software"
          },
          {
            "Originating Account Number": "####-####-####-0000",
            "Cardholder Name": "NAME ON CARD",
            "Merchant Name": "VENDOR NAME",
            "Amount": 573,
            "MCC Description": "Computer Software"
          }
        ],
        "output": 1408.46
      }
    ],
    "identifier": 53937609,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": true,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "jq -r '.head.url | split(\";\")[1] | split(\"=\")[1]'",
      "jq -r '.head.url\n             | split(\";\")[]\n             | select(test(\"barid=\"))\n             | sub(\"barid=\";\"\")'"
    ],
    "utterance": "Extract the value of barid from the head.url field, where the URL contains a segment in the form barid=347EDAFA2B136D7825745B0A490DE32.",
    "expressions": [
      ".head.url | split(\";\")[1] | split(\"=\")[1]",
      ".head.url | split(\";\")[] | select(test(\"barid=\")) | sub(\"barid=\";\"\")"
    ],
    "data": [
      {
        "input": {
          "head": {
            "url": "anotherfoo;barid=347EDAFA2B136D7825745B0A490DE32"
          }
        },
        "output": "347EDAFA2B136D7825745B0A490DE32"
      }
    ],
    "identifier": 53947130,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "My goal is to get the id (E50DA...) after `=` only. I could simply use `[1]` to get the second element and then use a regex to get the part after the `=`.",
      "But the order of elements is not safe and I'm sure there's a better way with jq already that I dont know of. Maybe create a map of the elements and use `myid` as a key to get the value (E50...)?",
      "But alas, it is easily possible by first building an object from the key value pairs and then look up the value for the key in question:",
      ".head.url | split(\";\") | map(split(\"=\") | { key: .[0], value: .[1] }) | from_entries | .myid",
      "Or without building an object, simply by selecting the first array item with matching key, then outputting its value:",
      ".head.url | split(\";\")[] | split(\"=\") | select(first == \"myid\")[1]",
      "Using jq's match() with positive lookbehind to output what's after myid=:",
      ".head.url | split(\";\")[] | match(\"(?<=myid=).*;\"g\").string",
      "or drop the split() and match() after myid= until the end or ;:",
      ".head.url | match(\"(?<=myid=)[^;]*\";\"g\").string"
    ],
    "utterance": "Extract the value associated with the 'myid' key from the semicolon-delimited url string, regardless of the order of elements.",
    "expressions": [
      ".head.url | split(\";\") | map(split(\"=\") | {key: .[0], value: .[1]}) | from_entries | .myid",
      ".head.url | split(\";\")[] | split(\"=\") | select(first == \"myid\")[1]",
      ".head.url | split(\";\")[] | match(\"(?<=myid=).*;\"g).string",
      ".head.url | match(\"(?<=myid=)[^;]*\";\"g\").string"
    ],
    "data": [
      {
        "input": {
          "head": {
            "url": "foobar;myid=E50DAA932C22739F92BB250C14365440"
          }
        },
        "output": "E50DAA932C22739F92BB250C14365440"
      }
    ],
    "identifier": 53959049,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have to get a ID from the inputJson , (activeItem) and use that ID to get the name of the element from list of items below.",
      "Can this be done in single query ?",
      "With your data, the filter:",
      ".amazon\n| .activeitem as $id\n| .items[]\n| select(.id == $id)\n| .name",
      "produces:",
      "\"adidas shoes\""
    ],
    "utterance": "Retrieve the name of the item whose id matches the activeitem value.",
    "expressions": [
      ".amazon | .activeitem as $id | .items[] | select(.id == $id) | .name"
    ],
    "data": [
      {
        "input": {
          "amazon": {
            "activeitem": 2,
            "items": [
              {
                "id": 1,
                "name": "harry potter",
                "state": "sold"
              },
              {
                "id": 2,
                "name": "adidas shoes",
                "state": "in inventory"
              },
              {
                "id": 3,
                "name": "watch",
                "state": "returned"
              }
            ]
          }
        },
        "output": "adidas shoes"
      }
    ],
    "identifier": 53963083,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have the below JSON and would like to get the ip value if the name value matches with the passed input value.",
      "i.e pass the input as abc.com and return the value as 101.0.0.0. How can I achieve this using jq?",
      "When I pass key as abc.com which should return the value 101.0.0.0",
      "jq -r --arg name abc.com '.[][][] | select(.name == $name).value'",
      "jq -r --arg name abc.com '.. | objects | select(.ips) | .ips[] | select(.name == $name).value'",
      "jq -r --arg name abc.com '.. | objects | select(.name == $name).value'"
    ],
    "utterance": "Retrieve the value field of the ips array element where name is abc.com.",
    "expressions": [
      ".[][][] | select(.name == $name).value",
      ".. | objects | select(.ips) | .ips[] | select(.name == $name).value",
      ".. | objects | select(.name == $name).value"
    ],
    "data": [
      {
        "input": [
          {
            "ips": [
              {
                "name": "google.com",
                "value": "172.217.164.110"
              }
            ]
          },
          {
            "ips": [
              {
                "name": "abc.com",
                "value": "101.0.0.0"
              }
            ]
          }
        ],
        "output": "101.0.0.0"
      }
    ],
    "identifier": 53963324,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "$ echo '{\"foo\": 0}' | ./jq-win64.exe .",
      "{\n  \"foo\": 0\n}",
      "$ echo '{\"foo\": 0}' | jq",
      "{\n  \"foo\": 0\n}",
      "Run command\n\necho '{\"foo\": 0}' | jq \n\nOutput\n\n{\n   \"foo\": 0\n}\n\nenjoy!"
    ],
    "utterance": "Display the entire input where it is an object with a property 'foo' set to 0.",
    "expressions": [
      "."
    ],
    "data": [
      {
        "input": {
          "foo": 0
        },
        "output": {
          "foo": 0
        }
      }
    ],
    "identifier": 53967693,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have an array \"operations\" from which I would like to return all the elements that contain a  matching string like `\"w51\"`.",
      "My expected output when searching for an array element that contains \"w51\" would be this:\n\n    [\n          \"create\",\n          \"w51\",\n          \"rwt.widgets.Label\",\n          {\n            \"parent\": \"w41\",\n            \"style\": [\n              \"NONE\"\n            ],\n            \"bounds\": [\n              101,\n              0,\n              49,\n              42\n            ],\n            \"tabIndex\": -1,\n            \"customVariant\": \"variant_pufferLabelLogout\"\n          }\n    ]",
      "If you use jq version 1.4 or later, the following should produce the desired output:\n\n    .operations[]\n    | select( index(\"w51\") )",
      "there are many alternatives, depending on which version of jq you have.  If your jq has `any/0`, the following is an efficient option:\n\n    .operations[] | select( any(. == \"w51\" ) )"
    ],
    "utterance": "Return all elements of the operations array that contain the string \"w51\".",
    "expressions": [
      ".operations[] | select(index(\"w51\"))",
      ".operations[] | select(any(. == \"w51\" ))"
    ],
    "data": [
      {
        "input": {
          "operations": [
            [
              "create",
              "w51",
              "rwt.widgets.Label",
              {
                "parent": "w41",
                "style": [
                  "NONE"
                ],
                "bounds": [
                  101,
                  0,
                  49,
                  42
                ],
                "tabIndex": -1,
                "customVariant": "variant_pufferLabelLogout"
              }
            ],
            [
              "create",
              "w39",
              "rwt.widgets.Composite",
              {
                "parent": "w34",
                "style": [
                  "NONE"
                ],
                "children": [
                  "w52"
                ],
                "bounds": [
                  0,
                  42,
                  762,
                  868
                ],
                "tabIndex": -1,
                "clientArea": [
                  0,
                  0,
                  762,
                  868
                ]
              }
            ]
          ]
        },
        "output": [
          "create",
          "w51",
          "rwt.widgets.Label",
          {
            "parent": "w41",
            "style": [
              "NONE"
            ],
            "bounds": [
              101,
              0,
              49,
              42
            ],
            "tabIndex": -1,
            "customVariant": "variant_pufferLabelLogout"
          }
        ]
      }
    ],
    "identifier": 53978065,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "But what I actually want is the index of the most outer array \"arr\" that this `text:FooBar` pair is nested in, which is 0 in this case.",
      "Performing your filtering and returning the index then becomes fairly trivial:",
      "jq '.arr | to_entries | .[] | select(.value[3].text | contains(\"FooBar\")) | .key' <test.json",
      "Here's a solution that does not depend on the assumption that the object of interest has a fixed position within the array:",
      ".arr\n| map( .[] | objects | .text ) \n| index(\"FooBar\")",
      "More robustly:",
      ".arr \n| map( first(.[] | objects) // null | .text )\n| index(\"FooBar\")"
    ],
    "utterance": "Return the index of the element in the top-level array where an object with the key 'text' equals 'FooBar' is located.",
    "expressions": [
      ".arr | to_entries | .[] | select(.value[3].text | contains(\"FooBar\")) | .key",
      ".arr | map( .[] | objects | .text ) | index(\"FooBar\")",
      ".arr | map( first(.[] | objects) // null | .text ) | index(\"FooBar\")"
    ],
    "data": [
      {
        "input": {
          "arr": [
            [
              "create",
              "w71",
              "rwt.widgets.Label",
              {
                "parent": "w68",
                "style": "1",
                "bounds": "2",
                "tabIndex": -1,
                "background": "ww",
                "font": "test",
                "text": "FooBar",
                "alignment": "right"
              }
            ],
            [
              "create",
              "w72",
              "rwt.widgets.Label",
              {
                "parent": "w68",
                "style": "22",
                "bounds": "1",
                "tabIndex": -1,
                "foreground": "null",
                "background": "1",
                "font": "2",
                "text": "55",
                "alignment": "right"
              }
            ]
          ]
        },
        "output": 0
      }
    ],
    "identifier": 53986312,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "This call finds all elements of `arr` that have a `FooItem` in them and then returns `texts` from the nested array at index `3`:",
      "jq '.arr[] | select(index(\"FooItem\")) | .[3].texts'",
      "But I'd like to create a new json from these objects that looks like this:",
      "{\n\t\"arr\": [\n\t\t[\n\t\t\t\"aBarfoo\",\n\t\t\t\"avalue\"\n\t\t],\n\t\t[\n\t\t\t\"mybarfoo\",\n\t\t\t\"bValue\"\n\t\t]\n\t]\n}",
      "One more addition: Considering that texts also has strings of zero length, how would you delete those/not have them in the result?",
      "\"texts\": [\"\",\n          \"mybarfoo\",\n          \"bValue\",\n          \"\"\n          ]",
      ".arr |= map( select(index(\"FooItem\"))\n             | .[3].texts | map(select(length>0)))"
    ],
    "utterance": "Produce an object with key 'arr' containing arrays from each element with 'FooItem', extracting their 'texts' arrays and removing any empty strings.",
    "expressions": [
      ".arr |= map( select(index(\"FooItem\")) | .[3].texts | map(select(length>0)))"
    ],
    "data": [
      {
        "input": {
          "arr": [
            [
              "create",
              "w199",
              "FooItem",
              {
                "index": 0,
                "texts": [
                  "aBarfoo",
                  "avalue"
                ]
              }
            ],
            [
              "create",
              "w200",
              "NoItem",
              {
                "index": 1,
                "val": 5,
                "hearts": 5
              }
            ],
            [
              "create",
              "w200",
              "FooItem",
              {
                "index": 1,
                "texts": [
                  "mybarfoo",
                  "bValue"
                ]
              }
            ],
            [
              "create",
              "w201",
              "FooItem",
              {
                "index": 2,
                "texts": [
                  "",
                  "mybarfoo",
                  "bValue",
                  ""
                ]
              }
            ]
          ]
        },
        "output": {
          "arr": [
            [
              "aBarfoo",
              "avalue"
            ],
            [
              "mybarfoo",
              "bValue"
            ],
            [
              "mybarfoo",
              "bValue"
            ]
          ]
        }
      }
    ],
    "identifier": 53987713,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am looking to check if there is any duplication of the memberId - ideally return a true/false and then also return the duplicated values.",
      "to produce a true/false indicator, followed by the duplicates if any, you could write:",
      "reduce (inputs|.memberId|tostring) as $id ({}; .[$id] += 1)\n| to_entries\n| map(select(.value > 1))\n| (length > 0), .[].key",
      "The solution can now be written more concisely:",
      "bow(inputs.memberId)\n| to_entries\n| map(select(.value > 1))\n| (length > 0), .[].key"
    ],
    "utterance": "Determine if any memberId values appear more than once and output true if there are duplicates, followed by the duplicated memberId values.",
    "expressions": [
      "reduce (inputs|.memberId|tostring) as $id ({}; .[$id] += 1) | to_entries | map(select(.value > 1)) | (length > 0), .[].key",
      "def bow(stream): reduce stream as $word ({}; .[($word|tostring)] += 1); bow(inputs.memberId) | to_entries | map(select(.value > 1)) | (length > 0), .[].key"
    ],
    "data": [
      {
        "input": [
          {
            "memberId": "123",
            "city": "New York"
          },
          {
            "memberId": "234",
            "city": "Chicago"
          },
          {
            "memberId": "345",
            "city": "San Francisco"
          },
          {
            "memberId": "123",
            "city": "New York"
          },
          {
            "memberId": "345",
            "city": "San Francisco"
          }
        ],
        "output": [
          true,
          "123",
          "345"
        ]
      }
    ],
    "identifier": 53989197,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to display all the values that contain a `-` or a `_` or is `blank`.",
      "I have tried the following:\r\n\r\n    select (. | contains(\"-\",\"_\",\" \"))'",
      "Your query won't detect empty strings, and will possibly emit the same string more than once.  It would be easier to use `test`, e.g.:\r\n\r\n    select( length==0 or test(\"[-_ ]\") )"
    ],
    "utterance": "Select all values that contain a hyphen, underscore, or are blank strings.",
    "expressions": [
      "select(length==0 or test(\"[-_ ]\"))"
    ],
    "data": [
      {
        "input": [
          "2014-03-26 01:58:00",
          "9019549360",
          "109092812_20150626",
          "134670164",
          "",
          "97695498",
          "680561513"
        ],
        "output": [
          "2014-03-26 01:58:00",
          "109092812_20150626",
          ""
        ]
      }
    ],
    "identifier": 53990847,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Just to complete the picture, this is also possible with `jq`, simply `add` two json files:",
      "jq -s 'add' credHelpers.json config.json",
      "The `add` filter takes as input an array, and produces as output the elements of the array added together, as the `jq` [documentation says](https://stedolan.github.io/jq/manual/). In addition you need the `--slurp/-s` command line option, which turns the entire input stream into a large array and runs the filter just once.",
      "What I want is this:",
      "    {",
      "        \"auths\": {",
      "                \"https://index.docker.io/v1/\": {",
      "                        \"auth\": \"supercalifragilistic\"",
      "                }",
      "        },",
      "        \"HttpHeaders\": {",
      "                \"User-Agent\": \"Docker-Client/18.03.0-ce (windows)\"",
      "        },",
      "        \"credHelpers\": {",
      "                \"111111111111.dkr.ecr.us-east-1.amazonaws.com\": \"ecr-login\"",
      "        }",
      "    }"
    ],
    "utterance": "Combine two documents so that all top-level keys from both appear together, resulting in one object containing keys 'auths', 'HttpHeaders', and 'credHelpers'.",
    "expressions": [
      "jq -s 'add' credHelpers.json config.json"
    ],
    "data": [
      {
        "input": [
          {
            "credHelpers": {
              "111111111111.dkr.ecr.us-east-1.amazonaws.com": "ecr-login"
            }
          },
          {
            "auths": {
              "https://index.docker.io/v1/": {
                "auth": "supercalifragilistic"
              }
            },
            "HttpHeaders": {
              "User-Agent": "Docker-Client/18.03.0-ce (windows)"
            }
          }
        ],
        "output": {
          "auths": {
            "https://index.docker.io/v1/": {
              "auth": "supercalifragilistic"
            }
          },
          "HttpHeaders": {
            "User-Agent": "Docker-Client/18.03.0-ce (windows)"
          },
          "credHelpers": {
            "111111111111.dkr.ecr.us-east-1.amazonaws.com": "ecr-login"
          }
        }
      }
    ],
    "identifier": 54001333,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have values of the format \"h:mm AM\"/\"h:mm PM\" in my json that I get from jq.\r\nIs there a way to parse these 12h formatted values and convert them to 24h ones?",
      "Sample Input:\r\n\r\n    { \"times\": [ [ \"8:16 AM\", \"EventA\" ], [ \"3:19 PM\", \"EventB\" ] ]}\r\n\r\nDesired Output:\r\n\r\n    { \"times\": [ [ \"8:16\", \"EventA\" ], [ \"15:19\", \"EventB\" ] ]}",
      "% jq -c '.times[][0] |= (strptime(\"%H:%M %p\") | strftime(\"%H:%M\"))'\r\n{ \"times\": [ [ \"8:16 AM\", \"EventA\" ], [ \"3:19 PM\", \"EventB\" ] ]}\r\n{\"times\":[[\"08:16\",\"EventA\"],[\"15:19\",\"EventB\"]]}"
    ],
    "utterance": "Convert all time strings in the array that are in 12-hour format with AM/PM to their equivalent 24-hour format, keeping the rest of the structure unchanged.",
    "expressions": [
      ".times[][0] |= (strptime(\"%H:%M %p\") | strftime(\"%H:%M\"))",
      ".times |= map(map(if test(\"^\\d{1,2}:\\d{1,2} [AP]M$\"; \"i\") then (strptime(\"%H:%M %p\") | strftime(\"%H:%M\")) else . end))"
    ],
    "data": [
      {
        "input": {
          "times": [
            [
              "8:16 AM",
              "EventA"
            ],
            [
              "3:19 PM",
              "EventB"
            ]
          ]
        },
        "output": {
          "times": [
            [
              "08:16",
              "EventA"
            ],
            [
              "15:19",
              "EventB"
            ]
          ]
        }
      }
    ],
    "identifier": 54006570,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to inplace modify the value for the matched key ... I want more like in-place modify or the whole json from the original with new value modified, like this: ... DB_USERNAME ... \"value\": \"new\" ... Is it possible to do with `jq` or any known workaround?",
      "It is enough to assign to the path, if you are using `|=`, e.g.\n\n    jq '(\n      .taskDefinition.containerDefinitions[0].environment[] | \n       select(.name==\"DB_USERNAME\") | .value) |= \"new\"\n    ' infile.json\n",
      "Here is a select-free solution using `|=`:\n\n    .taskDefinition.containerDefinitions[0].environment |=\n      map(if .name==\"DB_USERNAME\" then .value = \"new\"\n          else . end)\n",
      "You might like to consider this alternative to using `|=`:\n\n    walk( if type==\"object\" and .name==\"DB_USERNAME\" \n          then .value=\"new\" else . end)"
    ],
    "utterance": "Update the value of the environment variable named DB_USERNAME to \"new\" in the container definitions at index 0, preserving the overall structure.",
    "expressions": [
      "(.taskDefinition.containerDefinitions[0].environment[] | select(.name==\"DB_USERNAME\") | .value) |= \"new\"",
      ".taskDefinition.containerDefinitions[0].environment |= map(if .name==\"DB_USERNAME\" then .value = \"new\" else . end)",
      "walk( if type==\"object\" and .name==\"DB_USERNAME\" then .value=\"new\" else . end)"
    ],
    "data": [
      {
        "input": {
          "taskDefinition": {
            "containerDefinitions": [
              {
                "name": "web",
                "image": "my-image",
                "environment": [
                  {
                    "name": "DB_HOST",
                    "value": "localhost"
                  },
                  {
                    "name": "DB_USERNAME",
                    "value": "user"
                  }
                ]
              }
            ]
          }
        },
        "output": {
          "taskDefinition": {
            "containerDefinitions": [
              {
                "name": "web",
                "image": "my-image",
                "environment": [
                  {
                    "name": "DB_HOST",
                    "value": "localhost"
                  },
                  {
                    "name": "DB_USERNAME",
                    "value": "new"
                  }
                ]
              }
            ]
          }
        }
      }
    ],
    "identifier": 54039268,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Using jq I would like to conditionally update the objects in the \"folders\" array.",
      "For each object which contains \"cand_name\":\"folderA\" a new attribute \"files\" which has the value of compare.files should be added",
      ".tmp.compare.files as $f\n| .tmp.folders \n  |= map( if .cand_name==\"folderA\" \n          then .files = $f else . end)"
    ],
    "utterance": "Add a files attribute with the value from compare.files to each object in the folders array where cand_name is folderA.",
    "expressions": [
      ".tmp.compare.files as $f | .tmp.folders |= map( if .cand_name==\"folderA\" then .files = $f else . end )"
    ],
    "data": [
      {
        "input": {
          "tmp": {
            "folders": [
              {
                "ref_name": "folderB",
                "cand_name": "folderA"
              },
              {
                "ref_name": "folderC",
                "cand_name": "folderA"
              },
              {
                "ref_name": "folderC",
                "cand_name": "folderE"
              }
            ],
            "compare": {
              "files": [
                {
                  "candidate": "Z3S1"
                }
              ]
            }
          }
        },
        "output": {
          "tmp": {
            "folders": [
              {
                "ref_name": "folderB",
                "cand_name": "folderA",
                "files": [
                  {
                    "candidate": "Z3S1"
                  }
                ]
              },
              {
                "ref_name": "folderC",
                "cand_name": "folderA",
                "files": [
                  {
                    "candidate": "Z3S1"
                  }
                ]
              },
              {
                "ref_name": "folderC",
                "cand_name": "folderE"
              }
            ],
            "compare": {
              "files": [
                {
                  "candidate": "Z3S1"
                }
              ]
            }
          }
        }
      }
    ],
    "identifier": 54050996,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "It has multiple top level elements which aren't separated by a comma so if I try to edit the file with jq using something like this jq '.pwd_sha1 = \"NEW HASH\"' web.conf I get the following ... jq is adding a new element to the first top level object and changing the second top level element's value. How can I get this to only change the existing item in the second top level element?",
      "jq 'if .pwd_sha1 then .pwd_sha1 = \"NEW HASH\" else . end' web.conf",
      "jq 'if type == \"object\" and has(\"pwd_sha1\") then .pwd_sha1 = \"NEW HASH\" else . end' web.conf",
      "To edit the second top-level item only, you could use `foreach inputs` with the -n command-line option: foreach inputs as $in (0; .+1; if . == 2 then $in | .pwd_sha1 = \"NEW_HASH\" else $in end)"
    ],
    "utterance": "Update only the pwd_sha1 field to NEW_HASH in the second top-level object of a file containing multiple top-level objects.",
    "expressions": [
      "jq 'if .pwd_sha1 then .pwd_sha1 = \"NEW HASH\" else . end' web.conf",
      "jq 'if type == \"object\" and has(\"pwd_sha1\") then .pwd_sha1 = \"NEW HASH\" else . end' web.conf",
      "jq -n 'foreach inputs as $in (0; .+1; if . == 2 then $in | .pwd_sha1 = \"NEW_HASH\" else $in end)' web.conf"
    ],
    "data": [
      {
        "input": [
          {
            "file": 1,
            "format": 1
          },
          {
            "sidebar_show_zero": false,
            "show_session_speed": false,
            "pwd_sha1": "CHANGEME",
            "show_sidebar": true,
            "sessions": {},
            "enabled_plugins": [],
            "base": "/",
            "first_login": true,
            "theme": "gray",
            "pkey": "ssl/daemon.pkey",
            "default_daemon": "",
            "cert": "test",
            "session_timeout": 3600,
            "https": false,
            "interface": "0.0.0.0",
            "sidebar_multiple_filters": true,
            "pwd_salt": "salt",
            "port": 8112
          }
        ],
        "output": [
          {
            "file": 1,
            "format": 1
          },
          {
            "sidebar_show_zero": false,
            "show_session_speed": false,
            "pwd_sha1": "NEW_HASH",
            "show_sidebar": true,
            "sessions": {},
            "enabled_plugins": [],
            "base": "/",
            "first_login": true,
            "theme": "gray",
            "pkey": "ssl/daemon.pkey",
            "default_daemon": "",
            "cert": "test",
            "session_timeout": 3600,
            "https": false,
            "interface": "0.0.0.0",
            "sidebar_multiple_filters": true,
            "pwd_salt": "salt",
            "port": 8112
          }
        ]
      }
    ],
    "identifier": 54066590,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Use `jq -r` to output a string \"raw\", without JSON formatting, and use the `@sh` formatter to format your results as a string for shell consumption.",
      "So can do e.g.\n\n    msgids=($(<test.json jq -r '.logs[]._id | @sh'))\n\nand get the result you want.",
      "Command: `cat input.json | jq -r '.keys | @sh'`\n\nOutput: 'key1' 'key2' 'key3'\n\nExplanation: | @sh removes [ and \"",
      "To correctly parse values that may have newlines (and any other arbitrary (non-NUL) characters) use jq's `@sh` filter to generate space-separated quoted strings, and Bash's `declare -a` to parse the quoted strings as array elements. (No pre-processing required)",
      "`foo.json`:\n```json\n{\"data\": [\"$0\", \" \\t\\n\", \"*\", \"\\\"\", \"\"]}\n```\n\n```bash\nstr=$(jq -r '.data | @sh' foo.json)\ndeclare -a arr=\"($str)\"   # must be quoted like this\n```"
    ],
    "utterance": "Convert an array from the field .logs[]._id into a Bash array so that each value becomes a separate Bash array element.",
    "expressions": [
      ".logs[]._id | @sh",
      "-r '.logs[]._id | @sh'",
      ".keys | @sh",
      "-r '.keys | @sh'"
    ],
    "data": [
      {
        "input": {
          "logs": [
            {
              "_id": 34
            },
            {
              "_id": 235
            },
            {
              "_id": 436
            },
            {
              "_id": 546
            }
          ]
        },
        "output": "34 235 436 546"
      }
    ],
    "identifier": 54087481,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want the UUID of the found result with no errors and store it as a variable in shell",
      ".entries[] | select(.icon | contains(\"BBC\")) | .uuid",
      "Here is [an example][2] that will simply omit the error and only print the UUIDs if there is no error for that object:",
      ".entries[] | select(.icon | try contains(\"BBC\")) | .uuid",
      "You can preselect only objects that have the `icon` key with `has(\"icon\")`.",
      "jq '.entries[] | select(has(\"icon\")) | select(.icon | contains(\"BBC\")).uuid' file",
      "jq '.entries[] | select(.icon != null) | select(.icon | contains(\"BBC\")).uuid' file"
    ],
    "utterance": "Output the uuid of any object in entries whose icon value contains \"BBC\" without erroring if the icon key is missing or null.",
    "expressions": [
      ".entries[] | select(.icon | try contains(\"BBC\")) | .uuid",
      ".entries[] | select(has(\"icon\")) | select(.icon | contains(\"BBC\")).uuid",
      ".entries[] | select(.icon != null) | select(.icon | contains(\"BBC\")).uuid"
    ],
    "data": [
      {
        "input": {
          "entries": [
            {
              "uuid": "174501xxxxxxxxxxxxxe6342a03",
              "enabled": true,
              "autoname": true,
              "name": "BBC",
              "number": 0,
              "icon": "file:///logos/BBC.png",
              "icon_public_url": "imagecache/1097",
              "epgauto": true,
              "epggrab": [],
              "dvr_pre_time": 0,
              "dvr_pst_time": 0,
              "epg_running": -1,
              "services": [],
              "tags": [],
              "bouquet": ""
            },
            {
              "uuid": "174501xxxxxxxxxxxxxe6342a04",
              "enabled": true,
              "autoname": true,
              "name": "ABC",
              "number": 0,
              "icon_public_url": "imagecache/1098",
              "epgauto": true,
              "epggrab": [],
              "dvr_pre_time": 0,
              "dvr_pst_time": 0,
              "epg_running": -1,
              "services": [],
              "tags": [],
              "bouquet": ""
            }
          ]
        },
        "output": "174501xxxxxxxxxxxxxe6342a03"
      }
    ],
    "identifier": 54087732,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to take an existing JSON element and create a new one with an added value for a key.",
      "I am using this to filter on the object I want\n\n`.entries[] | select(.uuid==\"4a05\")`",
      "I want to create the following output if I ended up filtering on `uuid=4a05`\n\n`[{\"tags\":[\"08fa7\",\"MYNEWVALUE\"],\"number\":mynew##,\"uuid\":\"4a05\"}]`",
      "Here's one way to accomplish the task - first create an object with the fields of interest, and then update `.tags`:\n\n    .entries[]\n    | select(.uuid==\"4a05\") \n    | {tags, uuid, number: \"mynew##\"}\n    | .tags += [\"MYNEWVALUE\"]"
    ],
    "utterance": "Select the entry with uuid equal to \"4a05\", keep only the tags, uuid, and a new number field set to \"mynew##\", and append \"MYNEWVALUE\" to the tags array.",
    "expressions": [
      ".entries[] | select(.uuid==\"4a05\") | {tags, uuid, number: \"mynew##\"} | .tags += [\"MYNEWVALUE\"]"
    ],
    "data": [
      {
        "input": {
          "entries": [
            {
              "uuid": "1a03",
              "enabled": true,
              "autoname": true,
              "name": "LBC",
              "number": 0,
              "epgauto": true,
              "epggrab": [],
              "dvr_pre_time": 0,
              "dvr_pst_time": 0,
              "epg_running": -1,
              "services": [
                "ea8"
              ],
              "tags": [
                "0a7"
              ],
              "bouquet": ""
            },
            {
              "uuid": "4a05",
              "enabled": true,
              "autoname": true,
              "name": "Radio X",
              "number": 0,
              "epgauto": true,
              "epggrab": [],
              "dvr_pre_time": 0,
              "dvr_pst_time": 0,
              "epg_running": -1,
              "services": [
                "9d36"
              ],
              "tags": [
                "08fa7"
              ],
              "bouquet": ""
            }
          ]
        },
        "output": {
          "tags": [
            "08fa7",
            "MYNEWVALUE"
          ],
          "uuid": "4a05",
          "number": "mynew##"
        }
      }
    ],
    "identifier": 54092267,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "cat $2.json | jq '.transactionId = \"$timestamp\"' > $3/$timestamp.json",
      "There are better ways to pass variables to `jq`:",
      "$ new=bar",
      "$ jq --arg new_foo \"$new\" '.foo |= $new_foo' data.test",
      "{\n  \"foo\": \"bar\"\n}"
    ],
    "utterance": "Overwrite the value of the transactionId field with the value of a shell variable named timestamp.",
    "expressions": [
      "jq --arg timestamp \"$timestamp\" '.transactionId = $timestamp' input.json"
    ],
    "data": [
      {
        "input": {
          "transactionId": "$timestamp"
        },
        "output": {
          "transactionId": "actual-timestamp-value"
        }
      }
    ],
    "identifier": 54092824,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to get a list of elements in \"Parameters\" that have a \"Default\" key:",
      "Is there a way I can get just a list of keys which match this filter? e.g.\n\n    \"InstanceTenancy\"\n    \"InstanceAmi\"",
      "Using `to_entries` allows a fundamentally simple solution:\n\n    .Parameters\n    | to_entries[]\n    | select(.value | has(\"Default\"))\n    | .key",
      "But your approach can also be made to work:\n\n    .Parameters\n    | map_values(has(\"Default\"))\n    | keys_unsorted[] as $k\n    | select(.[$k])\n    | $k"
    ],
    "utterance": "List the keys in the Parameters object that contain a Default field.",
    "expressions": [
      ".Parameters | to_entries[] | select(.value | has(\"Default\")) | .key",
      ".Parameters | map_values(has(\"Default\")) | keys_unsorted[] as $k | select(.[$k]) | $k"
    ],
    "data": [
      {
        "input": {
          "Parameters": {
            "Ecosystem": {
              "Type": "String",
              "Description": "Ecosystem to deploy the resources in to",
              "MinLength": "1"
            },
            "InstanceTenancy": {
              "Type": "String",
              "Description": "EC2 Instance Tenancy",
              "Default": "default",
              "AllowedValues": [
                "default",
                "dedicated"
              ]
            },
            "InstanceSecurityGroups": {
              "Type": "List<AWS::EC2::SecurityGroup::Id>",
              "Description": "EC2 Instance Security Groups",
              "MinLength": "1"
            },
            "InstanceAmi": {
              "Type": "AWS::EC2::Image::Id",
              "Description": "AMI to deploy to the EC2 instances",
              "Default": "ami-11223344"
            }
          }
        },
        "output": [
          "InstanceTenancy",
          "InstanceAmi"
        ]
      }
    ],
    "identifier": 54099498,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Input:",
      "{\n    \"a\": \"1\",\n    \"list\": \n    [\n        {\n            \"Key\": \"b\",\n            \"Value\": \"2\"\n        },\n        {\n            \"Key\": \"c\",\n            \"Value\": \"3\"\n        },\n        {\n            \"Key\": \"d\",\n            \"Value\": \"4\"\n        }\n    ]\n}",
      "Expected Output:",
      "{\n  \"a\": \"1\",\n  \"b\": \"2\",\n  \"c\": \"3\",\n  \"d\": \"4\"\n}",
      "As it happens, your .list is exactly suited to jq's `from_entries`, so you can simply write:\n\n    {a} + (.list|from_entries)",
      "Your approach, however, can easily be made to work:\n\n    {a} +  (.list | map({(.Key): .Value}) | add)"
    ],
    "utterance": "Flatten an object with a key 'a' and a list of key-value pairs under 'list' into a single object with 'a' and each key-value pair from the list at the top level.",
    "expressions": [
      "{a} + (.list | from_entries)",
      "{a} + (.list | map({(.Key): .Value}) | add)"
    ],
    "data": [
      {
        "input": {
          "a": "1",
          "list": [
            {
              "Key": "b",
              "Value": "2"
            },
            {
              "Key": "c",
              "Value": "3"
            },
            {
              "Key": "d",
              "Value": "4"
            }
          ]
        },
        "output": {
          "a": "1",
          "b": "2",
          "c": "3",
          "d": "4"
        }
      }
    ],
    "identifier": 54099851,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "my input is \"2009-12-20 08:00:00\" and i want to get the Day of Week.",
      "my test filter is (`\"2009-12-20 08:00:00\" | sub(\" \"; \"T\") + \"Z\" | fromdate | gmtime)` but all i get is an error: jq: error: strptime/1 not implemented on this platform",
      "\"2009-12-20 08:00:00\"\n| strptime(\"%Y-%m-%d %T\")\n| strftime(\"%a\")\n\nIt yields \"Sun\" (which is correct :-)",
      "\"2009-12-20 08:00:00\"\n| capture(\"(?<Y>\\\\d+)-(?<m>\\\\d+)-(?<d>\\\\d+)\")\n| map_values(tonumber)\n| day_of_week(.Y; .m; .d)\n\nThis yields 1, signifying Sunday.",
      "def day_of_week($year; $month; $day):\n  if $month == 1 or $month == 2 then\n    [$month + 12, $year - 1]\n  else\n    [$month, $year]\n  end \n  | $day + (13*(.[0] + 1)/5|floor)\n    +  (.[1]%100)       + ((.[1]%100)/4|floor)\n    +  (.[1]/400|floor) - 2*(.[1]/100|floor) \n  | . % 7 ;"
    ],
    "utterance": "Extract the day of week from the string \"2009-12-20 08:00:00\".",
    "expressions": [
      "\"2009-12-20 08:00:00\" | strptime(\"%Y-%m-%d %T\") | strftime(\"%a\")",
      "def day_of_week($year; $month; $day):\n  if $month == 1 or $month == 2 then\n    [$month + 12, $year - 1]\n  else\n    [$month, $year]\n  end \n  | $day + (13*(.[0] + 1)/5|floor)\n    +  (.[1]%100)       + ((.[1]%100)/4|floor)\n    +  (.[1]/400|floor) - 2*(.[1]/100|floor) \n  | . % 7 ;\n\n\"2009-12-20 08:00:00\"\n| capture(\"(?<Y>\\\\d+)-(?<m>\\\\d+)-(?<d>\\\\d+)\")\n| map_values(tonumber)\n| day_of_week(.Y; .m; .d)"
    ],
    "data": [
      {
        "input": "\"2009-12-20 08:00:00\"",
        "output": "\"Sun\""
      },
      {
        "input": "\"2009-12-20 08:00:00\"",
        "output": 1
      }
    ],
    "identifier": 54100199,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to merge the XML into the JSON using jq to get something like\n\n    {\n      \"thj\": \"abc\", \n      \"xml\": \"&lt;blah abc=\\\"def\\\"&gt;123&lt;/blah&gt;\"\n    }",
      "echo \"$JSON_STRING\" | jq --arg xml \"$<\"$XML_FILENAME\"\" '.xml=($xml | @html)'",
      "jq -R --argjson jstr \"$JSON_STRING\" '$jstr + { \"xml\" : @html }' \"$XML_FILENAME\""
    ],
    "utterance": "Add a key whose value is the original XML source, escaped as HTML entities, to an existing object, preserving the existing fields.",
    "expressions": [
      ".xml = ($xml | @html)",
      "$jstr + { \"xml\": @html }"
    ],
    "data": [
      {
        "input": {
          "thj": "abc"
        },
        "output": {
          "thj": "abc",
          "xml": "&lt;blah abc=\\\"def\\\"&gt;123&lt;/blah&gt;"
        }
      }
    ],
    "identifier": 54106880,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task requires the original XML source as an additional input (not present in the input JSON), and the transformation is not fully specified: how is the XML source provided to jq? Is it a variable, a file, or stdin? This information is missing.",
      "other_environment": null
    }
  },
  {
    "context": [
      "how to transform it with jq to the array of \"[parent, child_order_number, child]\" items like",
      "jq -c '[.[] | range(.children|length) as $i | [.parent, $i, .children[$i]]]' file\nyields:\n\n    [[\"x\",0,\"a\"],[\"x\",1,\"b\"],[\"y\",0,\"c\"],[\"y\",1,\"d\"],[\"y\",2,\"e\"]]"
    ],
    "utterance": "For each object, output an array containing its parent value, the index of each child, and the child value itself.",
    "expressions": [
      "[.[] | range(.children|length) as $i | [.parent, $i, .children[$i]]]"
    ],
    "data": [
      {
        "input": [
          {
            "parent": "x",
            "children": [
              "a",
              "b"
            ]
          },
          {
            "parent": "y",
            "children": [
              "c",
              "d",
              "e"
            ]
          }
        ],
        "output": [
          [
            "x",
            0,
            "a"
          ],
          [
            "x",
            1,
            "b"
          ],
          [
            "y",
            0,
            "c"
          ],
          [
            "y",
            1,
            "d"
          ],
          [
            "y",
            2,
            "e"
          ]
        ]
      }
    ],
    "identifier": 54110870,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "My goal is to add up all the time differences between an \"IN\" event and the next \"OUT\" event.",
      "Exception: If the last element is an \"IN\", the calculation has to be done between the current time and the timestamp from the last \"IN\" event.",
      "Here&#39;s an only-jq solution that only calls jq once.",
      "def mins: split(\":\" ) | map(tonumber) | .[0] * 60 + .[1];",
      "def diff: (.[1] - .[0]) | if . >= 0 then . else 24*60 + . end;",
      "def now_mins: now | gmtime | .[3] * 60 + .[4];",
      "def pairs:\n  range(0; length; 2) as $i | [.[$i], .[$i+1] ];",
      "def sigma(s): reduce s as $s (0; . + $s);",
      ".times\n| map( .[0] |= mins )\n| if .[-1][1] == \"IN\" then . + [ [now_mins, \"OUT\"] ] else . end\n| sigma(pairs | map(.[0]) | diff)"
    ],
    "utterance": "Calculate the total duration in minutes between each 'IN' event and the following 'OUT' event, using the current time for an unmatched final 'IN', and sum all durations.",
    "expressions": [
      "def mins: split(\":\") | map(tonumber) | .[0] * 60 + .[1];\ndef diff: (.[1] - .[0]) | if . >= 0 then . else 24*60 + . end;\ndef now_mins: now | gmtime | .[3] * 60 + .[4];\ndef pairs: range(0; length; 2) as $i | [.[$i], .[$i+1] ];\ndef sigma(s): reduce s as $s (0; . + $s);\n.times\n| map( .[0] |= mins )\n| if .[-1][1] == \"IN\" then . + [ [now_mins, \"OUT\"] ] else . end\n| sigma(pairs | map(.[0]) | diff)"
    ],
    "data": [
      {
        "input": {
          "times": [
            [
              "7:43",
              "IN"
            ],
            [
              "8:26",
              "OUT"
            ],
            [
              "8:27",
              "IN"
            ],
            [
              "9:12",
              "OUT"
            ],
            [
              "9:14",
              "IN"
            ],
            [
              "9:22",
              "OUT"
            ],
            [
              "9:23",
              "IN "
            ],
            [
              "12:12",
              "OUT"
            ],
            [
              "13:12",
              "IN"
            ]
          ]
        }
      }
    ],
    "identifier": 54112485,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to extract only the value: \"this is an example\"",
      "I suppose you are trying to get the `.info` field inside `.Event` which should have been written as below. Use `-r` for without quotes",
      "jq '.Event.info'"
    ],
    "utterance": "Extract the value of the 'info' field inside the 'Event' object at the top level.",
    "expressions": [
      ".Event.info"
    ],
    "data": [
      {
        "input": {
          "Event": {
            "id": "13",
            "orgc_id": "1",
            "org_id": "1",
            "date": "2019-01-09",
            "threat_level_id": "3",
            "info": "test9",
            "published": false,
            "uuid": "5c35d180",
            "attribute_count": "2",
            "analysis": "0",
            "timestamp": "1547044733",
            "distribution": "1",
            "proposal_email_lock": false,
            "locked": false,
            "publish_timestamp": "1547034089",
            "sharing_group_id": "0",
            "disable_correlation": false,
            "extends_uuid": "",
            "event_creator_email": "o@cyhgfnt.com",
            "Org": {
              "id": "1",
              "name": "Cygfdgfdnt",
              "uuid": "5b9f938d-e3a0-4ecb-83b3-0bdeac1b41bc"
            },
            "Orgc": {
              "id": "1",
              "name": "Cyhgfgft",
              "uuid": "5b9f938d-e3a0-4ecb-83b3-0bdeac1b41bc"
            },
            "Attribute": [
              {
                "id": "292630",
                "type": "domain",
                "category": "Network activity",
                "to_ids": true,
                "uuid": "5c35dd94-cccc-4086-b386-682823717aa5",
                "event_id": "1357",
                "distribution": "5",
                "timestamp": "1547034584",
                "comment": "This is a comment",
                "sharing_group_id": "0",
                "deleted": false,
                "disable_correlation": false,
                "object_id": "0",
                "object_relation": null,
                "value": "dodskj.com",
                "Galaxy": [],
                "ShadowAttribute": [],
                "Tag": [
                  {
                    "id": "223",
                    "name": "kill-chain:Exploitation",
                    "colour": "#a80079",
                    "exportable": true,
                    "user_id": "0",
                    "hide_tag": false,
                    "numerical_value": null
                  }
                ]
              },
              {
                "id": "292631",
                "type": "ip-dst",
                "category": "Network activity",
                "to_ids": true,
                "uuid": "5c35dd94-fe90-4ef6-b3a9-682823717aa5",
                "event_id": "1357",
                "distribution": "5",
                "timestamp": "1547044733",
                "comment": "comment example",
                "sharing_group_id": "0",
                "deleted": false,
                "disable_correlation": false,
                "object_id": "0",
                "object_relation": null,
                "value": "8.8.6.6",
                "Galaxy": [],
                "ShadowAttribute": [],
                "Tag": [
                  {
                    "id": "247",
                    "name": "maec-malware-capabilities:maec-malware-capability=\"anti-removal\"",
                    "colour": "#3f0004",
                    "exportable": true,
                    "user_id": "0",
                    "hide_tag": false,
                    "numerical_value": null
                  },
                  {
                    "id": "465",
                    "name": "osint:lifetime=\"perpetual\"",
                    "colour": "#006ebe",
                    "exportable": true,
                    "user_id": "0",
                    "hide_tag": false,
                    "numerical_value": null
                  }
                ]
              }
            ],
            "ShadowAttribute": [],
            "RelatedEvent": [],
            "Galaxy": [
              {
                "id": "3",
                "uuid": "698774c7-8022-42c4-917f-8d6e4f06ada3",
                "name": "Threat Actor",
                "type": "threat-actor",
                "description": "Threat actors are characteristics of malicious actors (or adversaries) representing a cyber attack threat including presumed intent and historically observed behaviour.",
                "version": "3",
                "icon": "user-secret",
                "namespace": "misp",
                "GalaxyCluster": [
                  {
                    "id": "6397",
                    "collection_uuid": "7cdff317-a673-4474-84ec-4f1754947823",
                    "type": "threat-actor",
                    "value": "Sofacy",
                    "tag_name": "misp-galaxy:threat-actor=\"Sofacy\"",
                    "description": "The Sofacy Group (also known as APT28, Pawn Storm, Fancy Bear and Sednit) is a cyber espionage group believed to have ties to the Russian government. Likely operating since 2007, the group is known to target government, military, and security organizations. It has been characterized as an advanced persistent threat.",
                    "galaxy_id": "3",
                    "source": "MISP Project",
                    "authors": [
                      "Alexandre Dulaunoy",
                      "Florian Roth",
                      "Thomas Schreck",
                      "Timo Steffens",
                      "Various"
                    ],
                    "version": "82",
                    "uuid": "5b4ee3ea-eee3-4c8e-8323-85ae32658754",
                    "tag_id": "608",
                    "meta": {
                      "cfr-suspected-state-sponsor": [
                        "Russian Federation"
                      ],
                      "cfr-suspected-victims": [
                        "Georgia",
                        "France",
                        "Jordan",
                        "United States",
                        "Hungary",
                        "World Anti-Doping Agency",
                        "Armenia",
                        "Tajikistan",
                        "Japan",
                        "NATO",
                        "Ukraine",
                        "Belgium",
                        "Pakistan",
                        "Asia Pacific Economic Cooperation",
                        "International Association of Athletics Federations",
                        "Turkey",
                        "Mongolia",
                        "OSCE",
                        "United Kingdom",
                        "Germany",
                        "Poland",
                        "European Commission",
                        "Afghanistan",
                        "Kazakhstan",
                        "China"
                      ],
                      "cfr-target-category": [
                        "Government",
                        "Military"
                      ],
                      "cfr-type-of-incident": [
                        "Espionage"
                      ],
                      "country": [
                        "RU"
                      ],
                      "refs": [
                        "https://en.wikipedia.org/wiki/Sofacy_Group",
                        "https://aptnotes.malwareconfig.com/web/viewer.html?file=../APTnotes/2014/apt28.pdf",
                        "http://www.trendmicro.com/cloud-content/us/pdfs/security-intelligence/white-papers/wp-operation-pawn-storm.pdf",
                        "https://www2.fireeye.com/rs/848-DID-242/images/wp-mandiant-matryoshka-mining.pdf",
                        "https://www.crowdstrike.com/blog/bears-midst-intrusion-democratic-national-committee/",
                        "http://researchcenter.paloaltonetworks.com/2016/06/unit42-new-sofacy-attacks-against-us-government-agency/",
                        "https://www.cfr.org/interactive/cyber-operations/apt-28",
                        "https://blogs.microsoft.com/on-the-issues/2018/08/20/we-are-taking-new-steps-against-broadening-threats-to-democracy/",
                        "https://www.bleepingcomputer.com/news/security/microsoft-disrupts-apt28-hacking-campaign-aimed-at-us-midterm-elections/",
                        "https://www.bleepingcomputer.com/news/security/apt28-uses-lojax-first-uefi-rootkit-seen-in-the-wild/"
                      ],
                      "synonyms": [
                        "APT 28",
                        "APT28",
                        "Pawn Storm",
                        "PawnStorm",
                        "Fancy Bear",
                        "Sednit",
                        "TsarTeam",
                        "Tsar Team",
                        "TG-4127",
                        "Group-4127",
                        "STRONTIUM",
                        "TAG_0700",
                        "Swallowtail",
                        "IRON TWILIGHT",
                        "Group 74"
                      ]
                    }
                  }
                ]
              }
            ],
            "Object": [],
            "Tag": [
              {
                "id": "608",
                "name": "misp-galaxy:threat-actor=\"Sofacy\"",
                "colour": "#12e000",
                "exportable": true,
                "user_id": "0",
                "hide_tag": false,
                "numerical_value": null
              },
              {
                "id": "118",
                "name": "gdpr:special-categories=\"health\"",
                "colour": "#3ce600",
                "exportable": true,
                "user_id": "0",
                "hide_tag": false,
                "numerical_value": null
              }
            ]
          }
        },
        "output": "test9"
      }
    ],
    "identifier": 54125850,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I only want to look at a restricted set of keys.",
      "I want a result like:\n\n[\n    {\n        \"field_1\":123,\n        \"field_2\":123,\n    },\n    {\n        \"field_1\":123,\n        \"field_2\":123,\n    },\n    ...all objects like this...\n]",
      "jq '.[] | {field_1:.field_1,field_2:.field_2}'",
      "There is a shorthand notation for that:\n\n```\njq '.[] | {field_1,field_2}'\n```",
      "$ jq --arg fields 'field_1,field_2' '\nmap(with_entries(select(. as {$key} | any($fields|split(\",\")[]; $key == .))))\n' input.json",
      "def filter_keys($keys):\n    with_entries(select(. as {$key} | any($keys[]; $key == .)));",
      "$ jq --arg fields 'field_1,field_2' 'map(filter_keys($fields|split(\",\")))'"
    ],
    "utterance": "Select only the keys field_1 and field_2 from each object in an array.",
    "expressions": [
      ".[] | {field_1,field_2}",
      "map({field_1,field_2})",
      "map(with_entries(select(. as {$key} | any($fields|split(\",\")[]; $key == .))))",
      "map(filter_keys($fields|split(\",\")))"
    ],
    "data": [
      {
        "input": [
          {
            "field_1": 123,
            "field_2": 123,
            "field_3": 123,
            "field_4": 123,
            "field_5": 123
          },
          {
            "field_1": 123,
            "field_2": 123,
            "field_3": 123,
            "field_4": 123,
            "field_5": 123
          }
        ],
        "output": [
          {
            "field_1": 123,
            "field_2": 123
          },
          {
            "field_1": 123,
            "field_2": 123
          }
        ]
      }
    ],
    "identifier": 54129474,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to rename my package.json \"name\" value like this:",
      "PROJECT_NAME=\"MyProject\"",
      "jq '.name = \"$PROJECT_NAME\"' package.json > tmp.$$.json && mv tmp.$$.json package.json",
      "But it doesn't work.",
      "Without the variable works as expected:",
      "jq '.name = \"MyProject\"' package.json > tmp.$$.json && mv tmp.$$.json package.json"
    ],
    "utterance": "Set the 'name' field to a specific value stored in a shell variable PROJECT_NAME",
    "expressions": [
      "jq --arg name \"$PROJECT_NAME\" '.name = $name' package.json"
    ],
    "identifier": 54136250,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How would I write a `jq` query to grab all `StackName`s for stacks that do *NOT* have a `Tags` value `\"Key\": \"Joe Shmo\"`? So the result would return simply `hello-man`.",
      "jq -r '.Stacks[]|select(.Tags|contains([{\"Key\": \"Joe Shmo\"}])|not).StackName'",
      ".Stacks[]\n| select( any(.Tags[]; .Key == \"Joe Shmo\" ) | not)\n| .StackName"
    ],
    "utterance": "Return all StackName values where none of the associated Tags have Key equal to \"Joe Shmo\".",
    "expressions": [
      ".Stacks[] | select(.Tags | contains([{\"Key\": \"Joe Shmo\"}]) | not) | .StackName",
      ".Stacks[] | select( any(.Tags[]; .Key == \"Joe Shmo\" ) | not) | .StackName"
    ],
    "data": [
      {
        "input": {
          "Stacks": [
            {
              "StackName": "hello-world",
              "Tags": [
                {
                  "Key": "environment",
                  "Value": "sandbox"
                },
                {
                  "Key": "Joe Shmo",
                  "Value": "Dev"
                }
              ]
            },
            {
              "StackName": "hello-man",
              "Tags": [
                {
                  "Key": "environment",
                  "Value": "live"
                },
                {
                  "Key": "Tandy",
                  "Value": "Dev"
                }
              ]
            }
          ]
        },
        "output": "hello-man"
      }
    ],
    "identifier": 54138923,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to insert the following text post Management Configuration.",
      "Then, assuming the object to be inserted is $o, we would write:",
      "    .[0:$i+1] + [$o] + .[$i+1:]",
      "the task boils down to finding $i, which can be done efficiently as follows:",
      "    first(range(0;length) as $i",
      "          | select(.[$i].name==\"Management Configuration\")",
      "          | $i)",
      "the solution then becomes:",
      "    indexof(.name==\"Management Configuration\") as $i",
      "    | if $i then .[0:$i+1] + [$o] + .[$i+1:] else . end",
      "$ jq --argjson i '...' 'map(if .name == \"Management Configuration\" then ., $i else . end)'"
    ],
    "utterance": "Insert a new object after the array element whose name is \"Management Configuration\".",
    "expressions": [
      "(first(range(0;length) as $i | select(.[$i].name==\"Management Configuration\") | $i) // null) as $i | if $i then .[0:$i+1] + [$o] + .[$i+1:] else . end",
      "def indexof(f): first(range(0;length) as $i | select(.[$i]|f) | $i) // null; indexof(.name==\"Management Configuration\") as $i | if $i then .[0:$i+1] + [$o] + .[$i+1:] else . end",
      "map(if .name == \"Management Configuration\" then ., $i else . end)"
    ],
    "data": [
      {
        "input": [
          {
            "name": "Dependencies",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "dependency.yaml"
            ]
          },
          {
            "name": "Management Configuration",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "mgmt-job.yaml"
            ]
          },
          {
            "name": "Data Configuration",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "data-job.yaml"
            ]
          },
          {
            "name": "Tools",
            "asyncInstallation": true,
            "failOnError": false,
            "yamls": [
              "tools-job.yaml"
            ]
          }
        ],
        "output": [
          {
            "name": "Dependencies",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "dependency.yaml"
            ]
          },
          {
            "name": "Management Configuration",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "mgmt-job.yaml"
            ]
          },
          {
            "name": "NEW Configuration",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "newfile.yaml"
            ]
          },
          {
            "name": "Data Configuration",
            "asyncInstallation": false,
            "failOnError": true,
            "yamls": [
              "data-job.yaml"
            ]
          },
          {
            "name": "Tools",
            "asyncInstallation": true,
            "failOnError": false,
            "yamls": [
              "tools-job.yaml"
            ]
          }
        ]
      }
    ],
    "identifier": 54141077,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to fetch `type` results from this JSON file. How can I do this?",
      "Sample Output:- \r\n\r\n    \"type\":\"org.graylog.plugins.beats.Beats2Input\"\r\n    \"type\":\"org.graylog2.inputs.gelf.tcp.GELFTCPInput\"\r\n",
      "The requirements are a bit fuzzy but when used with the -r command-line option, the following produces the desired output and has the advantage of simplicity, but you might want to tweak it to meet your exact requirements:",
      ".. | objects | select(.type) | \"\\\"type\\\":\\\"\\(.type)\\\"\""
    ],
    "utterance": "Extract all values of the field 'type' from every object in the data, formatted as \"type\":\"<value>\".",
    "expressions": [
      ".. | objects | select(.type) | \"\\\"type\\\":\\\"\\(.type)\\\"\""
    ],
    "data": [
      {
        "input": {
          "inputs": [
            {
              "title": "vault_beats",
              "global": true,
              "name": "Beats",
              "content_pack": null,
              "created_at": "2019-01-11T05:31:49.720Z",
              "type": "org.graylog.plugins.beats.Beats2Input",
              "creator_user_id": "admin",
              "attributes": {
                "recv_buffer_size": 1048576,
                "tcp_keepalive": false,
                "number_worker_threads": 1,
                "tls_client_auth_cert_file": "",
                "beats_prefix": false,
                "bind_address": "0.0.0.0",
                "tls_cert_file": "",
                "port": 5044,
                "tls_key_file": "admin",
                "tls_enable": false,
                "tls_key_password": "admin",
                "tls_client_auth": "disabled",
                "override_source": null
              },
              "static_fields": {},
              "node": null,
              "id": "5c382a45590801001d742017"
            },
            {
              "title": "vault_beatsl",
              "global": true,
              "name": "GELF TCP",
              "content_pack": null,
              "created_at": "2019-01-11T05:32:05.869Z",
              "type": "org.graylog2.inputs.gelf.tcp.GELFTCPInput",
              "creator_user_id": "admin",
              "attributes": {
                "recv_buffer_size": 1048576,
                "tcp_keepalive": false,
                "use_null_delimiter": true,
                "number_worker_threads": 1,
                "tls_client_auth_cert_file": "",
                "bind_address": "0.0.0.0",
                "tls_cert_file": "",
                "decompress_size_limit": 8388608,
                "port": 12201,
                "tls_key_file": "admin",
                "tls_enable": false,
                "tls_key_password": "admin",
                "max_message_size": 2097152,
                "tls_client_auth": "disabled",
                "override_source": null
              },
              "static_fields": {},
              "node": null,
              "id": "5c382a55590801001d74202b"
            }
          ],
          "total": 2
        },
        "output": [
          "\"type\":\"org.graylog.plugins.beats.Beats2Input\"",
          "\"type\":\"org.graylog2.inputs.gelf.tcp.GELFTCPInput\""
        ]
      }
    ],
    "identifier": 54141729,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "In my input json there's an array `times` of nested arrays, each having (always) two string elements. I want those two string elements concatenated and the nested array removed so that there's ony one array left:",
      "My input:",
      "{\n   \"times\":[\n      [\n         \"7:29\", \"IN\"\n      ],\n      [\n         \"10:29\", \"OUT\"\n      ]\n   ],\n   \"foo\":\"bar\"\n}",
      "My desired output is:",
      "{\n   \"times\":\n   [\n         \"7:29 IN\", \"10:29 OUT\"\n   ],\n   \"foo\":\"bar\"\n}",
      "jq '.times |= map(join(\" \"))' file\nyields:\n{\n  \"times\": [\n    \"7:29 IN\",\n    \"10:29 OUT\"\n  ],\n  \"foo\": \"bar\"\n}"
    ],
    "utterance": "Concatenate each pair of string elements in the 'times' property and replace the nested arrays with single strings, keeping all other properties unchanged.",
    "expressions": [
      ".times |= map(join(\" \"))"
    ],
    "data": [
      {
        "input": {
          "times": [
            [
              "7:29",
              "IN"
            ],
            [
              "10:29",
              "OUT"
            ]
          ],
          "foo": "bar"
        },
        "output": {
          "times": [
            "7:29 IN",
            "10:29 OUT"
          ],
          "foo": "bar"
        }
      }
    ],
    "identifier": 54145011,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have two separate lists. The first is a list (base_list) with basic parameters, and the second is a list (dev_list) with parameters for a specific stand.",
      "My goal is to combine these two lists to get one item.name with several item.path and item.src. Paths that look like this:",
      "Assuming you had well-formed json and those are properties on the root object, jq is perfectly suited for this.  Group the contents of the arrays by name then generate the appropriate result objects.",
      "$ jq '{\n    end_combine: (\n        .base_list + .dev_list\n          | group_by(.name)\n          | map({ name: .[0].name, path: map(.path), src: map(.src) })\n    )\n}' input.json"
    ],
    "utterance": "Combine two lists of objects, grouping by the name field, and aggregate each group's path and src values into arrays.",
    "expressions": [
      "{ end_combine: (.base_list + .dev_list | group_by(.name) | map({ name: .[0].name, path: map(.path), src: map(.src) })) }"
    ],
    "data": [
      {
        "input": {
          "base_list": [
            {
              "name": "kibana",
              "path": "kibana/conf/kibana.xml",
              "src": "/Users/ansible/inventories/_base/group_vars/kibana/conf/kibana.xml"
            },
            {
              "name": "logstash",
              "path": "logstash/conf/logstash.yml",
              "src": "/Users/ansible/inventories/_base/group_vars/logstash/conf/logstash.yml"
            },
            {
              "name": "grafana",
              "path": "grafana/conf/grafana.json",
              "src": "/Users/ansible/inventories/_base/group_vars/grafana/conf/grafana.json"
            },
            {
              "name": "grafana",
              "path": "grafana/conf/nginx.json",
              "src": "/Users/ansible/inventories/_base/group_vars/grafana/conf/nginx.json"
            },
            {
              "name": "grafana",
              "path": "grafana/conf/config.json",
              "src": "/Users/ansible/inventories/_base/group_vars/grafana/conf/config.json"
            }
          ],
          "dev_list": [
            {
              "name": "kibana",
              "path": "kibana/conf/kibana.xml",
              "src": "/Users/ansible/inventories/dev-st/group_vars/kibana/conf/kibana.xml"
            },
            {
              "name": "logstash",
              "path": "logstash/conf/jvm.options",
              "src": "/Users/ansible/inventories/dev-st/group_vars/logstash/conf/jvm.options"
            }
          ]
        },
        "output": {
          "end_combine": [
            {
              "name": "grafana",
              "path": [
                "grafana/conf/grafana.json",
                "grafana/conf/nginx.json",
                "grafana/conf/config.json"
              ],
              "src": [
                "/Users/ansible/inventories/_base/group_vars/grafana/conf/grafana.json",
                "/Users/ansible/inventories/_base/group_vars/grafana/conf/nginx.json",
                "/Users/ansible/inventories/_base/group_vars/grafana/conf/config.json"
              ]
            },
            {
              "name": "kibana",
              "path": [
                "kibana/conf/kibana.xml",
                "kibana/conf/kibana.xml"
              ],
              "src": [
                "/Users/ansible/inventories/_base/group_vars/kibana/conf/kibana.xml",
                "/Users/ansible/inventories/dev-st/group_vars/kibana/conf/kibana.xml"
              ]
            },
            {
              "name": "logstash",
              "path": [
                "logstash/conf/logstash.yml",
                "logstash/conf/jvm.options"
              ],
              "src": [
                "/Users/ansible/inventories/_base/group_vars/logstash/conf/logstash.yml",
                "/Users/ansible/inventories/dev-st/group_vars/logstash/conf/jvm.options"
              ]
            }
          ]
        }
      }
    ],
    "identifier": 54146501,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "With JQ I am trying to read/extract the values present in the second-level of the curly braces (.phone, .termination, .duration, while ignoring values present in the first-level of curly braces (12345, 67891 and 78912, which are already repeated inside the second-level braces. Can this be done?",
      "I&#39;d like to have a line containing comma separated values for the three filters described.",
      "If the innermost keys are always in the same order, you could use the filter:\r\n\r\n    .[] | [.[][]] | @csv",
      "Otherwise:\r\n\r\n    .[][]\r\n    | [.phone, .termination, .duration]\r\n    | @csv",
      "With the -r command-line option, these last two yield:\r\n\r\n    12345, picked-up, 5\r\n    67891, picked-up, 10\r\n    78912, busy, 0"
    ],
    "utterance": "Output each second-level object's phone, termination, and duration values as comma-separated values, one line per object.",
    "expressions": [
      ".[] | [.[][]] | @csv",
      ".[][] | [.phone, .termination, .duration] | @csv",
      ".[] | [.[][]] | join(\", \")",
      ".[][] | \"\\(.phone), \\(.termination), \\(.duration)\""
    ],
    "data": [
      {
        "input": {
          "12345": [
            {
              "phone": "12345",
              "termination": "picked-up",
              "duration": 5
            }
          ],
          "67891": [
            {
              "phone": "67891",
              "termination": "picked-up",
              "duration": 10
            }
          ],
          "78912": [
            {
              "phone": "78912",
              "termination": "busy",
              "duration": 0
            }
          ]
        },
        "output": [
          "12345, picked-up, 5",
          "67891, picked-up, 10",
          "78912, busy, 0"
        ]
      }
    ],
    "identifier": 54153245,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have the following json payload in `payload.json`",
      "{ \"REGION\":\"us-east-1\",\"EMAIL\":\"contain\",\"UPDATE\":1 }",
      "which I want to convert into `=` delimited keypairs, like so;",
      "REGION=us-east-1",
      "EMAIL=contain",
      "UPDATE=1",
      "jq -r 'to_entries | .[] | .key + \"=\" + .value' < payload.json",
      "jq: error (at <stdin>:0): string (\"UPDATE=\") and number (1) cannot be added",
      "A pair of parentheses:",
      ".key + \"=\" + ( .value|tostring )",
      "Alternatively, you could use string interpolation, e.g.",
      "\"\\(.key)=\\(.value)\""
    ],
    "utterance": "Print each key-value pair in the object as key=value, one per line, ensuring values are converted to strings when needed.",
    "expressions": [
      "to_entries | .[] | .key + \"=\" + (.value|tostring)",
      "to_entries | .[] | \"\\(.key)=\\(.value)\""
    ],
    "data": [
      {
        "input": {
          "REGION": "us-east-1",
          "EMAIL": "contain",
          "UPDATE": 1
        },
        "output": [
          "REGION=us-east-1",
          "EMAIL=contain",
          "UPDATE=1"
        ]
      }
    ],
    "identifier": 54184147,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "However this is not the case when variable is a part of json path:",
      ".auths.$tempvarjq.auth= $basejq",
      "jq: error: syntax error, unexpected '$', expecting FORMAT or QQSTRING_START (Unix shell quoting issues?) at <top-level>, line 1:",
      "Try this.",
      "jq -c --arg basejq \"${base}\" \\",
      "    --arg tempvarjq \"${tempvar}\" \\",
      "    '.auths[$tempvarjq].auth= $basejq'"
    ],
    "utterance": "Set the value of the 'auth' field under the object in 'auths' whose key matches the value of a variable, assigning it the value of another variable.",
    "expressions": [
      ".auths[$tempvarjq].auth = $basejq"
    ],
    "identifier": 54184803,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "What I'd like to receive as output is:\n\n    [\n      { \"email\":\"foo1@bar.com\", \"type\":\"open\", \"time\":\"t1\" },\n      { \"email\":\"foo1@bar.com\", \"type\":\"click\", \"time\":\"t2\" },\n      { \"email\":\"foo2@bar.com\", \"type\":\"open\", \"time\":\"t3\" },\n      { \"email\":\"foo2@bar.com\", \"type\":\"click\", \"time\":\"t4\" },\n      { \"email\":\"foo2@bar.com\", \"type\":\"open\", \"time\":\"t5\" }\n    ]",
      ".emails | map( {email}  + .events[] )",
      "[.emails[] | {email}  + .events[]]",
      ".emails | map( {email}  + (.events[] | del(.ignore) ) )"
    ],
    "utterance": "For each entry in the emails array, produce an array where each event is combined with the parent email address, removing the ignore field.",
    "expressions": [
      ".emails | map( {email}  + .events[] )",
      "[.emails[] | {email}  + .events[]]",
      ".emails | map( {email}  + (.events[] | del(.ignore) ) )"
    ],
    "data": [
      {
        "input": {
          "emails": [
            {
              "email": "foo1@bar.com",
              "events": [
                {
                  "type": "open",
                  "time": "t1",
                  "ignore": "this"
                },
                {
                  "type": "click",
                  "time": "t2",
                  "ignore": "this"
                }
              ]
            },
            {
              "email": "foo2@bar.com",
              "events": [
                {
                  "type": "open",
                  "time": "t3",
                  "ignore": "this"
                },
                {
                  "type": "click",
                  "time": "t4",
                  "ignore": "this"
                },
                {
                  "type": "open",
                  "time": "t5",
                  "ignore": "this"
                }
              ]
            }
          ]
        },
        "output": [
          {
            "email": "foo1@bar.com",
            "type": "open",
            "time": "t1"
          },
          {
            "email": "foo1@bar.com",
            "type": "click",
            "time": "t2"
          },
          {
            "email": "foo2@bar.com",
            "type": "open",
            "time": "t3"
          },
          {
            "email": "foo2@bar.com",
            "type": "click",
            "time": "t4"
          },
          {
            "email": "foo2@bar.com",
            "type": "open",
            "time": "t5"
          }
        ]
      }
    ],
    "identifier": 54187750,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am having trouble using `jq` in `bash` to read out:\r\n\r\n    xrb_1111111111111111111111111111111111111111111111111117353trpda\r\n    xrb_3t6k35gi95xu6tergt6p69ck76ogmitsa8mnijtpxm9fkcm736xtoncuohr3",
      "$ jq -r '.blocks | to_entries[].key' file",
      "jq -r '.blocks | keys_unsorted[]' input.json"
    ],
    "utterance": "Extract all top-level keys inside the blocks object.",
    "expressions": [
      ".blocks | to_entries[].key",
      ".blocks | keys_unsorted[]"
    ],
    "data": [
      {
        "input": {
          "blocks": {
            "xrb_1111111111111111111111111111111111111111111111111117353trpda": {
              "142A538F36833D1CC78B94E11C766F75818F8B940771335C6C1B8AB880C5BB1D": "6000000000000000000000000000000"
            },
            "xrb_3t6k35gi95xu6tergt6p69ck76ogmitsa8mnijtpxm9fkcm736xtoncuohr3": {
              "4C1FEEF0BEA7F50BE35489A1233FE002B212DEA554B55B1B470D78BD8F210C74": "106370018000000000000000000000000"
            }
          }
        },
        "output": [
          "xrb_1111111111111111111111111111111111111111111111111117353trpda",
          "xrb_3t6k35gi95xu6tergt6p69ck76ogmitsa8mnijtpxm9fkcm736xtoncuohr3"
        ]
      }
    ],
    "identifier": 54188847,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want something analogous to this:\n\n    $ `jq 'typeof(\"id\")' < test.json`\n    (output)>> string",
      "In your example you could check:\n\n    jq '.id|type==\"number\"' file.json",
      "You can use the `type` function in order to get the value type as a string."
    ],
    "utterance": "Return the type of the value associated with the key 'id'.",
    "expressions": [
      ".id | type"
    ],
    "data": [
      {
        "input": {
          "id": 50,
          "name": "Joe"
        },
        "output": "number"
      }
    ],
    "identifier": 54191177,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "just want to get some expert tips on you on how to grab only this from AutoScalingGroups:\r\n\r\n> awseb-e-ASG",
      "`awseb-e-ASG` is under .AutoScalingGroups\r\n\r\nso you could use the following filter:\r\n\r\n    .EnvironmentResources.AutoScalingGroups[].Name"
    ],
    "utterance": "Extract the value of the Name property from all objects in the AutoScalingGroups array under EnvironmentResources where the expected value is awseb-e-ASG.",
    "expressions": [
      ".EnvironmentResources.AutoScalingGroups[].Name"
    ],
    "data": [
      {
        "input": {
          "EnvironmentResources": {
            "EnvironmentName": "MY-APP",
            "AutoScalingGroups": [
              {
                "Name": "awseb-e-ASG"
              }
            ],
            "Triggers": [],
            "LoadBalancers": [
              {
                "Name": "awseb-e-ELB"
              }
            ],
            "Queues": [],
            "Instances": [
              {
                "Id": "i-XXXXXXXXXXXXXXXd"
              }
            ],
            "LaunchConfigurations": [
              {
                "Name": "awseb-e-LAUNCH"
              }
            ]
          }
        },
        "output": "awseb-e-ASG"
      }
    ],
    "identifier": 54192356,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to use JQ to group data per month starting at the first value :",
      "My month would be :",
      "- 2018 08 22 to 2018 09 21",
      "- 2018 09 22 to 2018 10 21",
      "- 2018 10 22 to 2018 11 21",
      "My expected output is the following :",
      "{\r\n  \"data\": [  \r\n    {\"month\": \"2018-08-22 to 2018-09-21\",\"sum\": 6},\r\n    {\"month\": \"2018-09-22 to 2018-10-21\",\"sum\": 12},\r\n    {\"month\": \"2018-10-23 to 2018-11-21\",\"sum\": 6}\r\n  ]\r\n}",
      "A solution is easily obtained with the following helper function:",
      "def bucket($day):\r\n  def prev: if .m > 1 then .m -= 1 else .m = 12 | .y -= 1 end;\r\n  def fmt: if .m <= 9 then \"\\(.y)-0\\(.m)\" else \"\\(.y)-\\(.m)\" end;\r\n  capture(\"(?<y>[0-9]+)-(?<m>[0-9]+)-(?<d>[0-9]+)\")\r\n  | map_values(tonumber)\r\n  | if .d >= $day then . else prev end\r\n  | fmt ;",
      ".data\r\n| (.[0].date | capture(\"(?<d>[0-9]+)$\") | .d | tonumber) as $day\r\n| reduce .[] as $pair ({};\r\n  .[$pair.date | bucket($day)] += ($pair.value) )"
    ],
    "utterance": "Group and sum values by custom monthly intervals starting from the first date, where each 'month' spans from the start date through the same date in the next month minus one day.",
    "expressions": [
      "def bucket($day):\n  def prev: if .m > 1 then .m -= 1 else .m = 12 | .y -= 1 end;\n  def fmt: if .m <= 9 then \"\\(.y)-0\\(.m)\" else \"\\(.y)-\\(.m)\" end;\n  capture(\"(?<y>[0-9]+)-(?<m>[0-9]+)-(?<d>[0-9]+)\")\n  | map_values(tonumber)\n  | if .d >= $day then . else prev end\n  | fmt ;\n\ndata\n| (.[0].date | capture(\"(?<d>[0-9]+)$\") | .d | tonumber) as $day\n| reduce .[] as $pair ({};\n  .[$pair.date | bucket($day)] += ($pair.value) )"
    ],
    "data": [
      {
        "input": {
          "data": [
            {
              "date": "2018-08-22",
              "value": 1
            },
            {
              "date": "2018-08-30",
              "value": 2
            },
            {
              "date": "2018-09-01",
              "value": 3
            },
            {
              "date": "2018-09-22",
              "value": 3
            },
            {
              "date": "2018-09-28",
              "value": 4
            },
            {
              "date": "2018-10-18",
              "value": 5
            },
            {
              "date": "2018-10-23",
              "value": 6
            }
          ]
        },
        "output": {
          "2018-08": 6,
          "2018-09": 12,
          "2018-10": 6
        }
      }
    ],
    "identifier": 54199974,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "val=text1,text2,text3",
      "jq --arg val \"$val\" '.whatever.item |= ($val | split(\",\"))' <<<'{\"whatever\": {}}'",
      "- The `jq` argument `--arg` is used to pass that variable from a shell context to a jq context.",
      "- The `|=` construct is used to modify a nested value while still evaluating to the larger document."
    ],
    "utterance": "Update a nested key with the contents of a shell variable split on commas.",
    "expressions": [
      ".whatever.item |= ($val | split(\",\"))"
    ],
    "data": [
      {
        "input": {
          "whatever": {}
        },
        "output": {
          "whatever": {
            "item": [
              "text1",
              "text2",
              "text3"
            ]
          }
        }
      }
    ],
    "identifier": 54200265,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How can I filter by the key to find only a given id (such as that for \"level2\")?",
      ".organizationalStructure | map(select(.key == \"level2\") | .id)[0]",
      "jq '.organizationalStructure[] |\n        select(.key == \"level1\") | {id: .id}'",
      "jq -r '.organizationalStructure[] |\n            select(.key == \"level1\") | {id: .id}.id'",
      "jq -r '.organizationalStructure[] |\n        select(.key == \"level1\").id'"
    ],
    "utterance": "Retrieve the id of the entry in organizationalStructure where key equals \"level2\".",
    "expressions": [
      ".organizationalStructure | map(select(.key == \"level2\") | .id)[0]",
      ".organizationalStructure[] | select(.key == \"level2\").id"
    ],
    "data": [
      {
        "input": {
          "organizationStructure": [
            {
              "id": 212119,
              "key": "level2"
            },
            {
              "id": 212112,
              "key": "level1"
            }
          ]
        },
        "output": 212119
      }
    ],
    "identifier": 54202642,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "With the following nested structure of companies, suppose the goal is to extract the name, ID, company and site for each *person* listed.  (We can ignore the address.)",
      "jq '.pears[]|{pear: .name,file: .somepath,toBeFiltered: (.appletypes[]|select(.name == \"orange\")|.bananas[]|{banana: .name,apples: .apples[]})}|{pear,file,banana: .toBeFiltered.banana,applestem: .toBeFiltered.apples.applestem,orangecomment: (.toBeFiltered.apples.peaches[]|select(.akey == \"string\")|.avalue.value),linenumber: (.toBeFiltered.apples.peaches[]|select(.akey == \"string\")|.line)}' realfile.json",
      ".company as $company\n| .sites[]\n| .name as $site\n| .personnel[]\n| { name, id: .UID, $company, $site }",
      "(.sites[] | ( (.personnel[] | { name, id: .UID} ) +  {site: .name} )) + {company}",
      "{name, id, company, site}"
    ],
    "utterance": "Extract the name, UID as id, company, and site for each person from each site's personnel in all companies, avoiding cross combinations.",
    "expressions": [
      ".company as $company | .sites[] | .name as $site | .personnel[] | { name, id: .UID, $company, $site }",
      "(.sites[] | ( (.personnel[] | { name, id: .UID} ) + {site: .name} )) + {company} | {name, id, company, site}"
    ],
    "data": [
      {
        "input": {
          "company": "Initrode",
          "sites": [
            {
              "name": "HQ",
              "address": "123 Main Street",
              "personnel": [
                {
                  "name": "John Smith",
                  "UID": 12345
                },
                {
                  "name": "Jane Doe",
                  "UID": 23456
                }
              ]
            },
            {
              "name": "Branch Office",
              "address": "Spodunk, Nowhereville",
              "personnel": [
                {
                  "name": "Fred Anderson",
                  "UID": 56789
                },
                {
                  "name": "Bill Jones",
                  "UID": 34567
                }
              ]
            }
          ]
        },
        "output": [
          {
            "name": "John Smith",
            "id": 12345,
            "company": "Initrode",
            "site": "HQ"
          },
          {
            "name": "Jane Doe",
            "id": 23456,
            "company": "Initrode",
            "site": "HQ"
          },
          {
            "name": "Fred Anderson",
            "id": 56789,
            "company": "Initrode",
            "site": "Branch Office"
          },
          {
            "name": "Bill Jones",
            "id": 34567,
            "company": "Initrode",
            "site": "Branch Office"
          }
        ]
      },
      {
        "input": {
          "company": "Inittech",
          "sites": [
            {
              "name": "Main Office",
              "address": "5678 Avenue Blvd",
              "personnel": [
                {
                  "name": "Fred Johnson",
                  "UID": 6543
                },
                {
                  "name": "James Fredson",
                  "UID": 9876
                }
              ]
            },
            {
              "name": "Testing Station",
              "address": "Alaskan Wilderness",
              "personnel": [
                {
                  "name": "Sally May",
                  "UID": 5432
                },
                {
                  "name": "Jack James",
                  "UID": 8765
                }
              ]
            }
          ]
        },
        "output": [
          {
            "name": "Fred Johnson",
            "id": 6543,
            "company": "Inittech",
            "site": "Main Office"
          },
          {
            "name": "James Fredson",
            "id": 9876,
            "company": "Inittech",
            "site": "Main Office"
          },
          {
            "name": "Sally May",
            "id": 5432,
            "company": "Inittech",
            "site": "Testing Station"
          },
          {
            "name": "Jack James",
            "id": 8765,
            "company": "Inittech",
            "site": "Testing Station"
          }
        ]
      }
    ],
    "identifier": 54210566,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "but I want to filter `inputs` somehow.",
      "**Where do I put the `select`?**",
      "reduce (inputs | select(Your_filter)) as $line\n([]; . + [$line])",
      "a simpler solution is:\n\n    [inputs | select(...)]"
    ],
    "utterance": "Collect all filtered input lines matching a condition into an array.",
    "expressions": [
      "reduce (inputs | select(...)) as $line ([]; . + [$line])",
      "[inputs | select(...)]"
    ],
    "identifier": 54213786,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input format (is it JSON lines, or plain text lines?), nor does it specify the condition to filter by, so the transformation is underspecified.",
      "other_environment": null
    }
  },
  {
    "context": [
      "what i need to do is have as output for each discrete value of the ll element, the unique values of ta, in a separate file, named after a one to one representation where each dd code is substituted with a human readable representation:",
      "Desired output, in a per row basis, each unique value of jlo with the count of times it was found in each dd element so we get in the end something like this:",
      "jq --arg dd d5f --raw-output 'map(select(.dd == $dd)) | group_by(.jlo) | map(\"\\(.[0].jlo) \\(length)\") | .[]' yourJsonFile > departmentone.txt",
      "jq --arg dd 5d9 --raw-output 'map(select(.dd == $dd)) | group_by(.jlo) | map(\"\\(.[0].jlo) \\(length)\") | .[]' yourJsonFile > departmentalt.txt",
      "jq --arg dd 1b1 --raw-output 'map(select(.dd == $dd)) | group_by(.jlo) | map(\"\\(.[0].jlo) \\(length)\") | .[]' yourJsonFile > departmentshort.txt",
      "You could extract those codes and labels to generate the files :",
      "jq -r --rawfile dd2name dd2name.tsv -f group.jq input.json |",
      "def dict:\n  split(\"\\n\") | map(select(length>0) | split(\"\\t\"))\n  | INDEX(.[0]) | map_values(.[1]);\n\n($dd2name | dict) as $dict\n| ($dict | keys_unsorted[]) as $dd\n| map(select(.dd == $dd))\n| group_by(.jlo)\n| map(\"\\($dict[$dd])\\t\\(.[0].jlo) \\(length)\")[]"
    ],
    "utterance": "For each value of dd, output the list of unique jlo values with their counts per dd, using a mapping from dd codes to human-readable names for output file naming.",
    "expressions": [
      "map(select(.dd == $dd)) | group_by(.jlo) | map(\"\\(.[0].jlo) \\(length)\") | .[]",
      "def dict:\n  split(\"\\n\") | map(select(length>0) | split(\"\\t\"))\n  | INDEX(.[0]) | map_values(.[1]);\n\n($dd2name | dict) as $dict\n| ($dict | keys_unsorted[]) as $dd\n| map(select(.dd == $dd))\n| group_by(.jlo)\n| map(\"\\($dict[$dd])\\t\\(.[0].jlo) \\(length)\")[]"
    ],
    "data": [
      {
        "input": [
          {
            "_id": {
              "$oid": "6d2"
            },
            "jlo": "\u0395\u0399 AJSB",
            "dd": "d5f"
          },
          {
            "_id": {
              "$oid": "c6d3"
            },
            "jlo": "\u0395\u0399 ALKSB",
            "dd": "5d9"
          },
          {
            "_id": {
              "$oid": "b0cc6d4"
            },
            "jlo": "\u0395\u0399 AGHTSB",
            "dd": "1b1"
          },
          {
            "_id": {
              "$oid": "6d2"
            },
            "jlo": "\u0395POW\u0399 AJSB",
            "dd": "d5f"
          },
          {
            "_id": {
              "$oid": "c6d3"
            },
            "jlo": "\u0395GT\u0399 ALKSB",
            "dd": "5d9"
          },
          {
            "_id": {
              "$oid": "b0cc6d4"
            },
            "jlo": "\u0395LK\u0399 AGHTSB",
            "dd": "1b1"
          }
        ],
        "output": {
          "departmentone.txt": [
            "\u0395\u0399 AJSB 1",
            "\u0395POW\u0399 AJSB 1"
          ],
          "departmentalt.txt": [
            "\u0395\u0399 ALKSB 1",
            "\u0395GT\u0399 ALKSB 1"
          ],
          "departshort.txt": [
            "\u0395\u0399 AGHTSB 1",
            "\u0395LK\u0399 AGHTSB 1"
          ]
        }
      }
    ],
    "identifier": 54217986,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I&#39;ve got 2 json files and want to merge them into one unique file",
      "Here the content of files:\r\n\r\nfile1:\r\n\r\n    {\"tag_id\" : [\"t1\"], \"inst_id\" : \"s1\"}\r\n    {\"tag_id\" : [\"t1\"], \"inst_id\" : \"s2\"}\r\n\r\nfile2:\r\n  \r\n\r\n    {\"tag_id\" : [\"t2\"], \"inst_id\" : \"s1\"}\r\n    {\"tag_id\" : [\"t2\"], \"inst_id\" : \"s2\"}\r\n    {\"tag_id\" : [\"t2\"], \"inst_id\" : \"s3\"}",
      "expected result:\r\n  \r\n\r\n    {\"tag_id\" : [\"t1\",\"t2\"], \"inst_id\" : \"s1\"}\r\n    {\"tag_id\" : [\"t1\",\"t2\"], \"inst_id\" : \"s2\"}\r\n    {\"tag_id\" : [\"t2\"], \"inst_id\" : \"s3\"}",
      "The task can now be accomplished as follows:\r\n\r\n    keywise_add(inputs | {(.inst_id): .tag_id} )\r\n    | keys_unsorted[] as $k\r\n    | {tag_id: .[$k], inst_id: $k}",
      "With the above program in add.jq, the invocation:\r\n\r\n    jq -c -n -f add.jq file1.json file2.json\r\n\r\nyields:\r\n\r\n    {\"tag_id\":[\"t1\",\"t2\"],\"inst_id\":\"s1\"}\r\n    {\"tag_id\":[\"t1\",\"t2\"],\"inst_id\":\"s2\"}\r\n    {\"tag_id\":[\"t2\"],\"inst_id\":\"s3\"}"
    ],
    "utterance": "Merge two files containing records with tag_id arrays and inst_id strings so that records with the same inst_id are combined, concatenating their tag_id arrays, and output one record per inst_id.",
    "expressions": [
      "def keywise_add(s):\n  reduce s as $x ({};\n     reduce ($x|keys_unsorted)[] as $k (.; \n       .[$k] += $x[$k]));\n\nkeywise_add(inputs | {(.inst_id): .tag_id} )\n| keys_unsorted[] as $k\n| {tag_id: .[$k], inst_id: $k}"
    ],
    "data": [
      {
        "input": [
          {
            "tag_id": [
              "t1"
            ],
            "inst_id": "s1"
          },
          {
            "tag_id": [
              "t1"
            ],
            "inst_id": "s2"
          },
          {
            "tag_id": [
              "t2"
            ],
            "inst_id": "s1"
          },
          {
            "tag_id": [
              "t2"
            ],
            "inst_id": "s2"
          },
          {
            "tag_id": [
              "t2"
            ],
            "inst_id": "s3"
          }
        ],
        "output": [
          {
            "tag_id": [
              "t1",
              "t2"
            ],
            "inst_id": "s1"
          },
          {
            "tag_id": [
              "t1",
              "t2"
            ],
            "inst_id": "s2"
          },
          {
            "tag_id": [
              "t2"
            ],
            "inst_id": "s3"
          }
        ]
      }
    ],
    "identifier": 54232764,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to get the average per month.",
      "My expected output is the following :",
      "{\r\n  \"data\": [  \r\n    {\"month\": \"2018-08\",\"avg\": 4},\r\n    {\"month\": \"2018-09\",\"avg\": 6},\r\n    {\"month\": \"2018-10\",\"avg\": 3}\r\n  ]\r\n}",
      "def bucket: sub(\"-\\\\d+$\"; \"\");",
      ".data\n| reduce .[] as $pair ({};\n    .[$pair.date | bucket] += [$pair.value])\n| {data: [to_entries[] | {month: .key, avg: (.value| add/length)}]}"
    ],
    "utterance": "Calculate the average of the value field for each month and output an array of objects with the month and its average.",
    "expressions": [
      "def bucket: sub(\"-\\\\d+$\"; \"\");\n.data\n| reduce .[] as $pair ({};\n    .[$pair.date | bucket] += [$pair.value])\n| {data: [to_entries[] | {month: .key, avg: (.value | add/length)}]}"
    ],
    "data": [
      {
        "input": {
          "data": [
            {
              "date": "2018-08-22",
              "value": 3
            },
            {
              "date": "2018-08-30",
              "value": 5
            },
            {
              "date": "2018-09-01",
              "value": 5
            },
            {
              "date": "2018-09-22",
              "value": 9
            },
            {
              "date": "2018-09-28",
              "value": 4
            },
            {
              "date": "2018-10-18",
              "value": 2
            },
            {
              "date": "2018-10-23",
              "value": 1
            }
          ]
        },
        "output": {
          "data": [
            {
              "month": "2018-08",
              "avg": 4
            },
            {
              "month": "2018-09",
              "avg": 6
            },
            {
              "month": "2018-10",
              "avg": 3
            }
          ]
        }
      }
    ],
    "identifier": 54240276,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to replace only the one field `.features[].properties.name` with a random value from a 1d-array inside another txt file.",
      "This works because `shuf -r words.txt` creates an unending stream of words randomly chosen from `words.txt`, and the `jq -R .` inside the process substitution quotes those as strings.",
      "# actually run the replacements\njq -n --slurpfile buildings buildings.json '\n  def replaceName: (.properties.name |= input);\n  $buildings[] | (.features |= map(replaceName))\n' < <(shuf -r words.txt | jq -R .)"
    ],
    "utterance": "Replace each features[].properties.name with a random value chosen from a separate list of strings in an external file.",
    "expressions": [
      "jq -n --slurpfile buildings buildings.json '\n  def replaceName: (.properties.name |= input);\n  $buildings[] | (.features |= map(replaceName))\n' < <(shuf -r words.txt | jq -R .)"
    ],
    "data": [
      {
        "input": {
          "features": [
            {
              "type": "Feature",
              "properties": {
                "name": "0",
                "height": 0.7
              }
            },
            {
              "type": "Feature",
              "properties": {
                "name": "1",
                "height": 0
              }
            }
          ]
        },
        "output": {
          "features": [
            {
              "type": "Feature",
              "properties": {
                "name": "Third Word",
                "height": 0.7
              }
            },
            {
              "type": "Feature",
              "properties": {
                "name": "Second Word",
                "height": 0
              }
            }
          ]
        }
      }
    ],
    "identifier": 54241829,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to use `jq` to extract any appeareance of the key `Action` in a JSON file.",
      "I want to select cases in which .Action is the value \"*\" or is an array with \"*\" as an element",
      "As a plus, it would be good to include the piece of JSON that contains the `\"Action\"` up to some levels above.",
      "Given input similar to that shown in the Q, the filter immediately above would produce:\r\n\r\n    {\r\n      \"AssumeRolePolicyDocument\": {\r\n        \"Statement\": [\r\n          {\r\n            \"Action\": \"*\",\r\n            \"Effect\": \"Allow\",\r\n            \"Principal\": {\r\n              \"Service\": \"ec2.amazonaws.com\"\r\n            }\r\n          }\r\n        ]\r\n      }\r\n    }"
    ],
    "utterance": "Extract all objects where a key named Action exists, and its value is either \"*\" or an array containing \"*\", including enough parent structure to preserve the context.",
    "expressions": [
      ". as $in | reduce paths(objects and has(\"Action\") and (.Action == \"*\" or (.Action|index(\"*\")))) as $path ({}; setpath($path; $in | getpath($path)))"
    ],
    "data": [
      {
        "input": {
          "AssumeRolePolicyDocument": {
            "Version": "2012-10-17",
            "Statement": [
              {
                "Action": "*",
                "Effect": "Allow",
                "Principal": {
                  "Service": "ec2.amazonaws.com"
                }
              },
              {
                "Action": [
                  "not this",
                  "*",
                  "not that"
                ],
                "Effect": "Deny",
                "Principal": {
                  "Service": "lambda.amazonaws.com"
                }
              },
              {
                "Action": [
                  "somethingElse"
                ],
                "Effect": "Deny",
                "Principal": {
                  "Service": "s3.amazonaws.com"
                }
              }
            ]
          }
        },
        "output": {
          "AssumeRolePolicyDocument": {
            "Statement": [
              {
                "Action": "*",
                "Effect": "Allow",
                "Principal": {
                  "Service": "ec2.amazonaws.com"
                }
              },
              {
                "Action": [
                  "not this",
                  "*",
                  "not that"
                ],
                "Effect": "Deny",
                "Principal": {
                  "Service": "lambda.amazonaws.com"
                }
              }
            ]
          }
        }
      }
    ],
    "identifier": 54244859,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a big json file with lots of product data like what's below. I'm wanting to filter out based upon which website_ids they have.",
      "Sometimes a product has this for the website_id items:\n\n   \"website_ids\": {\n      \"item\": \"2\"\n   }\n\nWhich results in the error:\n\nCannot iterate over string (\"2\")",
      "Is there a way around this?",
      "After your edit, it's a bit more complicated, but it can be worked around by checking the type of .website_ids.item and then based off of that type, doing a contains check or a simple equality check:",
      "map((select((.website_ids.item | type) == \"array\") | select(.website_ids.item[] | contains(\"4\"))), (select((.website_ids.item | type) == \"string\") | select (.website_ids.item == \"4\")))"
    ],
    "utterance": "Filter the list to include only objects where the website_ids item field is either the string \"4\" or an array that contains \"4\".",
    "expressions": [
      "map(\n    (select((.website_ids.item | type) == \"array\") | select(.website_ids.item[] | contains(\"4\"))),\n    (select((.website_ids.item | type) == \"string\") | select (.website_ids.item == \"4\"))\n)"
    ],
    "data": [
      {
        "input": [
          {
            "product_id": "2",
            "sku": "PROD2",
            "name": "Product Name 2",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "15"
            },
            "website_ids": {
              "item": [
                "1",
                "4"
              ]
            }
          },
          {
            "product_id": "3",
            "sku": "PROD3",
            "name": "Product Name 3",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "15"
            },
            "website_ids": {
              "item": [
                "1",
                "2"
              ]
            }
          },
          {
            "product_id": "4",
            "sku": "PROD4",
            "name": "Product Name 4",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "12"
            },
            "website_ids": {
              "item": "4"
            }
          },
          {
            "product_id": "5",
            "sku": "PROD5",
            "name": "Product Name 5",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "13"
            },
            "website_ids": {
              "item": "2"
            }
          }
        ],
        "output": [
          {
            "product_id": "2",
            "sku": "PROD2",
            "name": "Product Name 2",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "15"
            },
            "website_ids": {
              "item": [
                "1",
                "4"
              ]
            }
          },
          {
            "product_id": "4",
            "sku": "PROD4",
            "name": "Product Name 4",
            "set": "4",
            "type": "simple",
            "category_ids": {
              "item": "12"
            },
            "website_ids": {
              "item": "4"
            }
          }
        ]
      }
    ],
    "identifier": 54254963,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Edit: Clarification. What I want to be able to do is, with the commented json, `json='{\"data\": {\"one\": \"1\", \"two\": \"2\", \"three\": \"3\"}}'` in the example code and have the final result of `json='{ \"data\": { \"four\": \"4\", \"one\": \"100\", \"two\": \"2\"}}'` dumped.",
      "I can read in the 'data' structure and assign the key/values correctly, but I'm not having any luck in figuring out how to embed the `{\"four\": ...}` construct into the `\"data\":  {...}` object.",
      "Edit 2: The answer to my second issue, in combination with peak's answer is the following:",
      "for key in \"${!aaname[@]}\"; do\n  printf '%s\\n%s\\n' \"$key\" \"${aaname[$key]}\"\ndone | jq -SRn '.data = ([inputs | {(.): input}] | add)'"
    ],
    "utterance": "Wrap a set of key/value pairs as an object inside a top-level object under the key 'data'.",
    "expressions": [
      ".data = ([inputs | {(.): input}] | add)"
    ],
    "data": [
      {
        "input": [
          "four",
          "4",
          "one",
          "100",
          "two",
          "2"
        ],
        "output": {
          "data": {
            "four": "4",
            "one": "100",
            "two": "2"
          }
        }
      }
    ],
    "identifier": 54300938,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "And I'd like to merge them in order to obtain the following result where each of the `vars` array of each section are merged together",
      "the keys (here `one` and `two`) would be arbitrary and an undefined number of them could be present.",
      "the `vars` array would not contain duplicate (based on `name`) and right precedence would be applied to override values from the first array.",
      "here is a solution which is oblivious to the top-level key names, but which does not attempt to avoid duplicates:\n\n    $A\n    | reduce keys_unsorted[] as $k (.;\n        if .[$k] | (type == \"object\") and has(\"vars\")\n        then (.[$k]|.vars) += ($B[$k]|.vars) else . end )",
      "To avoid duplicates, I'd suggest writing a helper function to do just that, as illustrated in the following section.",
      "def extend(stream):\n  reduce stream as $s (.;\n    (map(.name) | index($s|.name)) as $i\n    | if $i then .[$i] += $s\n\t  else . + [$s]\n      end) ;\n\n\n$A\n| reduce keys_unsorted[] as $k (.;\n    if .[$k] | (type == \"object\") and has(\"vars\")\n    then (.[$k].vars) = ( .[$k].vars | extend(($B[$k].vars[])))\n    else . end\n  )"
    ],
    "utterance": "Merge two objects so that for each arbitrary top-level key containing a 'vars' array, the arrays are concatenated and duplicate entries by 'name' are eliminated with later values taking precedence.",
    "expressions": [
      "def extend(stream):\n  reduce stream as $s (.;\n    (map(.name) | index($s|.name)) as $i\n    | if $i then .[$i] += $s\n      else . + [$s]\n      end) ;\n\n$A\n| reduce keys_unsorted[] as $k (.;\n    if .[$k] | (type == \"object\") and has(\"vars\")\n    then (.[$k].vars) = ( .[$k].vars | extend(($B[$k].vars[])))\n    else . end\n  )"
    ],
    "data": [
      {
        "input": [
          {
            "one": {
              "vars": [
                {
                  "name": "a",
                  "value": "a"
                },
                {
                  "name": "b",
                  "value": "b"
                }
              ]
            },
            "two": {
              "vars": [
                {
                  "name": "c",
                  "value": "c"
                },
                {
                  "name": "d",
                  "value": "d"
                }
              ]
            },
            "extras": "whatever"
          },
          {
            "one": {
              "vars": [
                {
                  "name": "e",
                  "value": "e"
                },
                {
                  "name": "f",
                  "value": "f"
                }
              ]
            },
            "two": {
              "vars": [
                {
                  "name": "g",
                  "value": "g"
                },
                {
                  "name": "h",
                  "value": "h"
                }
              ]
            }
          }
        ],
        "output": {
          "one": {
            "vars": [
              {
                "name": "a",
                "value": "a"
              },
              {
                "name": "b",
                "value": "b"
              },
              {
                "name": "e",
                "value": "e"
              },
              {
                "name": "f",
                "value": "f"
              }
            ]
          },
          "two": {
            "vars": [
              {
                "name": "c",
                "value": "c"
              },
              {
                "name": "d",
                "value": "d"
              },
              {
                "name": "g",
                "value": "g"
              },
              {
                "name": "h",
                "value": "h"
              }
            ]
          },
          "extras": "whatever"
        }
      }
    ],
    "identifier": 54306265,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to get the value from only one of these object, but I only know the \"name\" value.",
      "I've tried somthing like this:",
      "echo $objects | jq '.name == \"thing2\"'",
      "But I need to get the values from this object, how do I achive this?",
      "echo \"$objects\" | jq 'select(.name == \"thing2\")'"
    ],
    "utterance": "Extract the object whose name property is equal to \"thing2\".",
    "expressions": [
      "select(.name == \"thing2\")"
    ],
    "data": [
      {
        "input": [
          {
            "name": "thing1",
            "important_value": "42",
            "other_value": "27"
          },
          {
            "name": "thing2",
            "important_value": "13",
            "other_value": "5"
          },
          {
            "name": "thing3",
            "important_value": "788",
            "other_value": "58"
          }
        ],
        "output": {
          "name": "thing2",
          "important_value": "13",
          "other_value": "5"
        }
      }
    ],
    "identifier": 54309985,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to add a default string \"In Process\" if the status is empty or Null.",
      "Based on Item condition, using the query below and take the first object from the filtered results.",
      ".amazon.items[] | select(.name | contains (\"shoes\"))",
      "Expected Output:\n\n    {\n        \"activeitem\": 2,\n        \"item\": {\n          \"id\": 2,\n          \"name\": \"adidas shoes\",\n          \"state\": \"in inventory\",\n          \"status\": \"IN PROCESS\"\n        }\n      }\n",
      "the key is to use `|=`:",
      ".amazon\n| .items[] |= (if (.name | contains (\"shoes\")) and .status == null \n               then .status=\"IN PROCESS\" else . end)",
      ".amazon\n| .items |= map(select(.name | contains (\"shoes\"))\n                | if .status == null then .status=\"IN PROCESS\"\n                  else . end)"
    ],
    "utterance": "Get the first item whose name contains \"shoes\" and output its id, name, state, and status (use \"IN PROCESS\" if status is empty or null), together with the activeitem value.",
    "expressions": [
      ".amazon.items[] | select(.name | contains(\"shoes\")) | {id, name, state, status: (if (.status == null or .status == \"\") then \"IN PROCESS\" else .status end)} as $item | {activeitem: .amazon.activeitem, item: $item} | first",
      ".amazon | {activeitem, item: (items[] | select(.name | contains(\"shoes\")) | {id, name, state, status: (if (.status == null or .status == \"\") then \"IN PROCESS\" else .status end)})}"
    ],
    "data": [
      {
        "input": {
          "amazon": {
            "activeitem": 2,
            "items": [
              {
                "id": 1,
                "name": "harry potter",
                "state": "sold"
              },
              {
                "id": 2,
                "name": "adidas shoes",
                "state": "in inventory"
              },
              {
                "id": 3,
                "name": "watch",
                "state": "returned"
              },
              {
                "id": 4,
                "name": "Nike shoes",
                "state": "in inventory"
              }
            ]
          }
        },
        "output": {
          "activeitem": 2,
          "item": {
            "id": 2,
            "name": "adidas shoes",
            "state": "in inventory",
            "status": "IN PROCESS"
          }
        }
      }
    ],
    "identifier": 54332521,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "The goal is to increment the `startTime` by adding the given `duration`. `startTime` is in milliseconds whereas `duration` can be any time unit (ms, sec, min, hr, day, week, month, year) which is provided with `unit`.",
      "Here&#39;s a simple solution that takes into account leap years:",
      "def later($n; $unit):\n  def dict: { YEAR:0, MONTH:1, DAY:2, HOUR:3, MINUTE:4, SECOND:5, MIN:4, SEC:5};\n  gmtime\n  | .[dict[$unit]] += $n\n  | mktime ;"
    ],
    "utterance": "Increment a timestamp in milliseconds by a specified duration and unit (e.g., add 4 days to a start time), where the unit can be day, month, year, hour, minute, or second.",
    "expressions": [
      "def later($n; $unit):\n  def dict: { YEAR:0, MONTH:1, DAY:2, HOUR:3, MINUTE:4, SECOND:5, MIN:4, SEC:5};\n  gmtime\n  | .[dict[$unit]] += $n\n  | mktime ;\n\n(.startTime.long / 1000 | floor | later(.applicationDuration.duration; .applicationDuration.unit)) * 1000"
    ],
    "data": [
      {
        "input": {
          "applicationDuration": {
            "duration": 4,
            "unit": "DAY"
          },
          "startTime": {
            "long": 1539196595567
          }
        }
      }
    ],
    "identifier": 54333170,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to select the particular item from the array using `contains` and get the first item using JQ.",
      "Expected Result:\r\n\r\n    {\r\n      \"activeitem\": 2,\r\n      \"item\": {\r\n        \"id\": 2,\r\n        \"name\": \"adidas shoes\",\r\n        \"state\": \"in inventory\"\r\n      }\r\n    }",
      "If you really want to change the name 'items' to 'item', you could tweak the above as follows:\r\n\r\n    .amazon\r\n    | .item = (.items | map(select(.name | contains (\"shoes\")))[0])\r\n    | del(.items)"
    ],
    "utterance": "Select the first array element whose name contains \"shoes\", rename it as 'item', and keep 'activeitem', removing the original array.",
    "expressions": [
      ".amazon | .item = (.items | map(select(.name | contains (\"shoes\")))[0]) | del(.items)"
    ],
    "data": [
      {
        "input": {
          "amazon": {
            "activeitem": 2,
            "items": [
              {
                "id": 1,
                "name": "harry potter",
                "state": "sold"
              },
              {
                "id": 2,
                "name": "adidas shoes",
                "state": "in inventory"
              },
              {
                "id": 3,
                "name": "watch",
                "state": "returned"
              },
              {
                "id": 4,
                "name": "adidas shoes",
                "state": "in inventory"
              }
            ]
          }
        },
        "output": {
          "activeitem": 2,
          "item": {
            "id": 2,
            "name": "adidas shoes",
            "state": "in inventory"
          }
        }
      }
    ],
    "identifier": 54334293,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Input:",
      "{\n  \"a\": [\n    {\n      \"b\": \"c\",\n      \"d\": \"e\"\n    },\n    {\n      \"b\": \"f\",\n      \"d\": \"g\"\n    }\n  ],\n  \"h\": [\n    {\n      \"b\": \"c\",\n      \"i\": \"j\"\n    },\n    {\n      \"b\": \"f\",\n      \"i\": \"k\"\n    }\n  ]\n}",
      "desired output:",
      "{\n  \"l\": [\n    {\n      \"b\": \"c\",\n      \"d\": \"e\",\n      \"i\": \"j\"\n    },\n    {\n      \"b\": \"f\",\n      \"d\": \"g\",\n      \"i\": \"k\"\n    }\n  ]\n}",
      "The key to a simple solution is `transpose`:\n\n    [.a, .h]\n    | transpose\n    | map(add)\n    | {l: .}"
    ],
    "utterance": "Merge each pair of objects from arrays 'a' and 'h' that have the same index into a single object and output them as array 'l'.",
    "expressions": [
      "[.a, .h] | transpose | map(add) | {l: .}"
    ],
    "data": [
      {
        "input": {
          "a": [
            {
              "b": "c",
              "d": "e"
            },
            {
              "b": "f",
              "d": "g"
            }
          ],
          "h": [
            {
              "b": "c",
              "i": "j"
            },
            {
              "b": "f",
              "i": "k"
            }
          ]
        },
        "output": {
          "l": [
            {
              "b": "c",
              "d": "e",
              "i": "j"
            },
            {
              "b": "f",
              "d": "g",
              "i": "k"
            }
          ]
        }
      }
    ],
    "identifier": 54350532,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "If you have all the values as environment variables, you could just utilize the `env` object to get the values. The syntax then becomes trivial.",
      "jq -n 'env | {\n    systemTime,\n    sensorType,\n    sensorLocation,\n    kernalName,\n    nodeName,\n    kernalRelease,\n    machine,\n    processor,\n    hardwarePlatform,\n    operatingSystem,\n    timeup,\n    softVersion \n}'",
      "jq_args=(\n   --arg systemTime \"$systemTime\"\n   --arg sensorType \"$sensorType\"\n   --arg sensorLocation \"$sensorLocation\"\n   --arg sensorCommand \"$sensorCommand\"\n   --arg kernalName \"$kernalName\"\n   --arg nodeName \"$nodeName\"\n   --arg kernalRelease \"$kernalRelease\"\n   --arg kernalVersion \"$kernalVersion\" \n   --arg machine \"$machine\"\n   --arg processor \"$processor\"\n   --arg hardwarePlatform \"$hardwarePlatform\"\n   --arg operatingSystem \"$operatingSystem\"\n   --arg timeup \"$timeup\"\n   --arg softVersion \"$softVersion\"\n)\nJSON_STRING=$( jq -n \"${jq_args[@]}\" '{\n   systemTime: $systemTime,\n   sensorType: $sensorType,\n   sensorLocation: $sensorLocation,\n   kernalName: $kernalName,\n   nodeName: $nodeName,\n   kernalRelease: $kernalRelease,\n   machine: $machine,\n   processor: $processor,\n   hardwarePlatform: $hardwarePlatform,\n   operatingSystem: $operatingSystem,\n   timeup: $timeup,\n   softVersion: $softVersion \n }' )"
    ],
    "utterance": "Create an object with fields systemTime, sensorType, sensorLocation, kernalName, nodeName, kernalRelease, machine, processor, hardwarePlatform, operatingSystem, timeup, and softVersion, using the values from corresponding shell variables or environment variables.",
    "expressions": [
      "jq -n 'env | {\n    systemTime,\n    sensorType,\n    sensorLocation,\n    kernalName,\n    nodeName,\n    kernalRelease,\n    machine,\n    processor,\n    hardwarePlatform,\n    operatingSystem,\n    timeup,\n    softVersion \n}'",
      "jq_args=(\n   --arg systemTime \"$systemTime\"\n   --arg sensorType \"$sensorType\"\n   --arg sensorLocation \"$sensorLocation\"\n   --arg sensorCommand \"$sensorCommand\"\n   --arg kernalName \"$kernalName\"\n   --arg nodeName \"$nodeName\"\n   --arg kernalRelease \"$kernalRelease\"\n   --arg kernalVersion \"$kernalVersion\" \n   --arg machine \"$machine\"\n   --arg processor \"$processor\"\n   --arg hardwarePlatform \"$hardwarePlatform\"\n   --arg operatingSystem \"$operatingSystem\"\n   --arg timeup \"$timeup\"\n   --arg softVersion \"$softVersion\"\n)\nJSON_STRING=$( jq -n \"${jq_args[@]}\" '{\n   systemTime: $systemTime,\n   sensorType: $sensorType,\n   sensorLocation: $sensorLocation,\n   kernalName: $kernalName,\n   nodeName: $nodeName,\n   kernalRelease: $kernalRelease,\n   machine: $machine,\n   processor: $processor,\n   hardwarePlatform: $hardwarePlatform,\n   operatingSystem: $operatingSystem,\n   timeup: $timeup,\n   softVersion: $softVersion \n }' )"
    ],
    "identifier": 54351510,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to extract the \r\nSecurityGroups>GroupId if any of the SecurityGroups>IpPermissions>IpRanges>CidrIp matches a certain IP. \r\n\r\nFor instance, searching for 11.11.11.11 should return sg-3jf32kj3j. There is a possibility that multiple SecurityGroups will contain that IP. I need to return every GroupId that does.",
      "The following filter meets the stated requirements:\r\n\r\n    .SecurityGroups[]\r\n    | select( any(.IpPermissions[].IpRanges[];\r\n                  .CidrIp | startswith(\"11.11.11.11/\") ) )\r\n    | .GroupId\r\n\r\nIn particular, with your input it yields the required value:\r\n\r\n    \"sg-3jf32kj3j\"",
      "You might want to consider more sophisticated pattern-matching, perhaps using:\r\n\r\n    test(\"^11.11.11.11($|/)\")"
    ],
    "utterance": "Return all GroupId values for which any IpRanges entry in any IpPermissions object contains a CidrIp that matches 11.11.11.11.",
    "expressions": [
      ".SecurityGroups[] | select( any(.IpPermissions[].IpRanges[]; .CidrIp | startswith(\"11.11.11.11/\")) ) | .GroupId",
      ".SecurityGroups[] | select( any(.IpPermissions[].IpRanges[]; .CidrIp | test(\"^11.11.11.11($|/)\") ) ) | .GroupId"
    ],
    "data": [
      {
        "input": {
          "SecurityGroups": [
            {
              "OwnerId": "111111111",
              "Description": "default VPC security group",
              "GroupId": "sg-1a1a1a1a1",
              "VpcId": "vpc-1a1a1a1a1",
              "IpPermissionsEgress": [
                {
                  "IpProtocol": "-1",
                  "PrefixListIds": [],
                  "Ipv6Ranges": [],
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "0.0.0.0/0"
                    }
                  ]
                }
              ],
              "GroupName": "default",
              "IpPermissions": [
                {
                  "IpProtocol": "-1",
                  "PrefixListIds": [],
                  "Ipv6Ranges": [],
                  "UserIdGroupPairs": [
                    {
                      "GroupId": "sg-5df45d5d5",
                      "UserId": "234234234"
                    }
                  ],
                  "IpRanges": []
                }
              ]
            },
            {
              "OwnerId": "22222222222",
              "Description": "EC2 Security Group",
              "Tags": [
                {
                  "Key": "aws:cloudformation:logical-id",
                  "Value": "EC2SecurityGroup"
                },
                {
                  "Key": "aws:cloudformation:stack-id",
                  "Value": "arn:aws:cloudformation:us-west-2:111111111:stack/blah-prod-vpc/asdfsdf-j3j3-22j1-39fj-sadfsadf"
                },
                {
                  "Key": "Name",
                  "Value": "blah-production-EC2"
                },
                {
                  "Key": "Owner",
                  "Value": "blah@blah.com"
                },
                {
                  "Key": "aws:cloudformation:stack-name",
                  "Value": "prod-vpc"
                },
                {
                  "Key": "Created",
                  "Value": "2018-05-21T09:40:55-07:00"
                }
              ],
              "GroupId": "sg-3jf32kj3j",
              "VpcId": "vpc-3kj3f2kj3",
              "IpPermissionsEgress": [
                {
                  "IpProtocol": "-1",
                  "PrefixListIds": [],
                  "Ipv6Ranges": [],
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "0.0.0.0/0"
                    }
                  ]
                }
              ],
              "GroupName": "blah-prod-vpc-EC2SecurityGroup-SJHS78F78SSH",
              "IpPermissions": [
                {
                  "IpProtocol": "tcp",
                  "ToPort": 80,
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "0.0.0.0/0"
                    }
                  ],
                  "FromPort": 80,
                  "PrefixListIds": [],
                  "Ipv6Ranges": []
                },
                {
                  "IpProtocol": "icmp",
                  "ToPort": 0,
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "11.11.11.11/32"
                    }
                  ],
                  "FromPort": 0,
                  "PrefixListIds": [],
                  "Ipv6Ranges": []
                },
                {
                  "IpProtocol": "-1",
                  "PrefixListIds": [],
                  "Ipv6Ranges": [],
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "22.22.22.22/16"
                    },
                    {
                      "CidrIp": "33.33.33.33/32"
                    }
                  ]
                },
                {
                  "IpProtocol": "tcp",
                  "ToPort": 22,
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "0.0.0.0/0"
                    }
                  ],
                  "FromPort": 22,
                  "PrefixListIds": [],
                  "Ipv6Ranges": [
                    {
                      "CidrIpv6": "::/0"
                    }
                  ]
                },
                {
                  "IpProtocol": "tcp",
                  "ToPort": 443,
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "0.0.0.0/0"
                    }
                  ],
                  "FromPort": 443,
                  "PrefixListIds": [],
                  "Ipv6Ranges": []
                },
                {
                  "IpProtocol": "icmp",
                  "ToPort": -1,
                  "UserIdGroupPairs": [],
                  "IpRanges": [
                    {
                      "CidrIp": "44.44.44.44/32"
                    },
                    {
                      "CidrIp": "55.55.55.55/29"
                    }
                  ],
                  "FromPort": -1,
                  "PrefixListIds": [],
                  "Ipv6Ranges": []
                }
              ]
            }
          ]
        },
        "output": [
          "sg-3jf32kj3j"
        ]
      }
    ],
    "identifier": 54353243,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "If I modify the filter in (it's an empty selection)\n\n    .results[0]|{label:\"a\",test:(.address_components[].types|select(.[0]| contains(\"administrative_area_level_3\"))|.[1]?)}\n\nI have a completely [empty result](https://jqplay.org/s/Wdqyvx-ZBS).\n\nHow to obtain something like\n\n    {\n      \"label\": \"a\",\n      \"test\": \"\"\n    }\n\nHow to manage it with jq?",
      "You can use the alternative operator `//`. The expression `E // \"\"` will produce the empty string if E is empty or `null` or `false`:\n\n    <file jq '.results[0]|{label:\"a\",test:((.address_components[].types|select(.[0]| contains(\"administrative_area_level_3\"))|.[1])//\"\")}'\n\nNotice that in the present case you don't need the `?` operator."
    ],
    "utterance": "Return an object with label \"a\" and test as the second element of the types array where the first element contains \"administrative_area_level_3\", or an empty string if no such element exists.",
    "expressions": [
      ".results[0]|{label:\"a\",test:((.address_components[].types|select(.[0]| contains(\"administrative_area_level_3\"))|.[1])//\"\")}"
    ],
    "data": [
      {
        "input": {
          "results": [
            {
              "address_components": [
                {
                  "long_name": "Amsterdam",
                  "short_name": "Amsterdam",
                  "types": [
                    "locality",
                    "political"
                  ]
                },
                {
                  "long_name": "Government of Amsterdam",
                  "short_name": "Government of Amsterdam",
                  "types": [
                    "administrative_area_level_2",
                    "political"
                  ]
                }
              ]
            }
          ]
        },
        "output": {
          "label": "a",
          "test": ""
        }
      }
    ],
    "identifier": 54361098,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Calling `curl localhost:8080/api` returns something like `[\"test1, \"test2\", \"test3\"]`",
      "I wanted to convert it to:",
      "test1",
      "test2",
      "test3",
      "$ echo '[\"test1\", \"test2\", \"test3\"]' | jq -r '.[]'",
      "test1",
      "test2",
      "test3"
    ],
    "utterance": "Print each string element of an array as a separate line, preserving order.",
    "expressions": [
      ".[]",
      "-r .[]"
    ],
    "data": [
      {
        "input": [
          "test1",
          "test2",
          "test3"
        ],
        "output": [
          "test1",
          "test2",
          "test3"
        ]
      }
    ],
    "identifier": 54375937,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to replace `$CONTENT` from template.json with the content under the tag \"content\" from the content.json file .",
      "jq -n --arg CONTENT \"$(jq -c .content sample.json)\" '{\"subject\": \"Some subject line\", \"content\": $CONTENT}'",
      "jq -n --arg CONTENT \"$(jq -c .content sample.json)\" -f template.jq"
    ],
    "utterance": "Insert the value under the key \"content\" from one file into another template at the position of a $CONTENT placeholder.",
    "expressions": [
      "jq -n --arg CONTENT \"$(jq -c .content sample.json)\" '{\"subject\": \"Some subject line\", \"content\": $CONTENT}'",
      "jq -n --arg CONTENT \"$(jq -c .content sample.json)\" -f template.jq"
    ],
    "data": [
      {
        "input": {
          "template.json": {
            "subject": "Some subject line",
            "content": "$CONTENT"
          },
          "sample.json": {
            "status": "ACTIVE",
            "id": 217,
            "type": "TEXT",
            "name": "string",
            "subject": "string",
            "url": "contenttemplates/217",
            "content": {
              "text": "hello ${user_name}",
              "variables": [
                {
                  "key": "${user_name}",
                  "value": null
                }
              ]
            },
            "content_footer": null,
            "audit": {
              "creator": "1000",
              "timestamp": 1548613800000,
              "product": "2",
              "channel": "10",
              "party": null,
              "event": {
                "type": null,
                "type_id": "0",
                "txn_id": "0"
              },
              "client_key": "pk6781gsfr5"
            }
          }
        },
        "output": {
          "subject": "Some subject line",
          "content": {
            "text": "hello ${user_name}",
            "variables": [
              {
                "key": "${user_name}",
                "value": null
              }
            ]
          }
        }
      }
    ],
    "identifier": 54402890,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to extract value from a JSON file using `jq` on Ubuntu. When I `echo` out directly, it works and returns the value to the console. But when I try to assign the extracted value to a variable and `echo` out, returns nothing to console.",
      "The following works:\n\n    #!/bin/sh\n    echo '{\"USERNAME\":\"TRX-101\",\"PASSWORD\":\":g&fg#/H\"}' | jq .PASSWORD",
      "The following does NOT work:\n\n    #!/bin/sh\n    VAR = '{\"USERNAME\":\"TRX-101\",\"PASSWORD\":\":g&fg#/H\"}' | jq .PASSWORD\n    echo $VAR",
      "My ultimate goal is to use the value from `$VAR` to do other operations within the shell script"
    ],
    "utterance": "Assign the extracted value for the field PASSWORD from the input object to a shell variable for later use.",
    "expressions": [
      "VAR=$(echo '{\"USERNAME\":\"TRX-101\",\"PASSWORD\":\":g&fg#/H\"}' | jq -r .PASSWORD)"
    ],
    "data": [
      {
        "input": {
          "USERNAME": "TRX-101",
          "PASSWORD": ":g&fg#/H"
        },
        "output": ":g&fg#/H"
      }
    ],
    "identifier": 54407540,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "rule = \"\"\"[{id: .id, \r\n            tid: .things[].tId,\r\n            this: .things[].objs[].this}]\"\"\"",
      "The problem is that your \"rule\" creates a Cartesian product, whereas in effect you want nested iteration.",
      "So presumably your rule should look something like this:\r\n\r\n    [{id} + (.things[] | {tid: .tId} + (.objs[] | {this}))]",
      "or if you want to make the nested iteration clearer:\r\n\r\n    [ .id as $id\r\n      | .things[] as $thing\r\n      | $thing.objs[]\r\n      | {id: $id, this, tid: $thing.tId} ]"
    ],
    "utterance": "Produce an array containing objects with fields id, tid, and this, by pairing each id with the corresponding tid and this value from nested things[].objs[].this values.",
    "expressions": [
      "[{id} + (.things[] | {tid: .tId} + (.objs[] | {this}))]",
      "[ .id as $id | .things[] as $thing | $thing.objs[] | {id: $id, this, tid: $thing.tId} ]"
    ],
    "data": [
      {
        "input": {
          "id": 1,
          "things": [
            {
              "tId": 1,
              "objs": [
                {
                  "this": 99
                },
                {
                  "this": 100
                }
              ]
            },
            {
              "tId": 2,
              "objs": [
                {
                  "this": 222
                },
                {
                  "this": 22222
                }
              ]
            }
          ]
        },
        "output": [
          {
            "id": 1,
            "this": 99,
            "tid": 1
          },
          {
            "id": 1,
            "this": 100,
            "tid": 1
          },
          {
            "id": 1,
            "this": 222,
            "tid": 2
          },
          {
            "id": 1,
            "this": 22222,
            "tid": 2
          }
        ]
      }
    ],
    "identifier": 54426122,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Input:\r\n\r\n    {\r\n      \"a\": [\r\n        { \"b\": 1, \"d\": \"p\" },\r\n        { \"b\": 3, \"d\": \"r\" },\r\n        { \"b\": 4, \"d\": \"s\" }\r\n      ],\r\n      \"h\": [\r\n        { \"b\": 1, \"i\": \"k\" },\r\n        { \"b\": 2, \"i\": \"l\" },\r\n        { \"b\": 4, \"i\": \"n\" }\r\n      ]\r\n    }",
      "Desired output:\r\n\r\n    {\r\n      \"q\": [\r\n        { \"b\": 1, \"d\": \"p\", \"i\": \"k\" },\r\n        { \"b\": 2, \"i\": \"l\" },\r\n        { \"b\": 3, \"d\": \"r\" },\r\n        { \"b\": 4, \"d\": \"s\", \"i\": \"n\" }\r\n      ]\r\n    }",
      "Use `group_by` to group array elements by a filter's output (`.b` in this case):\r\n\r\n    jq '{ q : [ add | group_by(.b)[] | add ] }' file",
      "Here's a solution using the generic function `aggregate_by`, defined as follows:\r\n\r\n    def aggregate_by(s; f; g):\r\n      reduce s as $x  (null; .[$x|f] += [$x|g]);\r\n\r\nFirst, notice that with this def:\r\n\r\n    aggregate_by(.a[], .h[]; .b|tostring; .)[]\r\n    | add\r\n\r\nproduces the desired array except for the ordering of elements.  If the ordering of elements in the output array is unimportant, then the above could be tweaked to provide a simple solution that does not involve `group_by`, which entails sorting.\r\n\r\nTo produce the desired (sorted-by-b) output, we could tweak the above as follows:\r\n\r\n    {q: [aggregate_by(.a[], .h[]; .b|tostring; .)[]]\r\n        | map(add)\r\n        | sort_by(.b) }"
    ],
    "utterance": "Combine corresponding objects from two arrays with possible missing indexes, merging by 'b' field, and output a single array sorted by 'b' with merged properties.",
    "expressions": [
      "{ q : [ ([.a[], .h[]] | add) | group_by(.b)[] | add ] }",
      "{ q : [ add | group_by(.b)[] | add ] }",
      "def aggregate_by(s; f; g): reduce s as $x  (null; .[$x|f] += [$x|g]); {q: [aggregate_by(.a[], .h[]; .b|tostring; .)[]] | map(add) | sort_by(.b) }"
    ],
    "data": [
      {
        "input": {
          "a": [
            {
              "b": 1,
              "d": "p"
            },
            {
              "b": 3,
              "d": "r"
            },
            {
              "b": 4,
              "d": "s"
            }
          ],
          "h": [
            {
              "b": 1,
              "i": "k"
            },
            {
              "b": 2,
              "i": "l"
            },
            {
              "b": 4,
              "i": "n"
            }
          ]
        },
        "output": {
          "q": [
            {
              "b": 1,
              "d": "p",
              "i": "k"
            },
            {
              "b": 2,
              "i": "l"
            },
            {
              "b": 3,
              "d": "r"
            },
            {
              "b": 4,
              "d": "s",
              "i": "n"
            }
          ]
        }
      }
    ],
    "identifier": 54436748,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How would I take the data above and use jq to split it by the symbol field?",
      "For example I&#39;d like to end up with:\r\n\r\nAAPL.json:\r\n\r\n\r\n\t[{\r\n\t\t&quot;date&quot;: &quot;2016-03-23 00:00:00&quot;,\r\n\t\t&quot;symbol&quot;: &quot;AAPL&quot;,\r\n\t\t&quot;open&quot;: &quot;121.470001&quot;,\r\n\t\t&quot;close&quot;: &quot;119.379997&quot;,\r\n\t\t&quot;low&quot;: &quot;119.099998&quot;,\r\n\t\t&quot;high&quot;: &quot;121.470001&quot;,\r\n\t\t&quot;volume&quot;: &quot;444200.0&quot;\r\n\t},\r\n\t{\r\n\t\t&quot;date&quot;: &quot;2016-03-24 00:00:00&quot;,\r\n\t\t&quot;symbol&quot;: &quot;AAPL&quot;,\r\n\t\t&quot;open&quot;: &quot;118.889999&quot;,\r\n\t\t&quot;close&quot;: &quot;119.410004&quot;,\r\n\t\t&quot;low&quot;: &quot;117.639999&quot;,\r\n\t\t&quot;high&quot;: &quot;119.440002&quot;,\r\n\t\t&quot;volume&quot;: &quot;534100.0&quot;\r\n\t}]",
      "You could use a little shell loop:\r\n\r\n       \r\n    #!/bin/bash\r\n    jq -r &#39;.[].symbol&#39; stockprices.json | while read -r symbol ; do\r\n        jq --arg s &quot;${symbol}&quot; \\\r\n            &#39;map(if .symbol == $s then . else empty end)&#39; \\\r\n        stockprices.json &gt; &quot;${symbol}&quot;.json\r\n    done ",
      "jq -rc 'group_by(.symbol)[] | \"\\(.[0].symbol)\\t\\(.)\"' stockprices.json |\r\n    while IFS=$'\\t' read -r symbol content; do\r\n        echo \"${content}\" > \"${symbol}.json\"\r\n    done"
    ],
    "utterance": "Split a list of stock price objects into separate files based on the value of the 'symbol' field, so each file contains all entries for one symbol.",
    "expressions": [
      "jq -r '.[].symbol' stockprices.json | while read -r symbol ; do jq --arg s \"${symbol}\" 'map(if .symbol == $s then . else empty end)' stockprices.json > \"${symbol}\".json; done",
      "jq -rc 'group_by(.symbol)[] | \"\\(.[0].symbol)\\t\\(.)\"' stockprices.json | while IFS=$'\\t' read -r symbol content; do echo \"${content}\" > \"${symbol}.json\"; done"
    ],
    "data": [
      {
        "input": [
          {
            "date": "2016-03-22 00:00:00",
            "symbol": "ACN",
            "open": "121.029999",
            "close": "121.470001",
            "low": "120.720001",
            "high": "122.910004",
            "volume": "711400.0"
          },
          {
            "date": "2016-03-23 00:00:00",
            "symbol": "AAPL",
            "open": "121.470001",
            "close": "119.379997",
            "low": "119.099998",
            "high": "121.470001",
            "volume": "444200.0"
          },
          {
            "date": "2016-03-24 00:00:00",
            "symbol": "AAPL",
            "open": "118.889999",
            "close": "119.410004",
            "low": "117.639999",
            "high": "119.440002",
            "volume": "534100.0"
          }
        ],
        "output": {
          "AAPL.json": [
            {
              "date": "2016-03-23 00:00:00",
              "symbol": "AAPL",
              "open": "121.470001",
              "close": "119.379997",
              "low": "119.099998",
              "high": "121.470001",
              "volume": "444200.0"
            },
            {
              "date": "2016-03-24 00:00:00",
              "symbol": "AAPL",
              "open": "118.889999",
              "close": "119.410004",
              "low": "117.639999",
              "high": "119.440002",
              "volume": "534100.0"
            }
          ],
          "ACN.json": [
            {
              "date": "2016-03-22 00:00:00",
              "symbol": "ACN",
              "open": "121.029999",
              "close": "121.470001",
              "low": "120.720001",
              "high": "122.910004",
              "volume": "711400.0"
            }
          ]
        }
      }
    ],
    "identifier": 54450909,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  }
]