[
  {
    "context": [
      "Im provided with the array name, for example, it would be \"array1\", also with the json key of the variable, for example array1[\"1\"], that would retrieve me \"xx\" value.",
      "jq --arg a \"array1\" --arg k \"1\" -r '.[$a][][$k]//empty'"
    ],
    "utterance": "Given an object with array values, retrieve the value for a provided array name and key inside each array element.",
    "expressions": [
      "jq --arg a \"array1\" --arg k \"1\" -r '.[$a][][$k]//empty'"
    ],
    "data": [
      {
        "input": {
          "array1": [
            {
              "1": "xx"
            },
            {
              "2": "zz"
            },
            {
              "3": "zz"
            },
            {
              "4": "xx"
            }
          ],
          "array2": [
            {
              "1": "xx"
            },
            {
              "2": "zz"
            },
            {
              "3": "zz"
            },
            {
              "4": "xx"
            }
          ]
        },
        "output": [
          "xx"
        ]
      }
    ],
    "identifier": 72587691,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "and I wanted to get all the items where the name property is John, in the following format",
      "{\r\n    \"test1\": {\r\n        \"name\": \"John\"\r\n    },\r\n    \"test4\": {\r\n        \"name\": \"John\"\r\n    }\r\n}",
      "If I use the following JQ command: `.[] | select(.name | ascii_downcase | contains(\"john\"))` it only returns\n\n    {\n      \"name\": \"John\"\n    }\n    {\n      \"name\": \"John\"\n    }\nomitting the keys.",
      "Use the update operator `|=` to (un)select each item while keeping the outer context:\njq '.[] |= select(.name | ascii_downcase | contains(\"john\"))'",
      "To keep original structure, use map_values :\n\n```\nmap_values(select(.name | ascii_downcase | contains(\"john\")))\n```"
    ],
    "utterance": "Return all entries whose 'name' property (case-insensitive) contains 'john', preserving the original keys.",
    "expressions": [
      ".[] |= select(.name | ascii_downcase | contains(\"john\"))",
      "map_values(select(.name | ascii_downcase | contains(\"john\")))"
    ],
    "data": [
      {
        "input": {
          "test1": {
            "name": "John"
          },
          "test2": {
            "name": "Jack"
          },
          "test3": {
            "name": "Jim"
          },
          "test4": {
            "name": "John"
          }
        },
        "output": {
          "test1": {
            "name": "John"
          },
          "test4": {
            "name": "John"
          }
        }
      }
    ],
    "identifier": 72595047,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need a jq program that will allow me to pass the paths I want in the output document as --args on the command line.  All non-target paths are removed.",
      "Key names can be duplicated at different levels of the hierarchy, but only the path matching \"exactly\" should be included.  `.attr.x1` is always ignored in the examples provided.  In those examples, I want `.x1`, not `.attr.x1`.",
      "In practice, the input might be a small document with a dozen entries and < 20 keys to a reasonably large one with 100k entries and ~100 keys.  Performance is important, but not critical.",
      "Read or convert the path arguments as path arrays and Use `getpath` and `setpath` to filter and build your objects:",
      "jq '  map(. as $item | reduce ($ARGS.positional[] / \".\") as $p ({}; setpath($p; $item | getpath($p)) ))' file.json --args x1 attr.f1 attr.f3"
    ],
    "utterance": "Extract only the fields at specified paths provided as command-line arguments, including nested keys, and build new objects containing those exact paths with values or nulls for missing keys.",
    "expressions": [
      "map(. as $item | reduce ($ARGS.positional[] / \".\") as $p ({}; setpath($p; $item | getpath($p))))"
    ],
    "data": [
      {
        "input": [
          {
            "x1": "abc123",
            "x2": "Larry",
            "attr": {
              "f1": "one",
              "f2": [
                "two",
                "2"
              ],
              "f3": "three",
              "x1": "Never included in below examples as this is .attr.x1, not .x1"
            }
          },
          {
            "x1": "xyz789",
            "x2": "Curly",
            "attr": {
              "f1": [
                "one",
                "111"
              ],
              "f2": "two"
            }
          },
          {
            "x1": "def456",
            "x2": "moe",
            "attr": {
              "f4": {
                "a": 1,
                "b": "two"
              }
            }
          }
        ],
        "output": [
          {
            "x1": "abc123",
            "attr": {
              "f1": "one",
              "f3": "three"
            }
          },
          {
            "x1": "xyz789",
            "attr": {
              "f1": [
                "one",
                "111"
              ],
              "f3": null
            }
          },
          {
            "x1": "def456",
            "attr": {
              "f1": null,
              "f3": null
            }
          }
        ]
      },
      {
        "input": [
          {
            "x1": "abc123",
            "x2": "Larry",
            "attr": {
              "f1": "one",
              "f2": [
                "two",
                "2"
              ],
              "f3": "three",
              "x1": "Never included in below examples as this is .attr.x1, not .x1"
            }
          },
          {
            "x1": "xyz789",
            "x2": "Curly",
            "attr": {
              "f1": [
                "one",
                "111"
              ],
              "f2": "two"
            }
          },
          {
            "x1": "def456",
            "x2": "moe",
            "attr": {
              "f4": {
                "a": 1,
                "b": "two"
              }
            }
          }
        ],
        "output": [
          {
            "x1": "abc123",
            "x2": "Larry",
            "attr": {
              "f2": [
                "two",
                "2"
              ],
              "f4": null
            }
          },
          {
            "x1": "xyz789",
            "x2": "Curly",
            "attr": {
              "f2": "two",
              "f4": null
            }
          },
          {
            "x1": "def456",
            "x2": "moe",
            "attr": {
              "f2": null,
              "f4": {
                "a": 1,
                "b": "two"
              }
            }
          }
        ]
      }
    ],
    "identifier": 72601543,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have two JSON files that I want to merge together, keeping all records from the first file and merging in the fields from the second file based on a matching key field value.",
      "It's also OK if the `joinId`, `joinField1`, `joinField2` fields get added to the \"item4\" record, as long as they have `null` or empty string values.",
      "But then, you could also multiply out those two definitions to directly fit your use case:",
      "jq '\n  (reduce input[] as $row ({}; .[$row.joinId] = $row)) as $idx\n  | map(. + $idx[.join_id])\n' input1.json input2.json"
    ],
    "utterance": "Merge two arrays of objects by matching 'join_id' from the first array to 'joinId' in the second, adding fields from the second array to each matching object while keeping all objects from the first array.",
    "expressions": [
      "(reduce input[] as $row ({}; .[$row.joinId] = $row)) as $idx | map(. + $idx[.join_id])"
    ],
    "data": [
      {
        "input": [
          [
            {
              "id": "item1",
              "join_id": "123",
              "field1": "val1",
              "field2": "val2"
            },
            {
              "id": "item2",
              "join_id": "123",
              "field1": "val1",
              "field2": "val2"
            },
            {
              "id": "item3",
              "join_id": "456",
              "field1": "val3",
              "field2": "val4"
            },
            {
              "id": "item4",
              "join_id": "789",
              "field1": "val5",
              "field2": "val6"
            }
          ],
          [
            {
              "joinId": "123",
              "joinField1": "AAAAAA",
              "joinField2": "BBBBBB"
            },
            {
              "joinId": "456",
              "joinField1": "CCCCCC",
              "joinField2": "DDDDD"
            }
          ]
        ],
        "output": [
          {
            "id": "item1",
            "join_id": "123",
            "field1": "val1",
            "field2": "val2",
            "joinId": "123",
            "joinField1": "AAAAAA",
            "joinField2": "BBBBBB"
          },
          {
            "id": "item2",
            "join_id": "123",
            "field1": "val1",
            "field2": "val2",
            "joinId": "123",
            "joinField1": "AAAAAA",
            "joinField2": "BBBBBB"
          },
          {
            "id": "item3",
            "join_id": "456",
            "field1": "val3",
            "field2": "val4",
            "joinId": "456",
            "joinField1": "CCCCCC",
            "joinField2": "DDDDD"
          },
          {
            "id": "item4",
            "join_id": "789",
            "field1": "val5",
            "field2": "val6"
          }
        ]
      }
    ],
    "identifier": 72609247,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to find out lengths of user sessions given timestamps of individual visits.",
      "New session starts every time a delay between adjacent timestamps is longer than `limit`.",
      "for this set of timestamps ... and for value of `limit=10`, I need the following output:\n\n    [\n      3,\n      1,\n      2,\n      4\n    ]",
      "reduce .[] as $i (\n\t{prev:.[0], group:[], result:[]};\n\tif ($i - .prev > $limit)\n\t\tthen {prev:$i, group:[$i], result:(.result + [.group])}\n\t\telse {prev:$i, group:(.group + [$i]), result}\n\tend\n)\n| [(.result[], .group) | length]",
      "Here's a slightly modified version that just counts the values up.\n\n10 as $limit\n\t| reduce .[] as $i (\n\t\t{prev:.[0], count:0, result:[]};\n\t\tif ($i - .prev > $limit)\n\t\t\tthen {prev:$i, count:1, result:(.result + [.count])}\n\t\t\telse {prev:$i, count:(.count + 1), result}\n\t\tend\n\t)\n\t| [.result[], .count]",
      "using `indices` to calculate the breakpoint positions:\n\n10 as $limit\n| [\n    [0, indices(while(. != []; .[1:]) | select(.[0] + $limit <= .[1]))[] + 1, length]\n    | .[range(length-1):] | .[1] - .[0]\n  ]",
      "input: [\n  101,\n  102,\n  105,\n\n  116,\n\n  128,\n  129,\n\n  140,\n  145,\n  146,\n  152\n]"
    ],
    "utterance": "Return the lengths of contiguous sequences where each value is at most 10 greater than the previous; a new group starts when the gap exceeds 10.",
    "expressions": [
      "10 as $limit | reduce .[] as $i ( {prev:.[0], count:0, result:[]}; if ($i - .prev > $limit) then {prev:$i, count:1, result:(.result + [.count])} else {prev:$i, count:(.count + 1), result} end ) | [.result[], .count]",
      "10 as $limit | [ [0, indices(while(. != []; .[1:]) | select(.[0] + $limit <= .[1]))[] + 1, length] | .[range(length-1):] | .[1] - .[0] ]"
    ],
    "data": [
      {
        "input": [
          101,
          102,
          105,
          116,
          128,
          129,
          140,
          145,
          146,
          152
        ],
        "output": [
          3,
          1,
          2,
          4
        ]
      }
    ],
    "identifier": 72610402,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I can extract all dictionaries from the input where the \"uri\" field does not start with the string \"/themes\"",
      ".[] | select(.uri | startswith( \"/themes\" ) | not)",
      "Gives {\n  \"version\": 0,\n  \"permissionMask\": 2,\n  \"creationDate\": \"2022-06-09T14:42:43\",\n  \"updateDate\": \"2021-11-17T06:48:48\",\n  \"label\": \"workflow_icons_sprite@05x.png\",\n  \"uri\": \"/internal/default/images/workflow_icons_sprite@05x.png\",\n  \"resourceType\": \"file\"\n}"
    ],
    "utterance": "Extract all objects from an array where the uri field does not start with /themes.",
    "expressions": [
      ".[] | select(.uri | startswith(\"/themes\") | not)"
    ],
    "data": [
      {
        "input": [
          {
            "version": 0,
            "permissionMask": 2,
            "creationDate": "2022-06-09T14:42:41",
            "updateDate": "2021-11-17T06:48:52",
            "label": "webPageView.css",
            "uri": "/themes/default/webPageView.css",
            "resourceType": "file"
          },
          {
            "version": 0,
            "permissionMask": 2,
            "creationDate": "2022-06-09T14:42:43",
            "updateDate": "2021-11-17T06:48:48",
            "label": "workflow_icons_sprite@05x.png",
            "uri": "/internal/default/images/workflow_icons_sprite@05x.png",
            "resourceType": "file"
          }
        ],
        "output": {
          "version": 0,
          "permissionMask": 2,
          "creationDate": "2022-06-09T14:42:43",
          "updateDate": "2021-11-17T06:48:48",
          "label": "workflow_icons_sprite@05x.png",
          "uri": "/internal/default/images/workflow_icons_sprite@05x.png",
          "resourceType": "file"
        }
      }
    ],
    "identifier": 72614591,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "let's say I want to find all the companies where the CEO is named John.",
      "Use `and` to match both criteria, and use `any` within `select` to make pass a company if at least one of its employees fulfills the condition:",
      "map_values(select(any(.employees[]; .name == \"John\" and .title == \"CEO\")))"
    ],
    "utterance": "Find all companies where at least one employee's name is John and title is CEO.",
    "expressions": [
      "map_values(select(any(.employees[]; .name == \"John\" and .title == \"CEO\")))"
    ],
    "data": [
      {
        "input": {
          "company1": {
            "employees": [
              {
                "name": "John",
                "title": "CEO"
              },
              {
                "name": "Jason",
                "title": "CFO"
              },
              {
                "name": "Jackson",
                "title": "Sales Representative"
              }
            ]
          },
          "company2": {
            "employees": [
              {
                "name": "Amy",
                "title": "CEO"
              },
              {
                "name": "James",
                "title": "Software Engineer"
              },
              {
                "name": "John",
                "title": "Sales Representative"
              }
            ]
          }
        },
        "output": {
          "company1": {
            "employees": [
              {
                "name": "John",
                "title": "CEO"
              },
              {
                "name": "Jason",
                "title": "CFO"
              },
              {
                "name": "Jackson",
                "title": "Sales Representative"
              }
            ]
          }
        }
      }
    ],
    "identifier": 72623015,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to only perform this on brand_name and generic_name, while ensure that the manufacturer name is also first letter capitalized with the exception of things like LLC which should remain capitalized.",
      "Split at space characters to get an array of words, then split again at the empty string to get an array of characters. For the inner array, use `ascii_downcase` on all elements but the first, then put all back together using add on the inner and join with a space character on the outer array.",
      "To ignore certain words from being processed, capture them with an `if` condition:",
      "map_values((. / \" \") | map(\n  if IN(\"LLC\", \"AND\") then .\n  else . / \"\" | .[1:] |= map(ascii_downcase) | add end\n) | join(\" \"))",
      "Suppose we are given an array of words that are to be left as is, e.g.:\ndef exceptions: [\"LLC\", \"USA\"];",
      "We can then define a capitalization function as follows:\ndef capitalize:\n  INDEX(exceptions[]; .) as $e\n  | [splits(\"\\\\b\") | select(length>0)]\n  | map(if $e[.] then . else (.[:1]|ascii_upcase) + (.[1:] |ascii_downcase) end)\n  | join(\"\");"
    ],
    "utterance": "Capitalize the first letter of each word in 'brand_name' and 'generic_name', and in 'manufacturer' except for words like 'LLC' that should remain fully uppercase.",
    "expressions": [
      "(.brand_name, .generic_name) |= ((. / \" \") | map(. / \"\" | .[1:] |= map(ascii_downcase) | add) | join(\" \"))",
      "map_values((. / \" \") | map(if IN(\"LLC\", \"AND\") then . else . / \"\" | .[1:] |= map(ascii_downcase) | add end) | join(\" \"))",
      "def exceptions: [\"LLC\", \"USA\"];\ndef capitalize:\n  INDEX(exceptions[]; .) as $e\n  | [splits(\"\\\\b\") | select(length>0)]\n  | map(if $e[.] then . else (.[:1]|ascii_upcase) + (.[1:] |ascii_downcase) end)\n  | join(\"\");"
    ],
    "data": [
      {
        "input": {
          "brand_name": "VELTIN",
          "generic_name": "CLINDAMYCIN PHOSPHATE AND TRETINOIN",
          "manufacturer": "Almirall, LLC",
          "rxcui": "882548"
        },
        "output": {
          "brand_name": "Veltin",
          "generic_name": "Clindamycin Phosphate And Tretinoin",
          "manufacturer": "Almirall, LLC",
          "rxcui": "882548"
        }
      }
    ],
    "identifier": 72623107,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "What I want to achieve is to search for ```.*scsi0$``` in the values of the inner array and get ```sdb``` as the result.",
      "`.ids | to_entries[] | select(.value[] | test(\".*scsi0$\")) | .key`",
      "`select(.value[] | endswith(\"scsi0\"))`",
      "jq '.ids | to_entries[] | select(.value[] | endswith(\"scsi0\")) | .key'"
    ],
    "utterance": "Find the key of the object in 'ids' whose array contains at least one element ending with 'scsi0'.",
    "expressions": [
      ".ids | to_entries[] | select(.value[] | test(\".*scsi0$\")) | .key",
      ".ids | to_entries[] | select(.value[] | endswith(\"scsi0\")) | .key"
    ],
    "data": [
      {
        "input": {
          "ids": {
            "sda": [
              "scsi-0QEMU_QEMU_HARDDISK_drive-scsi2"
            ],
            "sdb": [
              "scsi-0QEMU_QEMU_HARDDISK_drive-scsi0"
            ],
            "sdb1": [
              "lvm-pv-uuid-lvld3A-oA4k-hC19-DXzv-D0Fq-xyME-BwgJid",
              "scsi-0QEMU_QEMU_HARDDISK_drive-scsi0-part1"
            ],
            "sdc": [
              "lvm-pv-uuid-pWes2W-dgYF-l8hG-La48-9ozH-hPdU-MOkOtf",
              "scsi-0QEMU_QEMU_HARDDISK_drive-scsi1"
            ]
          }
        },
        "output": "sdb"
      }
    ],
    "identifier": 72629126,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "\"jq -r ' .SecurityGroups[] | select(.IpPermissions[] | .FromPort == 22 and .IpRanges[].CidrIp == \\\"0.0.0.0/0\\\") | .GroupId'\"",
      "\"jq -r \\\".Reservations | .[] | .Instances | .[] | .InstanceId\\\"\""
    ],
    "utterance": "Extract the GroupId of all security groups that allow inbound SSH on port 22 from anywhere (0.0.0.0/0).",
    "expressions": [
      ".SecurityGroups[] | select(.IpPermissions[] | .FromPort == 22 and .IpRanges[].CidrIp == \"0.0.0.0/0\") | .GroupId"
    ],
    "identifier": 72633087,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "No example input or output is provided, so it's unclear what the input JSON structure is (e.g., AWS EC2 DescribeSecurityGroups output or something else). This makes it impossible to fully specify the jq transformation.",
      "other_environment": null
    }
  },
  {
    "context": [
      "How can I sort package json output **versions** by **datetime**?",
      "jq -r 'to_entries | sort_by(.value)[].key'"
    ],
    "utterance": "List the keys in the object, sorted by their associated date-time values.",
    "expressions": [
      "to_entries | sort_by(.value)[] .key",
      "to_entries | sort_by(.value) | .[].key"
    ],
    "data": [
      {
        "input": {
          "1.3.0-pr.515714": "2022-06-15T14:29:51.680Z",
          "created": "2022-06-14T23:22:13.692Z",
          "1.3.0-pr.515829": "2022-06-15T17:06:02.323Z",
          "1.3.0-pr.515697": "2022-06-15T13:45:24.534Z",
          "1.3.0-pr.515333": "2022-06-14T23:26:32.198Z",
          "1.3.0-pr.515861": "2022-06-15T17:30:36.429Z",
          "1.3.0-pr.515365": "2022-06-14T23:59:47.488Z",
          "1.3.0-pr.515497": "2022-06-15T02:15:01.895Z",
          "1.3.0-pr.515326": "2022-06-14T23:22:13.692Z",
          "1.3.0-pr.515479": "2022-06-15T02:01:56.968Z",
          "1.3.0": "2022-06-14T23:30:57.373Z",
          "1.3.0-pr.515482": "2022-06-15T02:05:14.094Z",
          "modified": "2022-06-15T17:30:36.429Z"
        },
        "output": [
          "created",
          "1.3.0-pr.515326",
          "1.3.0-pr.515333",
          "1.3.0",
          "1.3.0-pr.515365",
          "1.3.0-pr.515479",
          "1.3.0-pr.515482",
          "1.3.0-pr.515497",
          "1.3.0-pr.515697",
          "1.3.0-pr.515714",
          "1.3.0-pr.515829",
          "1.3.0-pr.515861",
          "modified"
        ]
      }
    ],
    "identifier": 72636440,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "And I want to change the value of `configName` which currently has a value of `config1`.",
      "I know I can do",
      ".configs[0].configName = \"foo\"",
      "But I don't want to rely on the position in the array of the one I want to change, how can I find that and then set the value?",
      ".configs |= map(select(.configName == \"config1\").configName |= \"foo-bar\")"
    ],
    "utterance": "Change the value of configName to \"foo-bar\" for the object in the configs array whose configName is \"config1\".",
    "expressions": [
      ".configs |= map(select(.configName == \"config1\").configName |= \"foo-bar\")"
    ],
    "data": [
      {
        "input": {
          "configs": [
            {
              "configName": "config1",
              "configTarget": "/app/appsettings.json",
              "uid": "0",
              "gid": "0",
              "mode": 292
            },
            {
              "configName": "config2",
              "configTarget": "/app/appsettings.json",
              "uid": "0",
              "gid": "0",
              "mode": 292
            }
          ]
        },
        "output": {
          "configs": [
            {
              "configName": "foo-bar",
              "configTarget": "/app/appsettings.json",
              "uid": "0",
              "gid": "0",
              "mode": 292
            },
            {
              "configName": "config2",
              "configTarget": "/app/appsettings.json",
              "uid": "0",
              "gid": "0",
              "mode": 292
            }
          ]
        }
      }
    ],
    "identifier": 72644640,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Instead of returning ALL key/value pairs, is there a way to modify that command to extract only the expiration date 1655384832 value?",
      "Then just specify which key you want:\r\n\r\n    jq -R 'split(\".\") | .[1] | @base64d | fromjson | .exp' <<< ..."
    ],
    "utterance": "Extract the expiration date value (1655384832) from the JWT payload.",
    "expressions": [
      "jq -R 'split(\".\") | .[1] | @base64d | fromjson | .exp'"
    ],
    "data": [
      {
        "input": "eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6IkpvaG4gRG9lIiwiaWF0IjoxNTE2MjM5MDIyLCJleHAiOjE2NTUzODQ4MzJ9.QVevY3qVLWHxfkNX1S4bas6PKSN75Nf96V7WmkcgbrE",
        "output": 1655384832
      }
    ],
    "identifier": 72645899,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to pretty print a json string I copied from an API call which contains escaped double quotes.",
      "What you have is a JSON string which happens to contain a JSON object. You can decode the contents of the string with the `fromjson` function.",
      "$ pbpaste | jq \".\"",
      "$ pbpaste | jq \"fromjson\"",
      "{\r\n  \"name\": \"Hans\",\r\n  \"Hobbies\": [\r\n    \"Car\",\r\n    \"Swimming\"\r\n  ]\r\n}"
    ],
    "utterance": "Pretty print a string which contains an encoded JSON object with escaped double quotes.",
    "expressions": [
      "fromjson"
    ],
    "data": [
      {
        "input": "\"{\\\"name\\\":\\\"Hans\\\", \\\"Hobbies\\\":[\\\"Car\\\",\\\"Swimming\\\"]}\"",
        "output": {
          "name": "Hans",
          "Hobbies": [
            "Car",
            "Swimming"
          ]
        }
      }
    ],
    "identifier": 72648459,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How can I make jq/yq consisently render an element in an array even as a single item.",
      "So you want every 3'th and 4'th `def` converted to an array with that object?",
      "Yes. Hopefully if I can see how that is done I can also see how to do convert `ex` and `kref` as well.",
      ".xdxf.lexicon.ar.def.def.def |= [ (.def |= [ . ]) ]"
    ],
    "utterance": "Wrap the 3rd-level and 4th-level 'def' properties in arrays, preserving their structure even when there is only one such element.",
    "expressions": [
      ".xdxf.lexicon.ar.def.def.def |= [ (.def |= [ . ]) ]"
    ],
    "data": [
      {
        "input": {
          "xdxf": {
            "@revision": "034",
            "lexicon": {
              "ar": {
                "k": {
                  "@xml:lang": "dyu",
                  "#text": "headword2"
                },
                "def": {
                  "@freq": "",
                  "co": null,
                  "def": {
                    "@xml:lang": "fr",
                    "def": {
                      "@def-id": "63491b7b-5d72-400c-95ee-f2e63f984832",
                      "gr": {
                        "abbr": null
                      },
                      "co": null,
                      "def": {
                        "@def-id": "32dc09fd-212f-42da-9f45-7d0fa5258c49",
                        "deftext": {
                          "dtrn": "definition1"
                        },
                        "ex": {
                          "@type": "exm",
                          "@ex-id": "dda450aa-121b-415f-92e4-63366fd78fb5",
                          "ex_orig": null,
                          "ex_tran": null
                        },
                        "etm": null,
                        "categ": null
                      }
                    }
                  },
                  "sr": {
                    "kref": {
                      "@type": "spv"
                    }
                  }
                }
              }
            }
          }
        },
        "output": {
          "xdxf": {
            "@revision": "034",
            "lexicon": {
              "ar": {
                "k": {
                  "@xml:lang": "dyu",
                  "#text": "headword2"
                },
                "def": {
                  "@freq": "",
                  "co": null,
                  "def": {
                    "@xml:lang": "fr",
                    "def": [
                      {
                        "@def-id": "63491b7b-5d72-400c-95ee-f2e63f984832",
                        "gr": {
                          "abbr": null
                        },
                        "co": null,
                        "def": [
                          {
                            "@def-id": "32dc09fd-212f-42da-9f45-7d0fa5258c49",
                            "deftext": {
                              "dtrn": "definition1"
                            },
                            "ex": {
                              "@type": "exm",
                              "@ex-id": "dda450aa-121b-415f-92e4-63366fd78fb5",
                              "ex_orig": null,
                              "ex_tran": null
                            },
                            "etm": null,
                            "categ": null
                          }
                        ]
                      }
                    ]
                  },
                  "sr": {
                    "kref": {
                      "@type": "spv"
                    }
                  }
                }
              }
            }
          }
        }
      }
    ],
    "identifier": 72651120,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      ".. | .keyA?, .keyB?",
      "But now I'd like to format the output like:",
      "{'key': 'keyA', 'value': {...}}",
      "{'key': 'keyB', 'value': {...}}",
      "Is it possible to reform the data so that the output of the search function gets labeled with the key that yielded the object?",
      "to_entries will turn an object into an array of key-value pairs. Then, checking the key name explicitly enables you to also find values of null.",
      ".. | objects | to_entries[] | select(.key == (\"keyA\", \"keyB\"))",
      "{\"key\":\"keyA\",\"value\":{\"foo\":\"foo\",\"bar\":\"bar\"}}",
      "{\"key\":\"keyB\",\"value\":{\"A\":\"123\",\"B\":\"c\"}}"
    ],
    "utterance": "Retrieve all objects with keys 'keyA' or 'keyB' anywhere in the data and label each result with its key and corresponding value.",
    "expressions": [
      ".. | objects | to_entries[] | select(.key == (\"keyA\", \"keyB\"))"
    ],
    "data": [
      {
        "input": {
          "doc": "foobar",
          "pages": [
            {
              "data": {
                "keyA": {
                  "foo": "foo",
                  "bar": "bar"
                },
                "keyB": {
                  "A": "123",
                  "B": "c"
                }
              }
            }
          ]
        },
        "output": [
          {
            "key": "keyA",
            "value": {
              "foo": "foo",
              "bar": "bar"
            }
          },
          {
            "key": "keyB",
            "value": {
              "A": "123",
              "B": "c"
            }
          }
        ]
      }
    ],
    "identifier": 72651790,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to preserve only a subset of the tags, not all of them, and pull them up into the main object, not leave them as nested.",
      "all the tags are optional, so some records will be missing some tags (e.g., `key2` in this example).",
      "You could instead filter `.tags` with `map(select(...))`",
      "jq '.[] | {\n            id,\n            key1:.tags | map(select(.Key | match(\"key1\")) | .Value)[0],\n            key2:.tags | map(select(.Key | match(\"key2\")) | .Value)[0]\n          }'",
      "As for a more idiomatic approach you could use `from_entries`",
      "jq '.[] | . + (.tags | from_entries) | { id, key1, key2 }'"
    ],
    "utterance": "For each record, extract the id and promote values of tags with Key equal to key1 and key2 to top-level fields, setting missing tag values to null.",
    "expressions": [
      ".[] | { id, key1:.tags | map(select(.Key | match(\"key1\")) | .Value)[0], key2:.tags | map(select(.Key | match(\"key2\")) | .Value)[0] }",
      ".[] | . + (.tags | from_entries) | { id, key1, key2 }"
    ],
    "data": [
      {
        "input": [
          {
            "id": "1234",
            "comment": "whatevz",
            "tags": [
              {
                "Key": "key1",
                "Value": "someval-1234-k1"
              },
              {
                "Key": "key2",
                "Value": "someval-1234-k2"
              },
              {
                "Key": "key3",
                "Value": "someval-1234-k3"
              }
            ]
          },
          {
            "id": "4567",
            "comment": "pff",
            "tags": [
              {
                "Key": "key1",
                "Value": "someval-4567-k1"
              },
              {
                "Key": "key3",
                "Value": "someval-4567-k3"
              }
            ]
          }
        ],
        "output": [
          {
            "id": "1234",
            "key1": "someval-1234-k1",
            "key2": "someval-1234-k2"
          },
          {
            "id": "4567",
            "key1": "someval-4567-k1",
            "key2": null
          }
        ]
      }
    ],
    "identifier": 72653076,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to find which numbers are in array 1 but not array 2. So in this case it would be",
      "$OBSOLETE_SLOT_IDS = [1,2,5]",
      "A shorter approach that uses `jq` to get the difference of the two arrays:",
      "jq -n --argjson a \"$slot_ids\" --argjson b \"$pr_ids\" '$a - $b | .[]'"
    ],
    "utterance": "Select numbers that exist in the first array variable but not in the second array variable, producing [1,2,5] for the example [1,2,3,4,5] and [3,4].",
    "expressions": [
      "$a - $b",
      "jq -n --argjson a \"$slot_ids\" --argjson b \"$pr_ids\" '$a - $b | .[]'"
    ],
    "data": [
      {
        "input": {
          "slot_ids": [
            1,
            2,
            3,
            4,
            5
          ],
          "pr_ids": [
            3,
            4
          ]
        },
        "output": [
          1,
          2,
          5
        ]
      }
    ],
    "identifier": 72665744,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": "The task refers to 'the first array variable' and 'the second array variable', but in the example, these are fields in an object. If the jq program is to be written for this object structure, the task is valid. If the task expects jq variables (e.g., --argjson), then it would be invalid. Based on the example, it is valid.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I want the values to be enclosed in single quotes.",
      "If you need it to escape for the shell, just use `@sh`:",
      "<cli> | jq -r '.[] | \"\\(.Name)=\\(.Value | @sh)\"'"
    ],
    "utterance": "Produce output in the format MY_KEY='MY_VALUE', with the value part enclosed in single quotes.",
    "expressions": [
      ".[] | \"\u000027(.Name)=\u000027(.Value | @sh)\u000027\"",
      ".[] | \"\\(.Name)=\\(.Value | @sh)\""
    ],
    "identifier": 72667395,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to be able to add automatically some new commands inside **\"script\":{}** using jq",
      "I want to add \n\n    \"prettier\": \"prettier --fix\"\n    \"eslint\": \"eslint run --all\"\n    \"foo\": \"bar\"",
      "jq '.scripts += input' package.json <(cat << EOF\n{\n  \"prettier\": \"prettier --fix\",\n  \"eslint\": \"eslint run --all\",\n  \"foo\": \"bar\"\n}\nEOF\n)",
      "jq '.scripts += {\n  \"prettier\": \"prettier --fix\",\n  \"eslint\": \"eslint run --all\",\n  \"foo\": \"bar\"\n}' package.json"
    ],
    "utterance": "Add the commands 'prettier': 'prettier --fix', 'eslint': 'eslint run --all', and 'foo': 'bar' to the scripts object.",
    "expressions": [
      ".scripts += {\"prettier\": \"prettier --fix\", \"eslint\": \"eslint run --all\", \"foo\": \"bar\"}",
      ".scripts += input"
    ],
    "data": [
      {
        "input": {
          "name": "test-project",
          "version": "1.0.0",
          "description": "A Vue.js project",
          "main": "src/main.js",
          "private": true,
          "scripts": {
            "dev": "webpack-dev-server --inline --progress --config build/webpack.dev.conf.js",
            "start": "npm run dev",
            "unit": "jest --config test/unit/jest.conf.js --coverage",
            "test": "npm run unit",
            "lint": "eslint --ext .js,.vue src test/unit",
            "build": "node build/build.js"
          },
          "dependencies": {
            "vue": "^2.5.2"
          }
        },
        "output": {
          "name": "test-project",
          "version": "1.0.0",
          "description": "A Vue.js project",
          "main": "src/main.js",
          "private": true,
          "scripts": {
            "dev": "webpack-dev-server --inline --progress --config build/webpack.dev.conf.js",
            "start": "npm run dev",
            "unit": "jest --config test/unit/jest.conf.js --coverage",
            "test": "npm run unit",
            "lint": "eslint --ext .js,.vue src test/unit",
            "build": "node build/build.js",
            "prettier": "prettier --fix",
            "eslint": "eslint run --all",
            "foo": "bar"
          },
          "dependencies": {
            "vue": "^2.5.2"
          }
        }
      }
    ],
    "identifier": 72673335,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have an array of array as following",
      "I'm looking for output like this\na /tmp/file1 2\na /tmp/file2 3\nb /tmp/tfile 4\nb /var/mfile 5",
      "What I have tried\ncat myfile.json | jq '.[] | \"\\(.name), \\(.devices[].path) \\(.devices[].files[].id)\"'",
      "and result is\n\"a, /tmp/file1 2\"\n\"a, /tmp/file2 2\"\n\"a, /tmp/file1 3\"\n\"a, /tmp/file2 3\"\n\"b, /tmp/tfile 4\"\n\"b, /var/mfile 4\"\n\"b, /tmp/tfile 5\"\n\"b, /var/mfile 5\"",
      "but I'm unable to select a key for the value is in parent map.",
      "jq -r '.[] | \"\\(.name) \\(.devices[] | \\(.path) \\(.files[].id))\"'"
    ],
    "utterance": "For each element, extract the name, each device's path, and each file's id, aligned so each row contains the name, corresponding path, and corresponding id.",
    "expressions": [
      ".[] | \"\u000b(.name) \u000b(.devices[] | \"\u000b(.path) \u000b(.files[].id)\")\""
    ],
    "data": [
      {
        "input": [
          {
            "devices": [
              {
                "files": [
                  {
                    "id": "2",
                    "type": "file"
                  }
                ],
                "path": "/tmp/file1"
              },
              {
                "files": [
                  {
                    "id": "3",
                    "type": "file"
                  }
                ],
                "path": "/tmp/file2"
              }
            ],
            "name": "a"
          },
          {
            "devices": [
              {
                "files": [
                  {
                    "id": "4",
                    "type": "file"
                  }
                ],
                "path": "/tmp/tfile"
              },
              {
                "files": [
                  {
                    "id": "5",
                    "type": "file"
                  }
                ],
                "path": "/var/mfile"
              }
            ],
            "name": "b"
          }
        ],
        "output": [
          "a /tmp/file1 2",
          "a /tmp/file2 3",
          "b /tmp/tfile 4",
          "b /var/mfile 5"
        ]
      }
    ],
    "identifier": 72683094,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Is it possible to do a general search over every single key? For example, if you search for \"horror\" it would return\n\n    {\n          \"Book1\": {\n            \"Genres\": [\n              \"horror\",\n              \"thriller\",\n              \"drama\"\n            ],\n            \"Description\": \"Extremely scary book.\",\n            \"Author\": \"Jane Doe\"\n          }\n    }\n\nas the word is found in the \"Genres\" or if you search for \"romantic\" it would return Book2 as the word is contained in the description. I want to avoid having to specify the key in which the value is found in but rather have a general search over the entire object.",
      "Use `..` to traverse the tree, `strings` to compare only strings, `any` to return true on first match, and `map_values` to retain the original tree structure.",
      "jq 'map_values(select(any(..; strings == \"horror\")))'",
      "To include substrings in the search, use `contains` instead:",
      "jq 'map_values(select(any(.. | strings; contains(\"romantic\"))))'"
    ],
    "utterance": "Return all top-level objects whose nested string values or array elements exactly match or contain the term 'horror' or 'romantic', without specifying any particular keys.",
    "expressions": [
      "map_values(select(any(..; strings == \"horror\")))",
      "map_values(select(any(.. | strings; contains(\"romantic\"))))"
    ],
    "data": [
      {
        "input": {
          "Book1": {
            "Genres": [
              "horror",
              "thriller",
              "drama"
            ],
            "Description": "Extremely scary book.",
            "Author": "Jane Doe"
          },
          "Book2": {
            "Genres": [
              "Romance",
              "drama"
            ],
            "Description": "Very romantic book.",
            "Author": "John Doe"
          }
        },
        "output": {
          "Book1": {
            "Genres": [
              "horror",
              "thriller",
              "drama"
            ],
            "Description": "Extremely scary book.",
            "Author": "Jane Doe"
          }
        }
      },
      {
        "input": {
          "Book1": {
            "Genres": [
              "horror",
              "thriller",
              "drama"
            ],
            "Description": "Extremely scary book.",
            "Author": "Jane Doe"
          },
          "Book2": {
            "Genres": [
              "Romance",
              "drama"
            ],
            "Description": "Very romantic book.",
            "Author": "John Doe"
          }
        },
        "output": {
          "Book2": {
            "Genres": [
              "Romance",
              "drama"
            ],
            "Description": "Very romantic book.",
            "Author": "John Doe"
          }
        }
      }
    ],
    "identifier": 72690208,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task description and examples are inconsistent: the description says to match 'horror' or 'romantic', but the examples only output one book at a time, not both. It is unclear if the output should include all matching objects or just one. The matching is also ambiguous: should it be case-insensitive? Should it match substrings or only exact matches? The task is missing information to unambiguously specify the transformation.",
      "other_environment": null
    }
  },
  {
    "context": [
      "Let\u2019s say I want to set **options.cylinders = \"v4\"**",
      "Let\u2019s replace the path with variable. It complains, but just to let you know where I\u2019ve been:",
      "path=\"options.cylinder\"\necho $JSON | jq --arg path \"$path\" --arg value \"$cyl\" '.$path = $value'",
      "Now  you would think the former would work. It succeeds but it *adds* the value with the entire path as key, instead of setting it as if it where the path.",
      "# Process with jq\njq \\\n  --null-input \\\n  --argjson json \"$json\" \\\n  --arg path \"$path\" \\\n  --arg value \"$cyl\" \\\n  '$json | setpath($path / \".\"; $value)'",
      "`setpath(path_array; value)` sets the value at path with path elements as an array.",
      "So the `$path` string is: `/` divided by its `.` dots, to turn it into an array."
    ],
    "utterance": "Set the value at an arbitrary nested key specified by a dot-delimited string variable, such as setting 'options.cylinders' to 'v4'.",
    "expressions": [
      "$json | setpath($path / \".\"; $value)"
    ],
    "data": [
      {
        "input": {
          "make": "ford",
          "model": "ranger",
          "year": "2020",
          "options": {
            "cylinders": "v6",
            "cab": "true"
          }
        },
        "output": {
          "make": "ford",
          "model": "ranger",
          "year": "2020",
          "options": {
            "cylinders": "v4",
            "cab": "true"
          }
        }
      }
    ],
    "identifier": 72691192,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "- Select the subarray with value \"brian\" at index 0",
      "- Change the value at index [2] in the selected array to \"cats\"",
      "- Return the complete modified array",
      "Update the required value by specifying the array indices and using the `|=` update select construct",
      "map(select(.[0] == \"brian\")[2] |= \"cats\" )"
    ],
    "utterance": "Change the value at index 2 to 'cats' in the subarray where index 0 is 'brian', and return the modified outer array.",
    "expressions": [
      "map(select(.[0] == \"brian\")[2] |= \"cats\")",
      "map(select(any(.[]; . == \"brian\"))[2] |= \"cats\")"
    ],
    "data": [
      {
        "input": [
          [
            "dave",
            "likes",
            "rabbits"
          ],
          [
            "brian",
            "likes",
            "fish"
          ]
        ],
        "output": [
          [
            "dave",
            "likes",
            "rabbits"
          ],
          [
            "brian",
            "likes",
            "cats"
          ]
        ]
      }
    ],
    "identifier": 72703763,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have the following JSON;",
      "I\u2019m wanting to add 2 new values to each array above using JQ, `dimension` and `date`, but `date` needs to be a copy of the existing key value `start`. The expected output is as below;",
      "The jq I have at present can add the new key `dimension`, but I can\u2019t figure out how to copy `start -> date`",
      "Just create the new key / value pairs.",
      "jq 'map(.date = .start | .dimension = \"new\")' input.json",
      "map( . += { date: .start, dimension: \"new\" } )",
      "map( .date = .start | .dimension = \"new\" )"
    ],
    "utterance": "Add a date key to each object in the array with the value from the object's start key, and add a dimension key with the value new.",
    "expressions": [
      "map(.date = .start | .dimension = \"new\")",
      "map(. += { date: .start, dimension: \"new\" })"
    ],
    "data": [
      {
        "input": [
          {
            "id": 1,
            "start": "2022-06-20",
            "result": 24
          },
          {
            "id": 2,
            "start": "2022-06-21",
            "result": 56
          },
          {
            "id": 3,
            "start": "2022-06-21",
            "result": 78
          }
        ],
        "output": [
          {
            "id": 1,
            "start": "2022-06-20",
            "result": 24,
            "date": "2022-06-20",
            "dimension": "new"
          },
          {
            "id": 2,
            "start": "2022-06-21",
            "result": 56,
            "date": "2022-06-21",
            "dimension": "new"
          },
          {
            "id": 3,
            "start": "2022-06-21",
            "result": 78,
            "date": "2022-06-21",
            "dimension": "new"
          }
        ]
      }
    ],
    "identifier": 72703865,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am using `jq --arg e1 ${domain[@]}` to set the domain variable to e1 environment variable.",
      "You can use `--args` instead which populates `$ARGS.positional` with a list of remaining arguments.",
      "jq '.Items = $ARGS.positional' <<<'{\"Items\":[]}' --args \"${domain[@]}\"",
      "You could turn the bash array into a string and separate the items by, say, a newline character, then import the string using `--arg`, and split it up again into a jq array using `/`:",
      "jq -n --arg e1 \"$(printf '%s\\n' \"${domain[@]}\")\" '{Items: ($e1 / \"\\n\")}'",
      "You can use `--args` to pass the array to `jq` as arguments, preserving the list/array structure:",
      "jq -n --args '.Items = $ARGS.positional' \"${domain[@]}\""
    ],
    "utterance": "Populate the Items array with all values from a bash array variable such that each entry is a quoted string value in the output array.",
    "expressions": [
      "jq '.Items = $ARGS.positional' <<<'{\"Items\":[]}' --args \"${domain[@]}\"",
      "jq -n --arg e1 \"$(printf '%s\\n' \"${domain[@]}\")\" '{Items: ($e1 / \"\\n\")}'",
      "jq -n --args '{Items: $ARGS.positional}' \"${domain[@]}\""
    ],
    "data": [
      {
        "input": {
          "Items": []
        },
        "output": {
          "Items": [
            "*.abc",
            "*.xyz",
            "*.123"
          ]
        }
      }
    ],
    "identifier": 72712592,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have following JSON input.  I want to convert this to other json format using JQ.",
      "Output format I want",
      "{\r\n    \"env\": [\r\n        {\r\n            \"name\": \"key1\",\r\n            \"value\": \"value1\"\r\n        },\r\n        {\r\n            \"name\": \"key2\",\r\n            \"value\": \"value2\"\r\n        },\r\n        {\r\n            \"name\": \"key3\",\r\n            \"value\": \"value3\"\r\n        }\r\n    ]\r\n}",
      "Using [`to_entries[]`]...",
      "{ env: [ to_entries[] | { name: .key, value } ] }",
      "Will get the desired output as you can [try in this online demo](https://jqplay.org/s/-LJqKQSoqYX)"
    ],
    "utterance": "Transform an object with arbitrary key-value pairs into an object with an env array of {name, value} objects for each key-value pair.",
    "expressions": [
      "{ env: [ to_entries[] | { name: .key, value } ] }"
    ],
    "data": [
      {
        "input": {
          "key1": "value1",
          "key2": "value2",
          "key3": "value3"
        },
        "output": {
          "env": [
            {
              "name": "key1",
              "value": "value1"
            },
            {
              "name": "key2",
              "value": "value2"
            },
            {
              "name": "key3",
              "value": "value3"
            }
          ]
        }
      }
    ],
    "identifier": 72718796,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How can i get the `state` for rest of the JSON under `services` only if it matches `\"service_id\": \"test@*\"`",
      ".services[] | select(.service_id | test(\"test@*\")).state",
      "Will output `.state` where `service_id` matches the `test@*` regex."
    ],
    "utterance": "Retrieve the state of entries in services where service_id matches the pattern test@*.",
    "expressions": [
      ".services[] | select(.service_id | test(\"test@*\")).state"
    ],
    "data": [
      {
        "input": {
          "router": {
            "node_id": "ip-1-2-3-4",
            "state": "HEALTHY",
            "message": "OK"
          },
          "services": [
            {
              "service_id": "test@1234",
              "state": "HEALTHY",
              "message": "OK"
            },
            {
              "service_id": "something@45678",
              "state": "NOT HEALTHY",
              "message": "OK"
            }
          ]
        },
        "output": "HEALTHY"
      }
    ],
    "identifier": 72719031,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I\u2019m trying to find the assets that failed (```.executions[].operationStatus.state != \"SUCCESS\"```), for each wave and identify those that are recurrent for the same reason on the same operation, meaning that failed more than once.",
      "I would like to have an output like:\n\n```\n{ assetId: myhostname,\n  error: 'NET_ERROR',\n  operationName: 'Patch Linux OS'\n  failure_count: 'n'\n}\n```",
      "The following should work:\n\n```\n.waves[].assets[]\n| .assetId as $assetId\n| .executions\n| map(select(.operationStatus.state != \"SUCCESS\"))\n| map({ $assetId, error: .operationStatus.error, operationName })\n| group_by(.)\n| .[]\n| first + { failure_count: length }\n| select(.failure_count > 1)\n```"
    ],
    "utterance": "Find assets that failed more than once with the same error and operation name, include assetId, error, operationName, and number of failures.",
    "expressions": [
      ".waves[].assets[]\n| .assetId as $assetId\n| .executions\n| map(select(.operationStatus.state != \"SUCCESS\"))\n| map({ $assetId, error: .operationStatus.error, operationName })\n| group_by(.)\n| .[]\n| first + { failure_count: length }\n| select(.failure_count > 1)"
    ],
    "identifier": 72721625,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need a performance-efficient way to append respective `category` to each object in `weights`, resulting in output:",
      "Ideally I would like to deal with a not-that-SQL object for categories: full input looking like this",
      "...still getting a similar output:",
      "For the updated structure, the `INDEX` of `.categories` needs to be tweaked (e.g. employing `to_entries`) in order to provide the `.key` as `.category` for each array item:",
      "~~~sh\n{weights: [JOIN(\n  # INDEX(.categories[]; .name);    # This has been replaced with:\n    INDEX(.categories | to_entries[]; .value[]) | .[] |= {category: .key};\n.weigths[]; .name; add)]}\n~~~",
      "~~~json\n{\n  \"weights\": [\n    {\n      \"name\": \"apple\",\n      \"weigth\": 200,\n      \"category\": \"fruit\"\n    },\n    {\n      \"name\": \"orange\",\n      \"weigth\": 300,\n      \"category\": \"fruit\"\n    },\n    {\n      \"name\": \"tomato\",\n      \"weigth\": 100,\n      \"category\": \"vegetable\"\n    },\n    {\n      \"name\": \"spinach\",\n      \"weigth\": 50,\n      \"category\": \"vegetable\"\n    }\n  ]\n}\n~~~"
    ],
    "utterance": "For each item in 'weigths', add a 'category' field by looking up the item name in the 'categories' object where each key is a category and each value is an array of names.",
    "expressions": [
      "{weights: [JOIN(INDEX(.categories | to_entries[]; .value[]) | .[] |= {category: .key}; .weigths[]; .name; add)]}"
    ],
    "data": [
      {
        "input": {
          "weigths": [
            {
              "name": "apple",
              "weigth": 200
            },
            {
              "name": "orange",
              "weigth": 300
            },
            {
              "name": "tomato",
              "weigth": 100
            },
            {
              "name": "spinach",
              "weigth": 50
            }
          ],
          "categories": {
            "fruit": [
              "apple",
              "orange"
            ],
            "vegetable": [
              "tomato",
              "spinach"
            ]
          }
        },
        "output": {
          "weights": [
            {
              "name": "apple",
              "weigth": 200,
              "category": "fruit"
            },
            {
              "name": "orange",
              "weigth": 300,
              "category": "fruit"
            },
            {
              "name": "tomato",
              "weigth": 100,
              "category": "vegetable"
            },
            {
              "name": "spinach",
              "weigth": 50,
              "category": "vegetable"
            }
          ]
        }
      }
    ],
    "identifier": 72723706,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The output field is 'weights' (with correct spelling), but the input field is 'weigths' (misspelled). The task does not specify whether to correct the spelling or preserve it, which is a missing information issue.",
      "other_environment": null
    }
  },
  {
    "context": [
      "jq '.require | to_entries | map(select(.key | contains(\"rapidstatic\")))' composer.json",
      "You could use `del` and provide the key names as follows:",
      "jq '.require |= del(.[keys[] | select(contains(\"rapidstatic\"))])' composer.json",
      "Or you could generate an array of path expressions and use `delpaths` on them:",
      "jq '.require |= delpaths([keys[] | select(contains(\"rapidstatic\")) | [.]])' composer.json"
    ],
    "utterance": "Remove all dependencies from the require object whose key includes the substring rapidstatic.",
    "expressions": [
      ".require |= del(.[keys[] | select(contains(\"rapidstatic\"))])",
      ".require |= delpaths([keys[] | select(contains(\"rapidstatic\")) | [.]])"
    ],
    "data": [
      {
        "input": {
          "name": "laravel/laravel",
          "type": "project",
          "description": "The Laravel Framework.",
          "keywords": [
            "framework",
            "laravel"
          ],
          "license": "MIT",
          "require": {
            "php": "^8.0",
            "kreait/laravel-firebase": "^4.1",
            "laravel/cashier-paddle": "^1.5",
            "laravel/framework": "^8.75",
            "laravel/sanctum": "^2.11",
            "laravel/socialite": "^5.5",
            "laravel/tinker": "^2.5",
            "livewire/livewire": "^2.5",
            "lorisleiva/laravel-actions": "^2.4",
            "owenvoke/blade-fontawesome": "*",
            "protonemedia/laravel-verify-new-email": "^1.5",
            "pusher/pusher-php-server": "^7.0",
            "rapidstatic/rapidstatic": "dev-master",
            "rapidstatic/scraper": "dev-master",
            "razorui/blade-application-ui": "^0.3.0"
          }
        },
        "output": {
          "name": "laravel/laravel",
          "type": "project",
          "description": "The Laravel Framework.",
          "keywords": [
            "framework",
            "laravel"
          ],
          "license": "MIT",
          "require": {
            "php": "^8.0",
            "kreait/laravel-firebase": "^4.1",
            "laravel/cashier-paddle": "^1.5",
            "laravel/framework": "^8.75",
            "laravel/sanctum": "^2.11",
            "laravel/socialite": "^5.5",
            "laravel/tinker": "^2.5",
            "livewire/livewire": "^2.5",
            "lorisleiva/laravel-actions": "^2.4",
            "owenvoke/blade-fontawesome": "*",
            "protonemedia/laravel-verify-new-email": "^1.5",
            "pusher/pusher-php-server": "^7.0",
            "razorui/blade-application-ui": "^0.3.0"
          }
        }
      }
    ],
    "identifier": 72726574,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to change the double qoutes to single qoutes for all nested level children/values of JSON using jq.",
      "and expected output is:",
      "{&quot;foo&quot;:&quot;bar&quot;,&quot;ping&quot;:{&#39;pong&#39;:&#39;ping-pong&#39;,&#39;n&#39;:[{&#39;a&#39;:1},{&#39;b&#39;:2}]}, &quot;alpha&quot;:&quot;beta&quot;,&quot;hello&quot;:&quot;there!&quot;,&quot;you&quot;:&quot;2&quot;}",
      "To turn any JSON object into a JSON-encoded string use jq's `tojson` or `@json` functions:",
      "jq '.ping |= tojson' example.json",
      "jq '.ping |= @json' example.json"
    ],
    "utterance": "Replace all double quotes with single quotes in keys and values of a specific child object while keeping the rest of the structure unchanged.",
    "expressions": [
      ".ping |= tojson | .ping |= gsub(\"\\\"\";\"'\")"
    ],
    "data": [
      {
        "input": {
          "foo": "bar",
          "ping": {
            "pong": "ping-pong",
            "n": [
              {
                "a": 1
              },
              {
                "b": 2
              }
            ]
          },
          "alpha": "beta",
          "hello": "there!",
          "you": "2"
        },
        "output": {
          "foo": "bar",
          "ping": "{'pong':'ping-pong','n':[{'a':1},{'b':2}]}",
          "alpha": "beta",
          "hello": "there!",
          "you": "2"
        }
      }
    ],
    "identifier": 72727036,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "When I run it though, it produces this error: jq: error (at <stdin>:0): Cannot iterate over null (null)",
      "getPage \\\n| jq -c '.items | .[] | {landing_id: .landing_id, submitted_at: .submitted_at, answers: .answers, email: .hidden.email}' \\",
      "That means your field `.items` doesn't exist or is null if interpreting the output of your curl as a JSON.",
      "the shorter version of your query:\n\n```.items[] | {landing_id, submitted_at, answers, email: .hidden.email}```"
    ],
    "utterance": "Extract landing_id, submitted_at, answers, and hidden.email from each item in the items array.",
    "expressions": [
      ".items[] | {landing_id, submitted_at, answers, email: .hidden.email}"
    ],
    "identifier": 72733302,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to find those ethernet interfaces, which are up.",
      "$ ip -d -j a s | jq 'map(select ((.link_type == \"ether\") and (.flags[] | contains (\"UP\")))) | map(.ifname)'",
      "But it returns two results,",
      "[\n  \"enp0s3\",\n  \"enp0s3\"\n]",
      "It seems to me that the `and` in the `select` behaves like an `or`. It returns those interfaces, which have `.link_type = \"ether\"` or `.flags[] | contains (\"UP\")`.",
      "You are actually expanding the data twice, once w.r.t. to the root level and once again with `.flags[]` leading to [combinatorial explosion](https://en.wikipedia.org/wiki/Combinatorial_explosion#Computing)",
      "The solution would be to move `.flags[]` outside select as below",
      "map(.flags[] as $x | select ((.link_type == \"ether\") and ( $x | contains (\"UP\"))))",
      "A simple lookup for just `UP` in case if you are _not_ running into the above explosion would be to just do",
      "map(select( (.link_type == \"ether\") and (.flags[] == \"UP\") ))",
      "Changing `.flags[] | contains (\"UP\")` to `.flags[] | . == \"UP\"` fixes the problem:",
      "$ ip -d -j a s | jq 'map (select (.link_type == \"ether\")) | map (select (.flags[] | . == \"UP\")) | map(.ifname)'"
    ],
    "utterance": "Find all network interfaces with link_type equal to ether and with at least one flag exactly UP.",
    "expressions": [
      "map(select((.link_type == \"ether\") and (.flags[] == \"UP\")))",
      "map(select(.link_type == \"ether\")) | map(select(.flags[] | . == \"UP\"))",
      "map(.flags[] as $x | select((.link_type == \"ether\") and ($x | contains(\"UP\"))))"
    ],
    "identifier": 72743279,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the structure of the input JSON (e.g., how interfaces, link_type, and flags are represented), so it is unclear how to write the jq filter.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I waanted to get only `name` whose `id = 3`",
      "yq -r '.[] | select(.id == \"id_3\" )' myres.json",
      "and output is",
      "{ \"id\": \"id_3\", \"name\": \"name3\", }",
      "I made few changes in query as follows according to my requirements.",
      "yq -r '.[] | select(.id==\"id_2\").name' s.txt"
    ],
    "utterance": "Return the name value for the object with id equal to id_3.",
    "expressions": [
      ".[] | select(.id == \"id_3\").name"
    ],
    "data": [
      {
        "input": [
          {
            "id": "id_1",
            "name": "default"
          },
          {
            "id": "id_2",
            "name": "name2"
          },
          {
            "id": "id_3",
            "name": "name3"
          }
        ],
        "output": "name3"
      }
    ],
    "identifier": 72745313,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "and I wish to extract only a set of keys (which are possibly nested), like `.a` and `.c[].d`, giving us the following output:",
      "{\n  \"a\": 1,\n  \"c\": [\n    {\n      \"d\": 1\n    },\n    {\n      \"d\": 2\n    }\n  ]\n}",
      "Implementing `pick` function to filter an object for specific keys:",
      "def pick(paths):\n  . as $root |\n    reduce path(paths) as $path\n      ({}; setpath($path; $root | getpath($path)));",
      "$ jq \"$(cat query.jq)\"' pick(.a, .c[].f, .c[].d)' test.json  \n{\n  \"a\": 1,\n  \"c\": [\n    {\n      \"f\": null,\n      \"d\": 1\n    },\n    {\n      \"f\": null,\n      \"d\": 2\n    }\n  ]\n}",
      "If we'd like non-existent paths to be omitted instead of set to `null`, we can add `haspath` function to check if a path exists inside an object like so:",
      "def haspath($path):\n  def h:\n    . as [$json, $p]\n    | (($p|length)==0) or\n      ($json | (has($p[0]) and ( [getpath([$p[0]]), $p[1:] ] | h)));\n  [., $path] | h;",
      "def pick(paths):\n  . as $root |\n    reduce path(paths) as $path\n      ({}; if $root|haspath($path) then . + setpath($path; $root | getpath($path)) else . end);",
      "$ jq \"$(cat query_haspath.jq)\"' pick(.a, .c[].f, .c[].d)' test.json \n{\n  \"a\": 1,\n  \"c\": [\n    {\n      \"d\": 1\n    },\n    {\n      \"d\": 2\n    }\n  ]\n}"
    ],
    "utterance": "Extract the values corresponding to keys 'a' and the 'd' field of each object in the 'c' list, omitting fields that do not exist.",
    "expressions": [
      "def haspath($path):\n  def h:\n    . as [$json, $p]\n    | (($p|length)==0) or\n      ($json | (has($p[0]) and ( [getpath([$p[0]]), $p[1:] ] | h)));\n  [., $path] | h;\ndef pick(paths):\n  . as $root |\n    reduce path(paths) as $path\n      ({}; if $root|haspath($path) then . + setpath($path; $root | getpath($path)) else . end);\npick(.a, .c[].d)"
    ],
    "data": [
      {
        "input": {
          "a": 1,
          "b": 2,
          "c": [
            {
              "d": 1,
              "e": 2
            },
            {
              "d": 2,
              "e": 3
            }
          ]
        },
        "output": {
          "a": 1,
          "c": [
            {
              "d": 1
            },
            {
              "d": 2
            }
          ]
        }
      }
    ],
    "identifier": 72745488,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Now I want the average price column of 2nd and 3rd columns in the end of each row\n(For the 1st row, 6.455 should be added in the end of row since 6.54 + 6.37 = 12.91 / 2 = 6.455)\nand \nI also want 1st column(Unix Time) to be changed to our local timestamp(Tokyo UTC+9)\njust like this style `2022/06/25 10:00:00`",
      "If `jq -r '.[] | .[:5] | @tsv'` produced your original output, try changing it to\n~~~sh\njq -r '\n  .[] | .[:5]\n  | .[0] |= (./1000 | strflocaltime(\"%Y/%m/%d %H:%M:%S\"))\n  | .[2,3] |= tonumber\n  | .[5] = (.[2] + .[3]) / 2\n  | @tsv\n' \n~~~\nThis updates column `0` by dividing it by 1000 to turn milliseconds into seconds, then applying `strflocaltime` with your desired format.\nThen it updates columns `2` and `3` to turn them into numbers, as then they are used to calculate column `5` by adding them up and dividing the sum by 2."
    ],
    "utterance": "Output each row with the first column as a Tokyo time formatted timestamp and append a new column containing the average of the third and fourth columns.",
    "expressions": [
      ".[] | .[:5]\n  | .[0] |= (./1000 | strflocaltime(\"%Y/%m/%d %H:%M:%S\"))\n  | .[2,3] |= tonumber\n  | .[5] = (.[2] + .[3]) / 2\n  | @tsv"
    ],
    "data": [
      {
        "input": [
          [
            1656118800000,
            "6.41000000",
            "6.54000000",
            "6.37000000",
            "6.49000000",
            "222382.87000000",
            1656122399999,
            "1436781.15290000",
            3073,
            "115733.77000000",
            "747659.36930000",
            "0"
          ]
        ],
        "output": "2022/06/25 10:00:00\t6.41000000\t6.54\t6.37\t6.49\t6.455"
      }
    ],
    "identifier": 72751857,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input format (is it a single array, an array of arrays, or lines of arrays?), nor does it specify the output format for multiple rows. Also, the example output does not match the input: the output has fewer columns and the formatting of the average is not fully specified.",
      "other_environment": null
    }
  },
  {
    "context": [
      "so dynamically build the matrix from the terraform.tfvars file",
      "and dynamically build the CFN JSON formats from either the matrix or similar method.",
      "So a GitHub matrix with the following JSON input/output\n{\n  \"include\": [\n    {\n      \"unique_prefix\": \"a-unique-value\",\n      \"base_region\": \"ap-southeast-2\",\n    }\n  ]\n}",
      "for input for a cloudformation Stack (--parameter-overrides file://param.json), the  result would be:\n[\n  {\n    \"ParameterKey\": \"unique_prefix\",\n    \"ParameterValue\": \"a-unique-value\"\n  },\n  {\n    \"ParameterKey\": \"base_region\",\n    \"ParameterValue\": \"ap-southeast-2\"\n  }\n]",
      "for CloudFormation StackSets (--parameters), the result would be:\n[\n  {\n    \"Key\": \"unique_prefix\",\n    \"Value\": \"a-unique-value\"\n  },\n  {\n    \"Key\": \"base_region\",\n    \"Value\": \"ap-southeast-2\"\n  }\n]"
    ],
    "utterance": "Convert an object with arbitrary key-value pairs into an array of objects with keys ParameterKey and ParameterValue mapping to the original keys and values, respectively.",
    "expressions": [
      "to_entries | map({ParameterKey: .key, ParameterValue: .value})"
    ],
    "data": [
      {
        "input": {
          "unique_prefix": "a-unique-value",
          "base_region": "ap-southeast-2"
        },
        "output": [
          {
            "ParameterKey": "unique_prefix",
            "ParameterValue": "a-unique-value"
          },
          {
            "ParameterKey": "base_region",
            "ParameterValue": "ap-southeast-2"
          }
        ]
      }
    ],
    "identifier": 72752290,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "jq -s '[0] as $a |.[1] as $b | $a * $b' old_env.json new_env.json | jq -c .",
      "my goal for the command is to merge two json output and assign it to the UPDATE_ENVVARS for it to be reused as an input for another command that will accept the json. Per testing, it came out empty when echo, when I execute the jq solo, the merge output is functional.",
      "UPDATE_ENVVARS=$$(jq -s '[0] as $$a |.[1] as $$b | $$a * $$b' old_env.json new_env.json | jq -c . )"
    ],
    "utterance": "Merge the contents of old_env.json and new_env.json using object update logic and output a compact merged result.",
    "expressions": [
      "jq -s '.[0] as $a | .[1] as $b | $a * $b' old_env.json new_env.json | jq -c ."
    ],
    "data": [
      {
        "input": {
          "old_env.json": {
            "Variables": {
              "HOST_URL": "https://old.com",
              "PORT": 8000
            }
          },
          "new_env.json": {
            "Variables": {
              "HOST_URL": "https://test.com"
            }
          }
        },
        "output": {
          "Variables": {
            "HOST_URL": "https://test.com",
            "PORT": 8000
          }
        }
      }
    ],
    "identifier": 72754461,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Now, I want those additional columns below in the end of columns",
      "1. Average of 7th column (Among all 48 rows. It should be -> 2.35297)",
      "2. Average of 7th column (Among first 12 hours(1-12 rows) only. It should be -> 2.68432)",
      "3. Average of 7th column (Among next 12 hours(13-24 rows) only. It should be -> 2.49099)",
      "4. Average of 7th column (Among next 12 hours(25-36 rows) only. It should be -> 2.13418)",
      "5. Average of 7th column (Among next 12 hours(37-48 rows) only. It should be -> 2.10239)",
      "6. The fixed string \"FLAG\" for all 48 rows",
      "Of course, it doesn't matter even if all the same numbers are comes in those columns among all 48 records."
    ],
    "utterance": "Add columns containing the averages of the 7th column over all 48 rows and for each group of 12 consecutive rows, as well as a fixed 'FLAG' column for each row.",
    "expressions": [
      "def averages(xs): [ (xs|map(.[6])|add/length) ]\n  + [ (xs|.[0:12]|map(.[6])|add/length) ]\n  + [ (xs|.[12:24]|map(.[6])|add/length) ]\n  + [ (xs|.[24:36]|map(.[6])|add/length) ]\n  + [ (xs|.[36:48]|map(.[6])|add/length) ];\n\n(. as $all |\n  averages($all) as $avgs |\n  $all[] | . + $avgs + [\"FLAG\"]\n)"
    ],
    "identifier": 72755580,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "The JSON like this:\r\n[\r\n  {\r\n    \"id\": 1,\r\n    \"names\": [\r\n      \"apple\",\r\n      \"google\"\r\n    ]\r\n  },\r\n  {\r\n    \"id\": 2,\r\n    \"names\": [\r\n      \"iphone\",\r\n      \"ipad\",\r\n      \"macbook\"\r\n    ]\r\n  }\r\n]",
      "expected output in tsv\r\n\r\n    1 apple\r\n    1 google\r\n    2 iphone\r\n    2 ipad\r\n    2 macbook",
      "jq -r 'map( \"\\(.id) \" + .names[] )[]'",
      "jq -r 'map( \"\\(.id) \" + .names[] ) | @tsv'",
      "jq -r 'map( \"\\(.id)\\t\" + .names[])[]'"
    ],
    "utterance": "For each object, output a line containing the id and each name, separated by a tab.",
    "expressions": [
      "map(\"\\(.id) \\(.names[])\")[]",
      "map(\"\\(.id)\\t\" + .names[])[]",
      "map(\"\\(.id) \" + .names[] )[]",
      "map(\"\\(.id) \" + .names[] ) | @tsv"
    ],
    "data": [
      {
        "input": [
          {
            "id": 1,
            "names": [
              "apple",
              "google"
            ]
          },
          {
            "id": 2,
            "names": [
              "iphone",
              "ipad",
              "macbook"
            ]
          }
        ],
        "output": [
          "1 apple",
          "1 google",
          "2 iphone",
          "2 ipad",
          "2 macbook"
        ]
      }
    ],
    "identifier": 72760968,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "what would be ????? look like to get the key by either one of json structure",
      "One way would be to do an explicit check of the type of the content within `keys` as",
      ".keys[] | if type == \"string\" then . elif type == \"object\" then .key else empty end",
      "You can use the [Destructuring Alternative Operator](https://stedolan.github.io/jq/manual/#DestructuringAlternativeOperator%3A%3F%2F%2F) `?//`:",
      ".keys as [{$key}] ?// [$key] | $key",
      "Just to add a further alternative as a mixture of the conciseness of @pmf's solution and the readability of @inian's solution (both great approaches):",
      ".[].keys|first|(.key? // .)"
    ],
    "utterance": "Extract the value '1234567890' from the first element of 'keys', which may be either an object with a 'key' field or a string.",
    "expressions": [
      ".keys[] | if type == \"string\" then . elif type == \"object\" then .key else empty end",
      ".keys as [{$key}] ?// [$key] | $key",
      ".[].keys|first|(.key? // .)"
    ],
    "data": [
      {
        "input": {
          "keys": [
            {
              "key": "1234567890"
            }
          ]
        },
        "output": "1234567890"
      },
      {
        "input": {
          "keys": [
            "1234567890"
          ]
        },
        "output": "1234567890"
      }
    ],
    "identifier": 72776480,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a JSON body with template to pass to an API endpoint in Postman/Newman as follows:",
      "Everything enclosed in double braces is a variable that needs to read from an external CSV file, which looks like: \r\n\r\n```\r\np,yyyymmdd,yyyy,mmdd\r\n1,19991231,1999,1231\r\n2,20001230,2000,1230\r\n```",
      "I'd ideally like it to take in a dynamic number of entries that append after the last.",
      "Ideally, I'd get rid of the last two columns in the CSV file (`yyyy` and `mmdd`) and have the code just do a slice as needed.",
      "what is a quick one-liner in Javascript or Python that can quickly give me the output I need so that I can just manually place it in the request body?"
    ],
    "utterance": "Transform an array of objects with keys 'p' and 'yyyymmdd' into a new array where for each object, 'var1a' is set to 'ABCDE', 'var2a' is constructed as 'http://url/site/{yyyy}/{mmdd}/{yyyymmdd}.zip', 'var3a' is '{yyyymmdd}', 'var3b' is 'GKG', 'var2c' is 'ABCDE', and 'var2d' is '{p}', extracting 'yyyy' and 'mmdd' from the 'yyyymmdd' string.",
    "expressions": [
      "[.[] | {var1a: \"ABCDE\", var1b: {var2a: (\"http://url/site/\" + (.yyyymmdd[:4]) + \"/\" + (.yyyymmdd[4:]) + \"/\" + .yyyymmdd + \".zip\"), var2b: {var3a: .yyyymmdd, var3b: \"GKG\"}, var2c: \"ABCDE\", var2d: (.p|tostring)}}]"
    ],
    "data": [
      {
        "input": [
          {
            "p": 1,
            "yyyymmdd": "19991231"
          },
          {
            "p": 2,
            "yyyymmdd": "20001230"
          }
        ],
        "output": [
          {
            "var1a": "ABCDE",
            "var1b": {
              "var2a": "http://url/site/1999/1231/19991231.zip",
              "var2b": {
                "var3a": "19991231",
                "var3b": "GKG"
              },
              "var2c": "ABCDE",
              "var2d": "1"
            }
          },
          {
            "var1a": "ABCDE",
            "var1b": {
              "var2a": "http://url/site/2000/1230/20001230.zip",
              "var2b": {
                "var3a": "20001230",
                "var3b": "GKG"
              },
              "var2c": "ABCDE",
              "var2d": "2"
            }
          }
        ]
      }
    ],
    "identifier": 72777022,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I just want to get the latest snapshot id (24274, in above case) by parsing the output using ```jq``` on windows.",
      "What I tried is as follows: - \r\n```\r\n... | jq \".snapshotsForStream[-1].id\"\r\n```",
      "But when I ran same command on a linux machine then it is giving me the correct output as ```24274```",
      "Accessing elements [with negative indices][1] was introduced in `jq-1.5`, you seem to be running a version before that which is really old."
    ],
    "utterance": "Retrieve the id of the last object in the snapshotsForStream array.",
    "expressions": [
      ".snapshotsForStream[-1].id"
    ],
    "data": [
      {
        "input": {
          "snapshotsForStream": [
            {
              "id": 24221
            },
            {
              "id": 24230
            },
            {
              "id": 24274
            }
          ],
          "code": null,
          "message": null
        },
        "output": 24274
      }
    ],
    "identifier": 72781115,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'm trying to add a variable while getting the info from a JSON file as shown below.",
      "email=$(cat profiles.json | jq '.central.[Need to add $stack variable here].email')",
      "password=$(cat profiles.json | jq '.central.[Need to add $stack variable here].password')",
      "I tried few things like `jq --arg v $stack '.central[$v]password*'` but it didnt work.",
      "$ jq --arg MYVAR one '.central[$MYVAR].password' profiles.json",
      "email=$(jq -r '.central[.generic.category].email' profiles.json)",
      "password=$(jq -r '.central[.generic.category].password' profiles.json)"
    ],
    "utterance": "Retrieve the email and password fields from the nested object in central whose key matches the value of generic.category.",
    "expressions": [
      ".central[.generic.category].email",
      ".central[.generic.category].password"
    ],
    "data": [
      {
        "input": {
          "central": {
            "one": {
              "tenant": "xxx-yyy-zzz",
              "email": "xyz@gmail.com",
              "password": "1111"
            },
            "two": {
              "tenant": "aaa-bbb-ccc",
              "email": "xyz@gmail.com",
              "password": "2222"
            }
          },
          "generic": {
            "username": "root",
            "password": "xyz",
            "project": "ABC",
            "category": "two"
          }
        },
        "output": {
          "email": "xyz@gmail.com",
          "password": "2222"
        }
      }
    ],
    "identifier": 72790882,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to generate a new array with just the `a` and `b` keys.",
      "I know I can do it like this:\n\n```\nARRAY | jq '. | map({ a: .a, b: .b })'\n```",
      "Is there a cleaner way to do this?",
      "You will have to type `a` and `b` once in order to specify which ones to keep:\n~~~sh\njq 'map({a,b})'\n~~~",
      "Alternatively, specify what to delete, here `.c` and `.d`:\n~~~sh\njq 'map(del(.c,.d))'\n~~~",
      "In any case, `.|` is also superfluous."
    ],
    "utterance": "Produce an array of objects, each containing only the keys a and b from each object in the input array.",
    "expressions": [
      "map({a,b})",
      "map(del(.c,.d))"
    ],
    "data": [
      {
        "input": [
          {
            "a": 1,
            "b": 2,
            "c": 3,
            "d": 4
          },
          {
            "a": 10,
            "b": 20,
            "c": 30,
            "d": 40
          },
          {
            "a": 100,
            "b": 200,
            "c": 300,
            "d": 400
          }
        ],
        "output": [
          {
            "a": 1,
            "b": 2
          },
          {
            "a": 10,
            "b": 20
          },
          {
            "a": 100,
            "b": 200
          }
        ]
      }
    ],
    "identifier": 72791903,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am using the following to extract DNS Name, LB Name and Scheme for each LB",
      "Is there a way in JQ to print my values as a CSV or at least insert a space after every iteration of the loop ?",
      "When I add @csv I get the following error \"string (\"XXXX...) cannot be csv-formatted, only array\"",
      "`@csv` expects an array, so construct one with your desired fields in it:\n~~~sh\njq -r '.LoadBalancers[] | [.DNSName, .LoadBalancerName, .Scheme] | @csv'\n~~~",
      "Alternatively, construct each output line yourself by formatting a string to your likings:\n~~~sh\njq -r '.LoadBalancers[] | \"\\(.DNSName): \\(.LoadBalancerName), \\(.Scheme)\"'\n~~~"
    ],
    "utterance": "Print DNSName, LoadBalancerName, and Scheme for each Load Balancer as a CSV line.",
    "expressions": [
      ".LoadBalancers[] | [.DNSName, .LoadBalancerName, .Scheme] | @csv",
      ".LoadBalancers[] | \"\\(.DNSName): \\(.LoadBalancerName), \\(.Scheme)\""
    ],
    "data": [
      {
        "input": {
          "LoadBalancers": [
            {
              "LoadBalancerArn": "arn:aws:elasticloadbalancing:us-west-1:XXXXXXXXXXXXXXXXXXXXXXXX",
              "DNSName": "DNS-XXXXXXXXXXXXXXXXXXXXXXXX.elb.us-west-1.amazonaws.com",
              "CanonicalHostedZoneId": "xxxxxxxxx",
              "CreatedTime": "2021-10-24T07:37:29.568000+00:00",
              "LoadBalancerName": "LBNAME-xxxxxxxxx",
              "Scheme": "internal",
              "VpcId": "vpc-XXXXXXXXXX",
              "State": {
                "Code": "active"
              },
              "Type": "network",
              "AvailabilityZones": [
                {
                  "ZoneName": "us-west-1a",
                  "SubnetId": "XXXXXXXXXX",
                  "LoadBalancerAddresses": []
                },
                {
                  "ZoneName": "us-west-1c",
                  "SubnetId": "XXXXXXXXXX",
                  "LoadBalancerAddresses": []
                }
              ],
              "IpAddressType": "ipv4"
            },
            {
              "LoadBalancerArn": "arn:aws:elasticloadbalancing:us-west-1:XXXXXXXXXXXXXXXXXXXXXXXX",
              "DNSName": "DNS-XXXXXXXXXX.elb.us-west-1.amazonaws.com",
              "CanonicalHostedZoneId": "XXXXXXXXXX",
              "CreatedTime": "2022-01-05T14:14:14.570000+00:00",
              "LoadBalancerName": "LBNAME-XXXXXXXXXXXXXXXXXXXXXXXX",
              "Scheme": "internal",
              "VpcId": "vpc-XXXXXXXXXX",
              "State": {
                "Code": "active"
              },
              "Type": "network",
              "AvailabilityZones": [
                {
                  "ZoneName": "us-west-1c",
                  "SubnetId": "subnet-XXXXXXXXXX",
                  "LoadBalancerAddresses": []
                },
                {
                  "ZoneName": "us-west-1a",
                  "SubnetId": "subnet-ZZZZZZZ",
                  "LoadBalancerAddresses": []
                }
              ],
              "IpAddressType": "ipv4"
            }
          ]
        },
        "output": [
          "\"DNS-XXXXXXXXXXXXXXXXXXXXXXXX.elb.us-west-1.amazonaws.com\",\"LBNAME-xxxxxxxxx\",\"internal\"",
          "\"DNS-XXXXXXXXXX.elb.us-west-1.amazonaws.com\",\"LBNAME-XXXXXXXXXXXXXXXXXXXXXXXX\",\"internal\""
        ]
      }
    ],
    "identifier": 72792751,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to call another command inside of jq query.",
      "I need to transform to:",
      "{\n   hostA: \"123.123.123.123\",\n   hostB: \"132.132.132.132\"\n}",
      "Applying this command on every value: `dig +short $value`",
      "So, each host is transformed to IP.",
      "You can use bash arrays and iterate over it in a loop:"
    ],
    "utterance": "Replace each object's value with the result of running the dig +short command on that value.",
    "expressions": [
      "jq -r 'to_entries[] | \"\\(.key) $(dig +short \\(.value))\"' input.json | awk '{print \"\\\"\"$1\"\\\": \\\"\"$2\"\\\",\"}' | sed '$s/,$//' | awk 'BEGIN{print \"{\"}{print}{END{print \"}\"}}'"
    ],
    "data": [
      {
        "input": {
          "hostA": "somedomain.com",
          "hostB": "anotherdomain.com"
        },
        "output": {
          "hostA": "123.123.123.123",
          "hostB": "132.132.132.132"
        }
      }
    ],
    "identifier": 72795402,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task requires running an external command (`dig +short`) for each value, which is not possible with jq alone. The output depends on external system state (DNS resolution), and jq does not natively support invoking shell commands or integrating their results into the JSON transformation.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a JSON object that looks like this:\r\n\r\n    [{\"name\":\"NAME_1\"},\"NAME_2\"]\r\n\r\nI would like an output of\r\n\r\n    [\"NAME_1\", \"NAME_2\"]",
      "Some of the entries in the array are an object with a key \"name\" and some are just a string of the name. I am trying to extract an array of the names.",
      "jq '[ .[] | if (.|type) == \"object\" then .name else . end ]'",
      "jq '[ .[] | .name? // . ]'",
      "jq 'map( .name?, scalars )'"
    ],
    "utterance": "Extract an array where each entry is the value of the \"name\" key if the element is an object, or the element itself if it is a string, from an array containing both objects and strings.",
    "expressions": [
      "[ .[] | if (.|type) == \"object\" then .name else . end ]",
      "[ .[] | .name? // . ]",
      "map( .name?, scalars )"
    ],
    "data": [
      {
        "input": [
          {
            "name": "NAME_1"
          },
          "NAME_2"
        ],
        "output": [
          "NAME_1",
          "NAME_2"
        ]
      }
    ],
    "identifier": 72801812,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I tried many things to pick up all the values in *relatives* from JSON but didn&#39;t find how to make it works.",
      "I need to do it with the `export ID=1` at first, I can only edit the second line with curl and jq.",
      "`curl 'url.json' | jq -r '.ID.relatives'`",
      "\"connections\": {",
      "  \"relatives\": \"Marlo Chandler-Jones (wife); Polly (aunt); Mrs. Chandler (mother-in-law); Keith Chandler, Ray Chandler, three unidentified others (brothers-in-law)\""
    ],
    "utterance": "Extract the value of the 'relatives' field under 'connections'.",
    "expressions": [
      ".connections.relatives"
    ],
    "data": [
      {
        "input": {
          "biography": {
            "fullName": "Richard Milhouse Jones",
            "placeOfBirth": "Scarsdale, Arizona",
            "firstAppearance": "Hulk Vol 2 #2 (April, 2008) (as A-Bomb)",
            "publisher": "Marvel Comics"
          },
          "connections": {
            "groupAffiliation": "Hulk Family; Excelsior (sponsor), Avengers (honorary member)",
            "relatives": "Marlo Chandler-Jones (wife); Polly (aunt); Mrs. Chandler (mother-in-law); Keith Chandler, Ray Chandler, three unidentified others (brothers-in-law)"
          }
        },
        "output": "Marlo Chandler-Jones (wife); Polly (aunt); Mrs. Chandler (mother-in-law); Keith Chandler, Ray Chandler, three unidentified others (brothers-in-law)"
      }
    ],
    "identifier": 72807630,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to prepare a query using the following fields:",
      "Patient Unique ID number.",
      "Patient First Name.",
      "Patient Last Name.",
      "Lab Test Date.",
      "HgbA1c Level.",
      "LDL Level.",
      "Creatinine Level.",
      "When using [object identifiers](https://stedolan.github.io/jq/manual/#ObjectIdentifier-Index%3A.foo%2C.foo.bar), you have to provide the field names exactly as used in the JSON file. Also, if they contain special characters such as white spaces (or start with a digit), surround them with double quotes for clarification:",
      "jq -r '[\n  .Patient.\"Unique ID\",\n  .Patient.Name.\"Patient First Name\",\n  .Patient.Name.\"Patient Last Name\",\n  .Patient.Labs.LDLCholesterol.LDLResultDate,\n  .Patient.IntakeCriteria.InitialHgbA1c,\n  .Patient.Labs.LDLCholesterol.LDLLevel,\n  .Patient.Labs.SerumCreatinine.CreatinineLevel\n] | @csv' data.json > csvtest.out",
      "jq -r '[.Patient |\n  .\"Unique ID\",\n  (.Name | .\"Patient First Name\", .\"Patient Last Name\"),\n  .Labs.LDLCholesterol.LDLResultDate,\n  .IntakeCriteria.InitialHgbA1c,\n  (.Labs | .LDLCholesterol.LDLLevel, .SerumCreatinine.CreatinineLevel)\n] | @csv' data.json > csvtest.out",
      "163983,\"James\",\"Manchester\",\"5/6/2013\",8.1,200,0.9"
    ],
    "utterance": "Extract the patient's unique ID, first name, last name, LDL cholesterol test date, initial HgbA1c value, LDL cholesterol level, and serum creatinine level as a CSV row.",
    "expressions": [
      "[.Patient.\"Unique ID\", .Patient.Name.\"Patient First Name\", .Patient.Name.\"Patient Last Name\", .Patient.Labs.LDLCholesterol.LDLResultDate, .Patient.IntakeCriteria.InitialHgbA1c, .Patient.Labs.LDLCholesterol.LDLLevel, .Patient.Labs.SerumCreatinine.CreatinineLevel] | @csv",
      "[.Patient | .\"Unique ID\", (.Name | .\"Patient First Name\", .\"Patient Last Name\"), .Labs.LDLCholesterol.LDLResultDate, .IntakeCriteria.InitialHgbA1c, (.Labs | .LDLCholesterol.LDLLevel, .SerumCreatinine.CreatinineLevel)] | @csv"
    ],
    "data": [
      {
        "input": {
          "Patient": {
            "Name": {
              "Patient First Name": "James",
              "Patient Middle Name": "Adam",
              "Patient Last Name": "Manchester"
            },
            "Unique ID": 163983,
            "Demographics": {
              "Sex": "M",
              "Birthdate": "7/24/1940"
            },
            "IntakeCriteria": {
              "DmDxDate": "8/25/2012",
              "InitialHgbA1c": 8.1,
              "Comorbidity": [
                "Diabetes Mellitus",
                "CHF",
                "DVT"
              ]
            },
            "Labs": {
              "LDLCholesterol": {
                "LDLResultDate": "5/6/2013",
                "LDLLevel": 200
              },
              "SerumCreatinine": {
                "CreatinineResultDate": "11/1/2016",
                "CreatinineLevel": 0.9
              }
            },
            "CareLocation": {
              "Facility Name": "East Side Clinic",
              "Facility Contact Name": "Mary Silverman",
              "Facility Contact Phone": "618-348-1891"
            }
          }
        },
        "output": "163983,\"James\",\"Manchester\",\"5/6/2013\",8.1,200,0.9"
      }
    ],
    "identifier": 72809543,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I wanna merge them and create another hierarchy(ja/en) for the duplicate key like:",
      "{\n    \"link_generate\": {\n    \t\"en\": \"Randomise\",\n    \t\"ja\": \"\u30e9\u30f3\u30c0\u30e0\u5316\u3059\u308b\"\n\t},\n    \"label_apple\": {\n    \t\"en\": \"Apple\",\n    \t\"ja\": \"\u30ea\u30f3\u30b4\"\n    }\n}",
      "jq -n --slurpfile ja ja.json --slurpfile en en.json '[ $en[0] | keys[] | { (.):{ en:$en[0][.], ja:$ja[0][.] } } ] | add'",
      "jq -n 'reduce (inputs | to_entries[] | [input_filename[:2], .]) as [$lang, $ent] ({}; setpath([$ent.key, $lang]; $ent.value))' ja.json en.json",
      "jq -n 'reduce ((\"ja\",\"en\") as $lang | input | to_entries[] | [$lang, .]) as [$lang, $ent] ({}; setpath([$ent.key, $lang]; $ent.value))' ja.json en.json"
    ],
    "utterance": "Group corresponding values from two files under their language codes for each key, so that each key has an object with 'en' and 'ja' as properties.",
    "expressions": [
      "jq -n --slurpfile ja ja.json --slurpfile en en.json '[ $en[0] | keys[] | { (.):{ en:$en[0][.], ja:$ja[0][.] } } ] | add'",
      "jq -n 'reduce (inputs | to_entries[] | [input_filename[:2], .]) as [$lang, $ent] ({}; setpath([$ent.key, $lang]; $ent.value))' ja.json en.json",
      "jq -n 'reduce ((\"ja\",\"en\") as $lang | input | to_entries[] | [$lang, .]) as [$lang, $ent] ({}; setpath([$ent.key, $lang]; $ent.value))' ja.json en.json"
    ],
    "data": [
      {
        "input": {
          "ja.json": {
            "link_generate": "\u30e9\u30f3\u30c0\u30e0\u5316\u3059\u308b",
            "label_apple": "\u30ea\u30f3\u30b4"
          },
          "en.json": {
            "link_generate": "Randomise",
            "label_apple": "Apple"
          }
        },
        "output": {
          "link_generate": {
            "en": "Randomise",
            "ja": "\u30e9\u30f3\u30c0\u30e0\u5316\u3059\u308b"
          },
          "label_apple": {
            "en": "Apple",
            "ja": "\u30ea\u30f3\u30b4"
          }
        }
      }
    ],
    "identifier": 72814666,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Here, `jq '[.[].color] | unique'` give the same output as `jq 'map(.color) | unique'`",
      "`jq '[.[] | has(\"name\")]'` give the same output as `jq '. | map(has(\"name\"))'`"
    ],
    "utterance": "Extract a list of unique color values from an array of fruit objects.",
    "expressions": [
      "[.[].color] | unique",
      "map(.color) | unique"
    ],
    "data": [
      {
        "input": [
          {
            "name": "apple",
            "color": "green",
            "price": 1.2
          },
          {
            "name": "banana",
            "color": "yellow",
            "price": 0.5
          },
          {
            "name": "kiwi",
            "color": "green",
            "price": 1.25
          }
        ],
        "output": [
          "green",
          "yellow"
        ]
      }
    ],
    "identifier": 72819899,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "my goal is to get a list of all repositories that have 0 pull requests so that I can act on those (archive).",
      "How can I filter the list of repositories to only show the repository name where `pullRequests.totalCount = 0`",
      "If you are only interested in the names, add that to the mapping:\n~~~sh\njq 'map(select(.pullRequests.totalCount == 0).name)'\n~~~\n~~~json\n[\n  \"repository-4\"\n]\n~~~",
      "If you want the names as a list of newline-separated raw texts rather than a JSON array of strings, iterate over the elements and use the `--raw-output` or `-r` for the output:\n~~~sh\njq -r '.[] | select(.pullRequests.totalCount == 0).name'\n~~~\n~~~json\nrepository-4\n~~~"
    ],
    "utterance": "List the names of repositories whose pullRequests.totalCount equals 0.",
    "expressions": [
      "map(select(.pullRequests.totalCount == 0).name)",
      ".[] | select(.pullRequests.totalCount == 0).name"
    ],
    "data": [
      {
        "input": [
          {
            "name": "repository-1",
            "pullRequests": {
              "totalCount": 129
            }
          },
          {
            "name": "repository-2",
            "pullRequests": {
              "totalCount": 1143
            }
          },
          {
            "name": "repository-3",
            "pullRequests": {
              "totalCount": 78
            }
          },
          {
            "name": "repository-4",
            "pullRequests": {
              "totalCount": 0
            }
          }
        ],
        "output": [
          "repository-4"
        ]
      }
    ],
    "identifier": 72834805,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How can I change \\r\\n to \\n with jq please ?",
      "walk(if type == \"string\" then gsub(\"\\r\\n\"; \"\\n\") else . end)"
    ],
    "utterance": "Replace all occurrences of carriage return and line feed (\\r\\n) with line feed (\\n) in all string values at any depth.",
    "expressions": [
      "walk(if type == \"string\" then gsub(\"\\r\\n\"; \"\\n\") else . end)"
    ],
    "identifier": 72854730,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "need help to parse the JSON data received from Oracle Integration Cloud. The expected output is mentioned below alongwith the command i am trying to use.",
      "Below is the modified one. Please suggest for any better options.",
      "[   \r\n { id: .id },   \r\n    { \r\n\t   integrations: [\r\n\t   .integrations[]|{\r\n\t       id: .id,\r\n\t\t   code: .code,\r\n\t\t     dependencies: {\r\n\t\t\t  connections: (.dependencies.connections//[]|map({id,status})),\r\n\t\t\t  lookups: (.dependencies.lookups//[]|map({name,status})),\r\n\t\t\t  certificates: (.dependencies.certificates//[]|map({id,status})),\r\n\t\t\t  libraries: (.dependencies.libraries//[]|map({code,status,version})), \t\t  \r\n\t\t\t  integrations: (.dependencies.integrations//[]|map({code,version}))         \r\n\t\t\t  }       \r\n\t\t\t }     \r\n\t\t]   \r\n\t} \r\n]|del(..|select(.==[]))",
      "Note: To remove the empty arrays del function is added which is giving the below output :",
      "[\r\n  {\r\n    \"id\": \"SAMPLE_PACKAGE\"\r\n  },\r\n  {\r\n    \"integrations\": [\r\n      {\r\n        \"id\": \"HELLO_INTEGRATION|01.00.0000\",\r\n        \"code\": \"HELLO_INTEGRATION\",\r\n        \"dependencies\": {\r\n          \"connections\": [\r\n            {\r\n              \"id\": \"HELLO_WORLD1\",\r\n              \"status\": \"CONFIGURED\"\r\n            }, ...\r\n          ]\r\n        }\r\n      }, ...\r\n    ]\r\n  }\r\n]"
    ],
    "utterance": "Extract the id of the package, and for each integration, list its id, code, and dependencies (connections, lookups, certificates, libraries, integrations) showing only selected fields, omitting empty arrays.",
    "expressions": [
      "[\n  { id: .id },\n  {\n    integrations: [\n      .integrations[] | {\n        id: .id,\n        code: .code,\n        dependencies: {\n          connections: (.dependencies.connections // [] | map({id,status})),\n          lookups: (.dependencies.lookups // [] | map({name,status})),\n          certificates: (.dependencies.certificates // [] | map({id,status})),\n          libraries: (.dependencies.libraries // [] | map({code,status,version})),\n          integrations: (.dependencies.integrations // [] | map({code,version}))\n        }\n      }\n    ]\n  }\n] | del(..|select(.==[]))"
    ],
    "data": [
      {
        "input": {
          "bartaType": "DEVELOPED",
          "countOfIntegrations": 3,
          "id": "SAMPLE_PACKAGE",
          "integrations": [
            {
              "code": "HELLO_INTEGRATION",
              "dependencies": {
                "connections": [
                  {
                    "id": "HELLO_WORLD1",
                    "lockedFlag": false,
                    "name": "Hello World1",
                    "role": "SOURCE",
                    "status": "CONFIGURED",
                    "type": "rest",
                    "usage": 6
                  }
                ]
              },
              "description": "",
              "eventSubscriptionFlag": false,
              "filmstrip": [
                {
                  "code": "HELLO_WORLD1",
                  "iconUrl": "/images/rest/rest_icon_46.png",
                  "name": "Hello World1",
                  "role": "SOURCE",
                  "status": "CONFIGURED"
                }
              ],
              "id": "HELLO_INTEGRATION|01.00.0000",
              "lockedFlag": false,
              "name": "HELLO_INTEGRATION",
              "pattern": "Orchestration",
              "patternDescription": "Map Data",
              "payloadTracingEnabledFlag": true,
              "publishFlag": false,
              "scheduleApplicable": false,
              "scheduleDefined": false,
              "status": "ACTIVATED",
              "style": "FREEFORM",
              "styleDescription": "Orchestration",
              "tempCopyExists": false,
              "tracingEnabledFlag": true,
              "version": "01.00.0000",
              "warningMsg": "ACTIVATE_PUBLISH_NO_CONN"
            },
            {
              "code": "HELLO_INTEGRATIO_LOOKUP",
              "dependencies": {
                "connections": [
                  {
                    "id": "HELLO_WORLD1",
                    "lockedFlag": false,
                    "name": "Hello World1",
                    "role": "SOURCE",
                    "status": "CONFIGURED",
                    "type": "rest",
                    "usage": 6
                  }
                ],
                "lookups": [
                  {
                    "lockedFlag": false,
                    "name": "COMMON_LOOKUP_VARIABLES",
                    "status": "CONFIGURED",
                    "usage": 1
                  }
                ]
              },
              "description": "",
              "eventSubscriptionFlag": false,
              "filmstrip": [
                {
                  "code": "HELLO_WORLD1",
                  "iconUrl": "/images/rest/rest_icon_46.png",
                  "name": "Hello World1",
                  "role": "SOURCE",
                  "status": "CONFIGURED"
                }
              ],
              "id": "HELLO_INTEGRATIO_LOOKUP|01.00.0000",
              "lockedFlag": false,
              "name": "HELLO_INTEGRATION_LOOKUP",
              "pattern": "Orchestration",
              "patternDescription": "Map Data",
              "payloadTracingEnabledFlag": true,
              "publishFlag": false,
              "scheduleApplicable": false,
              "scheduleDefined": false,
              "status": "ACTIVATED",
              "style": "FREEFORM",
              "styleDescription": "Orchestration",
              "tempCopyExists": false,
              "tracingEnabledFlag": true,
              "version": "01.00.0000",
              "warningMsg": "ACTIVATE_PUBLISH_NO_CONN"
            },
            {
              "code": "HI_INTEGRATION",
              "dependencies": {
                "connections": [
                  {
                    "id": "HELLO_WORLD1",
                    "lockedFlag": false,
                    "name": "Hello World1",
                    "role": "SOURCE",
                    "status": "CONFIGURED",
                    "type": "rest",
                    "usage": 6
                  }
                ]
              },
              "description": "",
              "eventSubscriptionFlag": false,
              "filmstrip": [
                {
                  "code": "HELLO_WORLD1",
                  "iconUrl": "/images/rest/rest_icon_46.png",
                  "name": "Hello World1",
                  "role": "SOURCE",
                  "status": "CONFIGURED"
                }
              ],
              "id": "HI_INTEGRATION|01.00.0000",
              "lockedFlag": false,
              "name": "HI_INTEGRATION",
              "pattern": "Orchestration",
              "patternDescription": "Map Data",
              "payloadTracingEnabledFlag": true,
              "publishFlag": false,
              "scheduleApplicable": false,
              "scheduleDefined": false,
              "status": "ACTIVATED",
              "style": "FREEFORM",
              "styleDescription": "Orchestration",
              "tempCopyExists": false,
              "tracingEnabledFlag": true,
              "version": "01.00.0000",
              "warningMsg": "ACTIVATE_PUBLISH_NO_CONN"
            }
          ],
          "isCloneAllowed": false,
          "isViewAllowed": false,
          "name": "SAMPLE_PACKAGE",
          "type": "DEVELOPED"
        },
        "output": [
          {
            "id": "SAMPLE_PACKAGE"
          },
          {
            "integrations": [
              {
                "id": "HELLO_INTEGRATION|01.00.0000",
                "code": "HELLO_INTEGRATION",
                "dependencies": {
                  "connections": [
                    {
                      "id": "HELLO_WORLD1",
                      "status": "CONFIGURED"
                    }
                  ]
                }
              },
              {
                "id": "HELLO_INTEGRATIO_LOOKUP|01.00.0000",
                "code": "HELLO_INTEGRATIO_LOOKUP",
                "dependencies": {
                  "connections": [
                    {
                      "id": "HELLO_WORLD1",
                      "status": "CONFIGURED"
                    }
                  ],
                  "lookups": [
                    {
                      "name": "COMMON_LOOKUP_VARIABLES",
                      "status": "CONFIGURED"
                    }
                  ]
                }
              },
              {
                "id": "HI_INTEGRATION|01.00.0000",
                "code": "HI_INTEGRATION",
                "dependencies": {
                  "connections": [
                    {
                      "id": "HELLO_WORLD1",
                      "status": "CONFIGURED"
                    }
                  ]
                }
              }
            ]
          }
        ]
      }
    ],
    "identifier": 72857271,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "expected: using jq to add onto vm_maps map. I will have an empty vm_map and each time it runs, i will add a x amount of new entries.",
      "You can use this:",
      "jq \\\n   --arg VMHOST     \"$VMHOST\"     \\\n   --arg NETWORK_IP \"$NETWORK_IP\" \\\n   --arg VM_NAME    \"$VM_NAME\"    \\\n'\n   .vm_map[ $VMHOST ] = {\n      host:    $VMHOST,\n      network: $NETWORK_IP,\n      name:    $VM_NAME\n   }\n'"
    ],
    "utterance": "Add or update a key in the vm_map object with a new entry using the values of VMHOST, NETWORK_IP, and VM_NAME.",
    "expressions": [
      ".vm_map[ $VMHOST ] = { host: $VMHOST, network: $NETWORK_IP, name: $VM_NAME }"
    ],
    "data": [
      {
        "input": {
          "gcs_config": [
            {
              "bucket_name": "somebucket",
              "bucket_readers": [],
              "bucket_writers": []
            }
          ],
          "label_application": "someapp",
          "label_environment": "dev",
          "lits_vm_zone": "somezone",
          "project_id": "someproject",
          "region": "someregion",
          "storage_bucket_required": true,
          "vm_map": {}
        },
        "output": {
          "gcs_config": [
            {
              "bucket_name": "somebucket",
              "bucket_readers": [],
              "bucket_writers": []
            }
          ],
          "label_application": "someapp",
          "label_environment": "dev",
          "lits_vm_zone": "somezone",
          "project_id": "someproject",
          "region": "someregion",
          "storage_bucket_required": true,
          "vm_map": {
            "vm1": {
              "host": "vm1",
              "network": "10.1.1.1",
              "name": "vm1"
            },
            "vm2": {
              "host": "123",
              "network": "10.1.12",
              "name": "vm2"
            }
          }
        }
      }
    ],
    "identifier": 72860104,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to add a float field to a json, I'm doing somthing like :",
      "echo '{\"field\":' '\"value\"}' | jq --arg id 21841453190106 --arg price 1.2 '.+ {$price}'",
      "getting:",
      "{ \"field\": \"value\", \"price\": \"1.2\" }",
      "I'd like to have:",
      "{ \"field\": \"value\", \"price\": 1.2 }",
      "echo '{\"field\":' '\"value\"}' | jq --arg id 21841453190106 --argjson price 1 '.+ {$price}'",
      "gives:",
      "{ \"field\": \"value\", \"price\": 1 }",
      "any way to have",
      "{ \"field\": \"value\", \"price\": 1.0 }"
    ],
    "utterance": "Add a field named price with float value 1.0 to the object, ensuring the type is numeric and not a string.",
    "expressions": [
      ". + {\"price\": 1.0}"
    ],
    "data": [
      {
        "input": {
          "field": "value"
        },
        "output": {
          "field": "value",
          "price": 1.0
        }
      }
    ],
    "identifier": 72870726,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to combine the two objects ( Cities and array addresss) and create a new array mailing_address as shown below",
      "Is there a way to perform this using jq?",
      ".Cities as $c | { mailing_address: (.addresss | map(. * $c)) }",
      "Then we create a new object, with your `mailing_address` key, loop over each address and add (*) the Cities variable to it"
    ],
    "utterance": "Combine each element of the 'addresss' array with the fields from 'Cities', producing an array under the key 'mailing_address' where each object merges address fields with the city information.",
    "expressions": [
      ".Cities as $c | { mailing_address: (.addresss | map(. * $c)) }"
    ],
    "data": [
      {
        "input": {
          "Cities": {
            "city": "New York",
            "zip_code": "10034",
            "state": "NY"
          },
          "addresss": [
            {
              "address_line1": "123 Main Street",
              "address_line2": "Unit 130"
            },
            {
              "address_line1": "5th Avenue",
              "address_line2": "East River Tower"
            }
          ]
        },
        "output": {
          "mailing_address": [
            {
              "address_line1": "123 Main Street",
              "address_line2": "Unit 130",
              "city": "New York",
              "zip_code": "10034",
              "state": "NY"
            },
            {
              "address_line1": "5th Avenue",
              "address_line2": "East River Tower",
              "city": "New York",
              "zip_code": "10034",
              "state": "NY"
            }
          ]
        }
      }
    ],
    "identifier": 72872157,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am using the following jq query to extract the AWS ARN and associated protocols. However I only need the ARN to be listed once followed by the ports and protocols",
      "my code is `jq -r '.Listeners[] | (.LoadBalancerArn), (.Protocol)'` and the results are",
      "Desired results\n\n    \"arn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde\"\n    \"HTTP\"\n    \"HTTP\"\n    \"HTTPS\"",
      "Group by the common field and iterate over the groups, then output the common field of the first (which is the same for the whole group), and iterate again to output other fields from the same group:\n~~~sh\njq -r '.Listeners | group_by(.LoadBalancerArn)[]\n  | .[0].LoadBalancerArn, .[].Protocol'\n~~~\n~~~json\narn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde\nHTTP\nHTTP\nHTTPS\n~~~"
    ],
    "utterance": "Output each unique LoadBalancerArn once, followed by all its associated Protocol values in order.",
    "expressions": [
      ".Listeners | group_by(.LoadBalancerArn)[] | .[0].LoadBalancerArn, .[].Protocol"
    ],
    "data": [
      {
        "input": {
          "Listeners": [
            {
              "LoadBalancerArn": "arn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde",
              "Port": 9090,
              "Protocol": "HTTP"
            },
            {
              "LoadBalancerArn": "arn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde",
              "Port": 80,
              "Protocol": "HTTP"
            },
            {
              "LoadBalancerArn": "arn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde",
              "Port": 443,
              "Protocol": "HTTPS"
            }
          ]
        },
        "output": [
          "arn:aws:elasticloadbalancing:us-xxxx-1:123456789:loadbalancer/app/msword-123456789/20b73abcde",
          "HTTP",
          "HTTP",
          "HTTPS"
        ]
      }
    ],
    "identifier": 72872361,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to get \"d_usage_65\" for input \"D usage 65 | #site\".",
      "You can access keys and values at once using `to_entries`. For instance:",
      ".[] | .alerts | to_entries[] | select(.value.title == \"D Usage 65 | #site\").key",
      "`to_entries` and `select` are the important things here:",
      "$ jq -r --arg t 'D Usage 65 | #site' '[].alerts | to_entries[] | select(.value.title==$t) | .key'"
    ],
    "utterance": "Return the alert key whose title equals 'D Usage 65 | #site'.",
    "expressions": [
      ".[] | .alerts | to_entries[] | select(.value.title == \"D Usage 65 | #site\").key",
      ".[] | .alerts | tostream | select(.[1] == \"D Usage 65 | #site\")[0][0]",
      ".[].alerts | to_entries[] | select(.value.title == $t) | .key"
    ],
    "data": [
      {
        "input": {
          "Server monitoring - Disk, Memory, CPU": {
            "alerts": {
              "d_usage_65": {
                "title": "D Usage 65 | #site"
              },
              "d_usage_75": {
                "title": "D Usage 75 | #site"
              }
            }
          },
          "Categoty 2": {
            "alerts": {
              "thread_count_50": {
                "title": "Thread Count above 50 | #site"
              },
              "upload_manager_thread_count_100": {
                "title": "Thread Count above 100 | #site"
              }
            }
          }
        },
        "output": "d_usage_65"
      }
    ],
    "identifier": 72881941,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "i want to output two values from this below json.",
      "the value `.next` and value of `.digest` when `.name = \"latest\"` and `architecture = \"amd64\"`.",
      "What i want is the two results as one ouput like this:\n`sha256:1e180fcaede581b8cc8e2ce8bc010daff9c64972023a0faaf226d92bd9840113, https://hub.docker.com/v2/repositories/grafana/grafana/tags/?page=2`",
      "(.results[] | select(.name==\"latest\").images[] | select(.architecture==\"amd64\").digest), .next"
    ],
    "utterance": "Output the digest where name equals \"latest\" and architecture equals \"amd64\", together with the next field, as a single output.",
    "expressions": [
      "(.results[] | select(.name==\"latest\").images[] | select(.architecture==\"amd64\").digest), .next"
    ],
    "data": [
      {
        "input": {
          "count": 685,
          "next": "https://hub.docker.com/v2/repositories/grafana/grafana/tags/?page=2&page_size=100",
          "previous": null,
          "results": [
            {
              "creator": 4132603,
              "id": 190096972,
              "images": [
                {
                  "architecture": "amd64",
                  "features": "",
                  "variant": null,
                  "digest": "sha256:1e180fcaede581b8cc8e2ce8bc010daff9c64972023a0faaf226d92bd9840113",
                  "os": "linux",
                  "os_features": "",
                  "os_version": null,
                  "size": 91402044,
                  "status": "active",
                  "last_pulled": "2022-07-06T03:17:56.450051Z",
                  "last_pushed": "2022-06-28T14:15:09.891118Z"
                },
                {
                  "architecture": "arm64",
                  "features": "",
                  "variant": "v8",
                  "digest": "sha256:834e26ec5e5f4e5eb2410147c12b060143906de04f35eb51cae8285e9b646a56",
                  "os": "linux",
                  "os_features": "",
                  "os_version": null,
                  "size": 81476918,
                  "status": "active",
                  "last_pulled": "2022-07-06T09:03:47.33578Z",
                  "last_pushed": "2022-06-28T14:15:10.030013Z"
                },
                {
                  "architecture": "arm",
                  "features": "",
                  "variant": "v7",
                  "digest": "sha256:3bc954cb89f35bdc0705a381ab9b9a49b014074fbc555c61bf7572cb0965f323",
                  "os": "linux",
                  "os_features": "",
                  "os_version": null,
                  "size": 82128685,
                  "status": "active",
                  "last_pulled": "2022-07-06T09:33:22.228128Z",
                  "last_pushed": "2022-06-28T14:15:10.159549Z"
                }
              ],
              "last_updated": "2022-06-28T14:15:10.331474Z",
              "last_updater": 2978857,
              "last_updater_username": "grafanaci",
              "name": "latest",
              "repository": 135617,
              "full_size": 91402044,
              "v2": true,
              "tag_status": "active",
              "tag_last_pulled": "2022-07-06T09:33:22.228128Z",
              "tag_last_pushed": "2022-06-28T14:15:10.331474Z"
            }
          ]
        },
        "output": [
          "sha256:1e180fcaede581b8cc8e2ce8bc010daff9c64972023a0faaf226d92bd9840113",
          "https://hub.docker.com/v2/repositories/grafana/grafana/tags/?page=2&page_size=100"
        ]
      }
    ],
    "identifier": 72883941,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "And as you see, the \"mode\" value doesnt matter, what really matters is that the \"signatureId\" must not be duplicate, so when we remove the whole block, which ever \"mode\" stays, its not a problem.",
      "Use `unique_by` with the field to be checked for duplicates as its argument. It will always take the first of a kind (here, the one with \"mode\": 0)",
      "jq '.signatures |= unique_by(.signatureId)'"
    ],
    "utterance": "Remove duplicate objects from the signatures array so that each signatureId appears only once, regardless of the mode value.",
    "expressions": [
      ".signatures |= unique_by(.signatureId)"
    ],
    "data": [
      {
        "input": {
          "signatures": [
            {
              "signatureId": 50,
              "mode": 0
            },
            {
              "signatureId": 12,
              "mode": 0
            },
            {
              "signatureId": 12,
              "mode": 1
            }
          ]
        },
        "output": {
          "signatures": [
            {
              "signatureId": 50,
              "mode": 0
            },
            {
              "signatureId": 12,
              "mode": 0
            }
          ]
        }
      }
    ],
    "identifier": 72884002,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Need to convert the header array into object:",
      ".header |= (map(split(\": \") | { (.[0]): .[1] }) | add)",
      "Will produce:",
      "{\n  \"header\": {\n    \"key1\": \"value1\",\n    \"key2\": \"value2\",\n    \"key3\": \"value3\"\n  }\n}"
    ],
    "utterance": "Convert an array of strings in the form \"key: value\" under the key \"header\" into an object with those key-value pairs assigned under \"header\".",
    "expressions": [
      ".header |= (map(split(\": \") | { (.[0]): .[1] }) | add)"
    ],
    "data": [
      {
        "input": {
          "header": [
            "key1: value1",
            "key2: value2",
            "key3: value3"
          ]
        },
        "output": {
          "header": {
            "key1": "value1",
            "key2": "value2",
            "key3": "value3"
          }
        }
      }
    ],
    "identifier": 72886481,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to sort following json response based on \"startTime\" and also want to filter based on \"name\" and fetch only \"dataCenter\" of matched record.",
      "How about this:\n\n```bash\njq --arg name JPCSKELT '\nmap(select(.name==$name))|sort_by(.startTime)[].dataCenter\n' data.json \n```",
      "Based on your sample data, this produces:\n\n```\n\"mvsADM\"\n\"mvsADM\"\n\"mvsADM\"\n```"
    ],
    "utterance": "Select records where name equals JPCSKELT, sort them by startTime, and output only the dataCenter field of each matched record.",
    "expressions": [
      "map(select(.name==$name))|sort_by(.startTime)[].dataCenter",
      "map(select(.name==\"JPCSKELT\"))|sort_by(.startTime)[].dataCenter"
    ],
    "data": [
      {
        "input": [
          {
            "name": "JPCSKELT",
            "dataCenter": "mvsADM",
            "orderId": "G9HC8",
            "scheduleTable": "FD33515",
            "nodeGroup": null,
            "controlmApp": "P/C-DEVELOPMENT-LRSP",
            "groupName": "SCMTEST",
            "assignmentGroup": "HOST_CONFIG_MGMT",
            "owner": "PC00000",
            "description": null,
            "startTime": "2021-11-11 17:45:48.0",
            "endTime": "2021-11-11 17:45:51.0",
            "successCount": 1,
            "failureCount": 0,
            "dailyRunCount": 0,
            "scriptName": "JPCSKELT"
          },
          {
            "name": "JPCSKELT",
            "dataCenter": "mvsADM",
            "orderId": "FWX98",
            "scheduleTable": "JPCS1005",
            "nodeGroup": null,
            "controlmApp": "P/C-DEVELOPMENT-LRSP",
            "groupName": "SCMTEST",
            "assignmentGroup": "HOST_CONFIG_MGMT",
            "owner": "PC00000",
            "description": null,
            "startTime": "2021-07-13 10:49:47.0",
            "endTime": "2021-07-13 10:49:49.0",
            "successCount": 1,
            "failureCount": 0,
            "dailyRunCount": 0,
            "scriptName": "JPCSKELT"
          },
          {
            "name": "JPCSKELT",
            "dataCenter": "mvsADM",
            "orderId": "FWX98",
            "scheduleTable": "JPCS1005",
            "nodeGroup": null,
            "controlmApp": "P/C-DEVELOPMENT-LRSP",
            "groupName": "SCMTEST",
            "assignmentGroup": "HOST_CONFIG_MGMT",
            "owner": "PC00000",
            "description": null,
            "startTime": "2021-10-13 10:49:47.0",
            "endTime": "2021-10-13 10:49:49.0",
            "successCount": 1,
            "failureCount": 0,
            "dailyRunCount": 0,
            "scriptName": "JPCSKELT"
          }
        ],
        "output": [
          "mvsADM",
          "mvsADM",
          "mvsADM"
        ]
      }
    ],
    "identifier": 72887277,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "In my specific case, I'm looking to convert input like [\"a\", 1, \"b\", 2, \"c\", 3] into an object like {\"a\": 1, \"b\": 2, \"c\": 3}, but the general technique is processing an array using a sliding window (in this case, of size 2).",
      "You could use `reduce` to go through the array, and take two items at a time:\n~~~sh\njq 'reduce while(. != []; .[2:]) as [$key, $val] ({}; .[$key] = $val)'\n~~~",
      "def sliding($a;$b): [while(. != []; .[$a:])[:$b]];",
      "sliding(2;2) | map({key: (.[0]|tostring), value: .[1]}) | from_entries",
      "reduce _nwise(2) as [$k, $v] ({}; .[$k] = $v)"
    ],
    "utterance": "Convert an array of alternating keys and values, like [\"a\", 1, \"b\", 2, \"c\", 3], into an object where each key points to its corresponding value.",
    "expressions": [
      "reduce while(. != []; .[2:]) as [$key, $val] ({}; .[$key] = $val)",
      "def sliding($a;$b): [while(. != []; .[$a:])[:$b]]; sliding(2;2) | map({key: (.[0]|tostring), value: .[1]}) | from_entries",
      "reduce _nwise(2) as [$k, $v] ({}; .[$k] = $v)"
    ],
    "data": [
      {
        "input": [
          "a",
          1,
          "b",
          2,
          "c",
          3
        ],
        "output": {
          "a": 1,
          "b": 2,
          "c": 3
        }
      }
    ],
    "identifier": 72889257,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "`I would like to have output like this with second IP: intranet.site.com : 59.89.8.8`",
      "`cat ./droplet_list.json | jq '.[] | .name , .networks.v4[].ip_address'`",
      "`I was trying run: cat ./droplet_list.json | jq '.[] | .name , .networks.v4[].ip_address[1]' and got the error`",
      "Just use `+` to concatenate the strings:",
      "`jq -r '.[] | .name + \": \" + .networks.v4[].ip_address'`",
      "`jq -r '.[] | .name + \": \" + .networks.v4[1].ip_address'`",
      "One option would be using **`map`** along with **`+`** operator such as `jq -r 'map(.name + \": \" + .networks.v4[].ip_address)[]' yourfile.json`"
    ],
    "utterance": "Print the droplet name followed by the second IPv4 address in a single line, separated by ': '.",
    "expressions": [
      ".[] | .name + \": \" + .networks.v4[1].ip_address",
      "map(.name + \": \" + .networks.v4[1].ip_address)[]"
    ],
    "data": [
      {
        "input": [
          {
            "id": 106,
            "name": "intranet.site.com",
            "networks": {
              "v4": [
                {
                  "ip_address": "10.136.95.40",
                  "netmask": "255.255.0.0",
                  "gateway": "10.136.0.1"
                },
                {
                  "ip_address": "59.89.8.8",
                  "netmask": "255.255.240.0",
                  "gateway": "159.0.0.1"
                },
                {
                  "ip_address": "57.23.3.1",
                  "netmask": "255.255.252.0",
                  "gateway": "157.0.0.1"
                }
              ]
            }
          }
        ],
        "output": "intranet.site.com: 59.89.8.8"
      }
    ],
    "identifier": 72894856,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to search for \"[\\\"res\\\",\\\"q3\\\"]\" and replace it with a variable var1 value \"[\\\"res\\\"]\"",
      "`jq '.ingress.args.params[] | select(.name==\"srm\").value |= [\"'${var1}'\"]' env.json`",
      "Since you want to update `ingress`, and not return only the result of the loops, use:",
      ".ingress.args.params |= map(select(.name==\"srm\").value |= \"new-value\")"
    ],
    "utterance": "Replace the value of the object with name equal to srm in ingress.args.params with \"[\\\"res\\\"]\".",
    "expressions": [
      ".ingress.args.params |= map(select(.name==\"srm\").value |= \"[\\\"res\\\"]\")"
    ],
    "data": [
      {
        "input": {
          "idsb": "marqd",
          "data": {
            "name": "bcon-dv-alert"
          },
          "ingress": {
            "args": {
              "params": [
                {
                  "name": "spt",
                  "value": "cld"
                },
                {
                  "name": "scv",
                  "value": "sdv"
                },
                {
                  "name": "scr",
                  "value": "ord"
                },
                {
                  "name": "srm",
                  "value": "[\"res\",\"q3\"]"
                },
                {
                  "name": "tgo",
                  "value": "pbc"
                }
              ]
            },
            "wfr": {
              "name": "t-r-e"
            },
            "snm": "as-r"
          }
        },
        "output": {
          "idsb": "marqd",
          "data": {
            "name": "bcon-dv-alert"
          },
          "ingress": {
            "args": {
              "params": [
                {
                  "name": "spt",
                  "value": "cld"
                },
                {
                  "name": "scv",
                  "value": "sdv"
                },
                {
                  "name": "scr",
                  "value": "ord"
                },
                {
                  "name": "srm",
                  "value": "[\"res\"]"
                },
                {
                  "name": "tgo",
                  "value": "pbc"
                }
              ]
            },
            "wfr": {
              "name": "t-r-e"
            },
            "snm": "as-r"
          }
        }
      }
    ],
    "identifier": 72898561,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to change it to the following structure wherever in a JSON the above structure is appearing:",
      "The rest of the JSON should keep unmodified. I hope some jq expert can help me out here. :)",
      "The only keys I always know is value and type.",
      "Use `walk` to recursively visit each subitem. With each visit, `select` only those `objects` that have fields called `value` and `type` which are of `type` `\"object\"` and `\"string\"`, respectively. Update (`|=`) these objects by first storing their `.type` field in a variable `$type`. Then, consider the `.value` field, and modify each of its fields by updating (`|=`) them to an object using their original value `.` and the previously stored `$type` variable.",
      "walk(\n  (objects | select([.value, .type | type] == [\"object\", \"string\"])) |=\n  (.type as $type | .value | with_entries(.value |= {value: ., $type}))\n)"
    ],
    "utterance": "Transform all objects containing 'value' (object) and 'type' (string) fields so that each key inside 'value' is elevated to the parent level, with its value replaced by an object containing the original value and the 'type', and preserve all other data unchanged.",
    "expressions": [
      "walk(\n  (objects | select([.value, .type | type] == [\"object\", \"string\"])) |=\n  (.type as $type | .value | with_entries(.value |= {value: ., $type}))\n)"
    ],
    "data": [
      {
        "input": {
          "a": {
            "b": {
              "1": {
                "anyKey1": {
                  "value": {
                    "fontFamily": "value1",
                    "fontWeight": "value2",
                    "lineHeight": "value3",
                    "fontSize": "value4",
                    "letterSpacing": "value5",
                    "textDecoration": "value6",
                    "textCase": "value7"
                  },
                  "type": "typography"
                }
              }
            },
            "anyKey2": {
              "value": {
                "fontFamily": "value1",
                "fontWeight": "value2",
                "lineHeight": "value3",
                "fontSize": "value4",
                "letterSpacing": "value5",
                "textDecoration": "value6",
                "textCase": "value7"
              },
              "type": "typography"
            }
          }
        },
        "output": {
          "a": {
            "b": {
              "1": {
                "anyKey1": {
                  "fontFamily": {
                    "value": "value1",
                    "type": "typography"
                  },
                  "fontWeight": {
                    "value": "value2",
                    "type": "typography"
                  },
                  "lineHeight": {
                    "value": "value3",
                    "type": "typography"
                  },
                  "fontSize": {
                    "value": "value4",
                    "type": "typography"
                  },
                  "letterSpacing": {
                    "value": "value5",
                    "type": "typography"
                  },
                  "textDecoration": {
                    "value": "value6",
                    "type": "typography"
                  },
                  "textCase": {
                    "value": "value7",
                    "type": "typography"
                  }
                }
              }
            },
            "anyKey2": {
              "fontFamily": {
                "value": "value1",
                "type": "typography"
              },
              "fontWeight": {
                "value": "value2",
                "type": "typography"
              },
              "lineHeight": {
                "value": "value3",
                "type": "typography"
              },
              "fontSize": {
                "value": "value4",
                "type": "typography"
              },
              "letterSpacing": {
                "value": "value5",
                "type": "typography"
              },
              "textDecoration": {
                "value": "value6",
                "type": "typography"
              },
              "textCase": {
                "value": "value7",
                "type": "typography"
              }
            }
          }
        }
      }
    ],
    "identifier": 72909369,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "What do now is `jq '.default | keys' Pipfile.lock` which produces:\n\n    [\n        \"value1\",\n        \"value2\",\n    ...\n    ]\n\nI would need the output to:\n\n1. Not produce the first line with `[` and last line with `]`\n2. Have the values printed without quotes",
      "You want the elements of the array, not the array itself.",
      "Use the `-r` option to output raw strings, rather than JSON strings.",
      "jq -r '.default | keys[]' Pipfile.lock"
    ],
    "utterance": "List all keys of the 'default' object, printing each on a new line without brackets or quotes.",
    "expressions": [
      "jq -r '.default | keys[]' Pipfile.lock"
    ],
    "identifier": 72913987,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Is there anyway to inject an input permanently into the json file using the jq command which would be equivalent to sed -i?",
      "jq does not have a command-line option like sed's `-i`.  One way to avoid having to fiddle with a temporary file is to use `sponge` (part of the `moreutils` collection) if it is available in your computing environment:\n\njq .... taskdef.json | sponge taskdef.json"
    ],
    "utterance": "Set the image field to the value of the IMAGE environment variable within containerDefinitions and save the changes directly back to the original file.",
    "expressions": [
      "jq \".containerDefinitions[].image=\\\"$IMAGE\\\"\" taskdef.json | sponge taskdef.json"
    ],
    "data": [
      {
        "input": {
          "containerDefinitions": [
            {
              "name": "container_name",
              "image": "",
              "essential": true,
              "logConfiguration": {
                "logDriver": "awslogs",
                "options": {
                  "awslogs-group": "/ecs/taskdef",
                  "awslogs-region": "us-east-2",
                  "awslogs-stream-prefix": "ecs"
                }
              }
            }
          ]
        },
        "output": {
          "containerDefinitions": [
            {
              "name": "container_name",
              "image": "########.dkr.ecr.us-west-1.amazonaws.com/container-repo:latest",
              "essential": true,
              "logConfiguration": {
                "logDriver": "awslogs",
                "options": {
                  "awslogs-group": "/ecs/taskdef",
                  "awslogs-region": "us-east-2",
                  "awslogs-stream-prefix": "ecs"
                }
              }
            }
          ]
        }
      }
    ],
    "identifier": 72918109,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "query=$(cat $basename/test.json | jq -r '.DesignCode | to_entries[] | \"\\(.key):\\(.value)\"')",
      "query=( $(cat $basename/test.json | jq -r '.DesignCode | to_entries[] | \"\\(.key):\\(.value)\"') )"
    ],
    "utterance": "Output all key-value pairs from the DesignCode object as colon-separated strings in the form key:value, including those with spaces.",
    "expressions": [
      ".DesignCode | to_entries[] | \"\u001b[0;34m\\(.key):\\(.value)\u001b[0m\""
    ],
    "identifier": 72935371,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "what I want is to search for a specific string `SEARCH-TEXT` in the JSON below and then output the parent value in key `name:` when it doesn&#39;t have a value `LATEST`.",
      "Expecting result: `NOT-LATEST`",
      "jq -r '.results[] | select(.name!=\"LATEST\" and .images[].digest==\"SEARCH-TEXT\")| .name'"
    ],
    "utterance": "Find items whose image digest is SEARCH-TEXT and output their name if it is not LATEST.",
    "expressions": [
      ".results[] | select(.name!=\"LATEST\" and .images[].digest==\"SEARCH-TEXT\") | .name"
    ],
    "data": [
      {
        "input": {
          "count": 685,
          "next": "https://hub.docker.com/v2/repositories/grafana/grafana/tags?page=2",
          "previous": null,
          "results": [
            {
              "creator": 4132603,
              "id": 190096972,
              "images": [
                {
                  "architecture": "amd64",
                  "features": "",
                  "variant": null,
                  "digest": "SEARCH-TEXT",
                  "os": "linux",
                  "os_features": "",
                  "os_version": null,
                  "size": 91402044,
                  "status": "active",
                  "last_pulled": "2022-07-11T00:19:44.535475Z",
                  "last_pushed": "2022-06-28T14:15:09.891118Z"
                }
              ],
              "last_updated": "2022-06-28T14:15:10.331474Z",
              "last_updater": 2978857,
              "last_updater_username": "grafanaci",
              "name": "LATEST",
              "repository": 135617,
              "full_size": 91402044,
              "v2": true,
              "tag_status": "active",
              "tag_last_pulled": "2022-07-11T04:34:43.293126Z",
              "tag_last_pushed": "2022-06-28T14:15:10.331474Z"
            },
            {
              "creator": 2978857,
              "id": 148674560,
              "images": [
                {
                  "architecture": "amd64",
                  "features": "",
                  "variant": null,
                  "digest": "SEARCH-TEXT",
                  "os": "linux",
                  "os_features": "",
                  "os_version": null,
                  "size": 116299353,
                  "status": "active",
                  "last_pulled": "2022-07-11T06:32:32.095452Z",
                  "last_pushed": "2022-07-08T23:02:18.104164Z"
                }
              ],
              "last_updated": "2022-07-08T23:02:18.806479Z",
              "last_updater": 2978857,
              "last_updater_username": "grafanaci",
              "name": "NOT-LATEST",
              "repository": 135617,
              "full_size": 116299353,
              "v2": true,
              "tag_status": "active",
              "tag_last_pulled": "2022-07-11T06:32:32.095452Z",
              "tag_last_pushed": "2022-07-08T23:02:18.806479Z"
            }
          ]
        },
        "output": "NOT-LATEST"
      }
    ],
    "identifier": 72935518,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "echo $my_json | jq \".values[].name\"",
      "\"Hans\"\n\"John\"",
      "# What I want\n    [\"Hans\", \"John\"]",
      "You can add *square brackets* (*preferably* along with ***c***(*ompact output*) option in order to return the array on a single line as in the question) such as\n```sh\necho $my_json | jq -c '[.values[].name]'\n```"
    ],
    "utterance": "Extract all the 'name' values from the 'values' array and return them as an array.",
    "expressions": [
      "[.values[].name]",
      "-c '[.values[].name]'"
    ],
    "data": [
      {
        "input": {
          "count": 2,
          "values": [
            {
              "name": "Hans",
              "age": 22
            },
            {
              "name": "John",
              "age": 88
            }
          ]
        },
        "output": [
          "Hans",
          "John"
        ]
      }
    ],
    "identifier": 72936306,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a JSON body that looks something like this.",
      "I need a command that would put this `JSON` into a file called something like `user.json`.",
      "pre_build:\n  commands:\n    - CREDS=\"$(aws secretsmanager get-secret-value --secret-id arn:aws:secretsmanager:{secret} | jq '.SecretString | fromjson')\"",
      "build:\n  commands:\n    - echo $CREDS (prints the correct credentials in json format)\n    - Need to send $CREDS > user.json or something similar. Is this possible?"
    ],
    "utterance": "Write the value of an environment variable containing credentials in JSON format to a file called user.json during the build phase.",
    "expressions": [
      "echo \"$CREDS\" > user.json"
    ],
    "data": [
      {
        "input": {
          "CREDS": {
            "user1": {
              "username": "user",
              "password": "123"
            },
            "user2": {
              "username": "user2",
              "password": "1234"
            }
          }
        },
        "output": "{\n  \"user1\": {\n    \"username\": \"user\",\n    \"password\": \"123\"\n  },\n  \"user2\": {\n    \"username\": \"user2\",\n    \"password\": \"1234\"\n  }\n}\n"
      }
    ],
    "identifier": 72939501,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have an array of arrays:",
      "I want to add the values where the strings match to produce:",
      "group_by(.[0]) | .[] |= [.[0][0], ([.[][1]] | add)]",
      "Is there a better way?",
      "totals(.[1]) | [unpack]",
      "group_by(.[0]) | map(transpose | .[0] |= .[0] | .[1] |= add)",
      "group(first) | map_values(map(last)|add) | to_entries | map(map(.))"
    ],
    "utterance": "Sum the numbers grouped by the matching string in the first element of each subarray, returning an array of arrays where each contains the string and its sum.",
    "expressions": [
      "group_by(.[0]) | map(transpose | .[0] |= .[0] | .[1] |= add)",
      "group_by(.[0]) | .[] |= [.[0][0], ([.[][1]] | add)]",
      "totals(.[1]) | [unpack]",
      "group(first) | map_values(map(last)|add) | to_entries | map(map(.))",
      "group(first) | map_values(map(last)|add) | to_entries | map(map(.))"
    ],
    "data": [
      {
        "input": [
          [
            "stringA",
            1
          ],
          [
            "stringA",
            2
          ],
          [
            "stringB",
            3
          ],
          [
            "stringB",
            4
          ],
          [
            "stringC",
            5
          ],
          [
            "stringC",
            6
          ]
        ],
        "output": [
          [
            "stringA",
            3
          ],
          [
            "stringB",
            7
          ],
          [
            "stringC",
            11
          ]
        ]
      }
    ],
    "identifier": 72940065,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to append an object after `date`, so the resulting file will be like this.",
      "The idea is to convert the JSON records into k/v pairs, find the position of the key named `\"date\"` and then use slice expressions to append the required object after the position and then append back the rest of the pairs.",
      "to_entries | \n( map(.key == \"date\") | index(true) ) as $pos | \n.[0:$pos+1] + [{\"key\":\"hw\",\"value\":{\"cpu\":\"intel\"}}] + .[$pos+1:] |\nfrom_entries"
    ],
    "utterance": "Insert an object {\"hw\":{\"cpu\":\"intel\"}} immediately after the key \"date\", preserving key order as shown.",
    "expressions": [
      "to_entries | ( map(.key == \"date\") | index(true) ) as $pos | .[0:$pos+1] + [{\"key\":\"hw\",\"value\":{\"cpu\":\"intel\"}}] + .[$pos+1:] | from_entries"
    ],
    "data": [
      {
        "input": {
          "ver": "v2.0",
          "date": "11 Jul 2022 21:28 WIB",
          "disk": {},
          "network": {},
          "bench": {}
        },
        "output": {
          "ver": "v2.0",
          "date": "11 Jul 2022 21:28 WIB",
          "hw": {
            "cpu": "intel"
          },
          "disk": {},
          "network": {},
          "bench": {}
        }
      }
    ],
    "identifier": 72940610,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Expecting a csv with data in selected columns ( even mutilple values in each cell accepted )",
      "jq: error (at <stdin>:1): object ({\"id\":\"MDU6...) is not valid in a csv row",
      "As .labels is an array of objects, you need to process it :",
      "jq -r '[\"number\",\"title\",\"assignees\",\"state\",\"labels\",\"url\"],\n       (.[] | [.number, .title,\n               (.assignees | if length==0 then \"Unassigned\" else map(.login)|join(\",\") end),\n               .state,\n               (.labels | map(.name)|join(\",\")),\n               .url\n              ]\n       ) | @csv' > issues-$(date '+%Y-%m-%d').csv"
    ],
    "utterance": "Export a table where each row contains issue number, title, assignees as a comma-separated string or 'Unassigned', state, labels as a comma-separated string, and URL.",
    "expressions": [
      "[\"number\",\"title\",\"assignees\",\"state\",\"labels\",\"url\"],\n       (.[] | [.number, .title,\n               (.assignees | if length==0 then \"Unassigned\" else map(.login)|join(\",\") end),\n               .state,\n               (.labels | map(.name)|join(\",\")),\n               .url\n              ]\n       ) | @csv"
    ],
    "identifier": 72944307,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input JSON structure (e.g., what the issues look like, how assignees and labels are represented), so it's unclear how to map the fields.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a legacy cli tool which outputs a structured list with sub-items intended with a `tab` (stackoverflow won't let me put tabs here so I replaced them with 4 spaces in this example).",
      "I try to achieve an JSON output like",
      "{\r\n  \"Heading One\": {\r\n    \"Sub One\": \"Value 1\",\r\n    \"Sub Two\": \"Value 2\"\r\n  },\r\n  \"Heading Two\": {\r\n    \"Sub Three\": \"Value 3\",\r\n    \"Sub Four\": \"Value 4\"\r\n  },\r\n  \"Key One\": \"This key has no heading\"\r\n}",
      "This is an approach for a deeply nested input. It splits on top-level items using a negative look-ahead regex on tabs following newlines, then separates the head and \"unindents\" the rest by removing one tab following a newline, which serves as input for a recursive call.",
      "jq -Rs '\n  def comp:\n    reduce (splits(\"\\n(?!\\\\t)\") | select(length > 0)) as $item ({};\n      ($item | index(\":\")) as $hpos | .[$item[:$hpos]] = (\n        $item[$hpos + 1:] | gsub(\"\\n\\t\"; \"\\n\")\n        | if test(\"\\n\") then comp else .[index(\"'\\''\") + 1: rindex(\"'\\''\")] end\n      )\n    );\n  comp\n'"
    ],
    "utterance": "Convert indented, colon-separated key-value text with headings and sub-items into a nested object where top-level lines become keys and indented sub-lines become values or objects, as shown in the provided example.",
    "expressions": [
      "jq -Rs '\n  def comp:\n    reduce (splits(\"\\n(?!\\\\t)\") | select(length > 0)) as $item ({};\n      ($item | index(\":\")) as $hpos | .[$item[:$hpos]] = (\n        $item[$hpos + 1:] | gsub(\"\\n\\t\"; \"\\n\")\n        | if test(\"\\n\") then comp else .[index(\"'\\''\") + 1: rindex(\"'\\''\")] end\n      )\n    );\n  comp\n'"
    ],
    "data": [
      {
        "input": "Heading One:\n    Sub One: 'Value 1'\n    Sub Two: 'Value 2'\nHeading Two:\n    Sub Three: 'Value 3'\n    Sub Four: 'Value 4'\nKey One: 'This key has no heading'",
        "output": {
          "Heading One": {
            "Sub One": "Value 1",
            "Sub Two": "Value 2"
          },
          "Heading Two": {
            "Sub Three": "Value 3",
            "Sub Four": "Value 4"
          },
          "Key One": "This key has no heading"
        }
      }
    ],
    "identifier": 72954503,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a requirement where in 2 parameter files needs to be merged to one using Jq",
      "param1.json\n[\n   \"name=xyz\",\n   \"age=40\",\n   \"email=qqqq\"\n] ",
      "param2.json\n[\n  \"name=xyz\",\n  \"age=42\",\n  \"drivingLicense=2761\"\n]",
      "I need a resultant value to be\n[\n  \"name=xyz\",\n  \"age=42\",\n  \"email=qqqq\",\n  \"drivingLicense=2761\"\n]",
      "When I try to use Jq add **jq -s '.[0] + .[1]' param1.json param2.json** the resultant \n[\n  \"name=xyz\",\n  \"age=40\",\n  \"email=qqqq\",\n  \"name=xyz\",\n  \"age=42\",\n  \"drivingLicense=2761\"\n]\n",
      "jq -s '[add | with_entries(.key = (.value | .[:index(\"= \")]))[]]'",
      "jq -s 'map(map(./\"=\"|{(first):.[1:]|join(\"=\")})|add)|add|to_entries|map(join(\"=\"))' param1.json  param2.json",
      "add\n|map(./\"=\"|{(first):.[1:]|join(\"=\")})\n|add\n|to_entries\n|map(join(\"=\"))",
      "jq -s 'add | map(./\"=\") | group_by(first) | map(last|join(\"=\"))' param1.json param2.json"
    ],
    "utterance": "Merge two files containing arrays of key=value strings so that each key appears only once with the value from the second file taking precedence, and the result is a flat array of key=value strings.",
    "expressions": [
      "[add | with_entries(.key = (.value | .[:index(\"=\")]))[]]",
      "map(map(./\"=\"|{(first):.[1:]|join(\"=\")})|add)|add|to_entries|map(join(\"=\"))",
      "add\n|map(./\"=\"|{(first):.[1:]|join(\"=\")})\n|add\n|to_entries\n|map(join(\"=\"))",
      "add | map(./\"=\") | group_by(first) | map(last|join(\"=\"))"
    ],
    "data": [
      {
        "input": [
          [
            "name=xyz",
            "age=40",
            "email=qqqq"
          ],
          [
            "name=xyz",
            "age=42",
            "drivingLicense=2761"
          ]
        ],
        "output": [
          "name=xyz",
          "age=42",
          "email=qqqq",
          "drivingLicense=2761"
        ]
      }
    ],
    "identifier": 72959345,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to select and return the object **ID** which is of **type** *S*, **count** > *20*, and **remainingMoney** for **FOOD** inside **categories** on a **friday** is the *least* among all the other objects.",
      "Use `-s` to turn the stream into an array, then `map(select(\u2026))` to filter by your criteria, find the minimum using `min_by(\u2026)` with your other criteria, and finally output your desired field `.id`.",
      "jq -s '\n  map(select(.type == \"S\" and .count > 20))\n  | min_by(.friday.remainingMoney.categories.FOOD)\n  | .id\n'"
    ],
    "utterance": "Return the id of the object with type equal to S, count greater than 20, and minimal value of friday.remainingMoney.categories.FOOD among all objects.",
    "expressions": [
      "map(select(.type == \"S\" and .count > 20)) | min_by(.friday.remainingMoney.categories.FOOD) | .id"
    ],
    "identifier": 72959798,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "am trying to jq sort in descending order by the highest weight, but having trouble with these decimal points, which the sort seems to ignore?",
      "Ideally I just want the full output the objects but correctly sorted descending by weight, i.e. the full data, just sorted.",
      "Since `weight` values are represented by string values (i.e. **without** taking into account the recommendation), it is necessary to perform the `tonumber` conversion to sort them as numeric values.",
      "jq -r 'sort_by(.weight | tonumber) | reverse' input.json",
      "### Output",
      "[ { \"name\": \"car\", \"amount\": \"27\", \"weight\": \"527561695272.42000000\" }, ... ]"
    ],
    "utterance": "Return all objects sorted descending by the highest value of the weight field, where weight is stored as a stringified decimal number.",
    "expressions": [
      "sort_by(.weight | tonumber) | reverse"
    ],
    "data": [
      {
        "input": [
          {
            "name": "train",
            "amount": "4",
            "weight": "89129.70000000"
          },
          {
            "name": "plane",
            "amount": "200",
            "weight": "819002.68900000"
          },
          {
            "name": "car",
            "amount": "27",
            "weight": "527561695272.42000000"
          },
          {
            "name": "bike",
            "amount": "14",
            "weight": "9914795.00000000"
          },
          {
            "name": "truck",
            "amount": "92",
            "weight": "999147.00000000"
          }
        ],
        "output": [
          {
            "name": "car",
            "amount": "27",
            "weight": "527561695272.42000000"
          },
          {
            "name": "bike",
            "amount": "14",
            "weight": "9914795.00000000"
          },
          {
            "name": "truck",
            "amount": "92",
            "weight": "999147.00000000"
          },
          {
            "name": "plane",
            "amount": "200",
            "weight": "819002.68900000"
          },
          {
            "name": "train",
            "amount": "4",
            "weight": "89129.70000000"
          }
        ]
      }
    ],
    "identifier": 72960023,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "What would a good approach be to parse the parameter array json and generate curl API requests respectful of the parameter limit, with the minimum requests?",
      "You can [chunk the array with undocumented `_nwise` function](https://stackoverflow.com/questions/51412721/how-to-split-an-array-into-chunks-with-jq) and then use that, e.g.:",
      "<<JSON jq -r '_nwise(3) | \"/api/endpoint?parameters=\\(.)\"'",
      "[\"1\",\"2\",\"3\",\"4\",\"5\",\"6\",\"7\",\"8\"]",
      "Output:",
      "/api/endpoint?parameters=[\"1\",\"2\",\"3\"]",
      "/api/endpoint?parameters=[\"4\",\"5\",\"6\"]",
      "/api/endpoint?parameters=[\"7\",\"8\"]",
      "<input.json jq -c '_nwise(3)' | while read -r qs; do curl -s -g -XGET \"/api/endpoint?parameters=$qs\"; done"
    ],
    "utterance": "Split an array of parameter values into chunks of 20 and output each chunk as a string for API requests with at most 20 parameters each.",
    "expressions": [
      "_nwise(20) | \"/api/endpoint?parameters=\\(.)\"",
      "_nwise(20)"
    ],
    "data": [
      {
        "input": [
          "1",
          "2",
          "3",
          "4",
          "5",
          "6",
          "7",
          "8"
        ],
        "output": [
          "/api/endpoint?parameters=[\"1\",\"2\",\"3\"]",
          "/api/endpoint?parameters=[\"4\",\"5\",\"6\"]",
          "/api/endpoint?parameters=[\"7\",\"8\"]"
        ]
      }
    ],
    "identifier": 72963043,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Here I am trying to fetch specific data like `\"path\"` with condition like if its a `\"type\": \"docker-image\"` under `\"dependencies\"` then fetch it.",
      "So how to fetch all the `\"path\"` having `\"type\"` as `\"docker-image\"` under `\"dependencies\"` from the **packages.json** file and keep it in a file.",
      "Use `select` to filter an array:",
      "jq -r '.dependencies[] | select(.type == \"docker-image\").path' packages.json"
    ],
    "utterance": "Fetch all path values from dependencies where type is docker-image.",
    "expressions": [
      ".dependencies[] | select(.type == \"docker-image\").path",
      ".dependencies | map(select(.type == \"docker-image\").path)[]"
    ],
    "data": [
      {
        "input": {
          "dependencies": [
            {
              "name": "powershell-core",
              "type": "zip",
              "path": "Dependencies",
              "filename": "PowerShell-Core-6.2.3-win-x64.msi"
            },
            {
              "name": "redis",
              "type": "docker-image",
              "path": "redis:6.0.10-alpine3.12",
              "filename": ""
            },
            {
              "name": "keycloak",
              "type": "docker-image",
              "path": "jboss/keycloak:12.0.1",
              "filename": ""
            }
          ]
        },
        "output": [
          "redis:6.0.10-alpine3.12",
          "jboss/keycloak:12.0.1"
        ]
      }
    ],
    "identifier": 72964439,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a large lists of arrays in my json file and am needing to do some sort | uniq -c types of stuff on them. Specifically I have a relatively nasty looking fruit array that needs to break down what is inside.",
      "I can't get the most basic part of counting the unique values per that fruit array, and especially not without breaking the rest of the content",
      "I'd like to turn this;",
      "[{ \"uid\": \"123abc\", ... \"fruit\": [ \"Kiwi\",\"Apple\",\"\",\"\",\"Kiwi\",\"\",\"Kiwi\",\"\",\"\",\"Mango\",\"Kiwi\" ] }, ...]",
      "Into this;",
      "[{ \"uid\": \"123abc\", \"tID\": [\"T19\"], \"metadata\": [ { \"name\": \"fruit\", \"value\": \"Kiwi - 3\" }, { \"name\": \"fruit\", \"value\": \"Mango - 1\" }, { \"name\": \"fruit\", \"value\": \"Apple - 1\" } ] }, ...]",
      "Using `group_by` and `length` would be one way:",
      "jq '\n  map(with_entries(select(.key == \"fruit\") |= (\n    .value |= (group_by(.) | map(\n      {name: \"fruit\", value: \"\\(.[0] | select(. != \"\")) - \\(length)\"}\n    ))\n    | .key = \"metadata\"\n  )))\n'"
    ],
    "utterance": "For each object, count each unique non-empty value in the fruit array and create a metadata array with entries showing the name \"fruit\" and the value as \"<FruitName> - <Count>\".",
    "expressions": [
      "map(with_entries(select(.key == \"fruit\") |= (\n    .value |= (group_by(.) | map(\n      {name: \"fruit\", value: \"\\(.[0] | select(. != \"\")) - \\(length)\"}\n    ))\n    | .key = \"metadata\"\n  )))"
    ],
    "data": [
      {
        "input": [
          {
            "uid": "123abc",
            "tID": [
              "T19"
            ],
            "fruit": [
              "Kiwi",
              "Apple",
              "",
              "",
              "",
              "Kiwi",
              "",
              "Kiwi",
              "",
              "",
              "Mango",
              "Kiwi"
            ]
          },
          {
            "uid": "456xyz",
            "tID": [
              "T15"
            ],
            "fruit": [
              "",
              "Orange"
            ]
          }
        ],
        "output": [
          {
            "uid": "123abc",
            "tID": [
              "T19"
            ],
            "metadata": [
              {
                "name": "fruit",
                "value": "Apple - 1"
              },
              {
                "name": "fruit",
                "value": "Kiwi - 4"
              },
              {
                "name": "fruit",
                "value": "Mango - 1"
              }
            ]
          },
          {
            "uid": "456xyz",
            "tID": [
              "T15"
            ],
            "metadata": [
              {
                "name": "fruit",
                "value": "Orange - 1"
              }
            ]
          }
        ]
      }
    ],
    "identifier": 72971154,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "and I want to get all the paths that contain `\"ID\": \"something\"`:",
      "It should be possible to do with `getpath/1`. Collect the paths matching your condition and extract the value at each path to see if matches your condition",
      "( paths | select(.[-1] == \"ID\")) as $p |   if getpath($p) == \"something\" then ( $p | join(\".\") ) else empty end",
      "Another approach would be using the `--stream` option (or the `tostream` function), and check both criteria at once:",
      "jq -r --stream 'select(.[0][-1] == \"ID\" and .[1] == \"something\")[0] | join(\".\")'"
    ],
    "utterance": "List every path where the last key is ID and its value is something, outputting the dot-separated path.",
    "expressions": [
      "(paths | select(.[-1] == \"ID\")) as $p | if getpath($p) == \"something\" then ($p | join(\".\")) else empty end",
      "--stream 'select(.[0][-1] == \"ID\" and .[1] == \"something\")[0] | join(\".\")'"
    ],
    "data": [
      {
        "input": {
          "A": {
            "ID": "something"
          },
          "B": {
            "ID": "something-else"
          },
          "C": {
            "ID": "something"
          },
          "D": {
            "E": {
              "ID": "something"
            }
          }
        },
        "output": [
          "A.ID",
          "C.ID",
          "D.E.ID"
        ]
      }
    ],
    "identifier": 72977106,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to use the key names I get with that to iterate over the nested info for each one, for example:",
      "how can I do this? I just found questions to iterate but ignoring which key name it has and in this case I need to have that info inside the loop.",
      "you can do the iteration inside `jq` using variables with the `as` notation as in `keys[] as $i | \u2026`. For example:",
      "jq -r '\n  keys[] as $i\n  | \"Instance \\($i): Serial Number is: \\(.[$i].instance_ec2_tags.Serial_number)\"\n'"
    ],
    "utterance": "For each top-level key, display the key name and the Serial_number in its instance_ec2_tags object.",
    "expressions": [
      "keys[] as $i | \"Instance \\($i): Serial Number is: \\(.[$i].instance_ec2_tags.Serial_number)\""
    ],
    "data": [
      {
        "input": {
          "instance-01": {
            "instance_aws_az": "eu-west-1a",
            "instance_aws_dns_name": "<ip>eu-west-1.compute.amazonaws.com",
            "instance_ec2_tags": {
              "Name": "2ed97a79c27be6bca883",
              "Serial_number": "22000528"
            },
            "instance_sg_groups": []
          },
          "instance-29": {
            "instance_aws_az": "eu-west-1a",
            "instance_aws_dns_name": "<ip>eu-west-1.compute.amazonaws.com",
            "instance_ec2_tags": {
              "Name": "2ed97a79c27be6bca773",
              "Serial_number": "22000578"
            },
            "instance_sg_groups": []
          }
        },
        "output": [
          "Instance instance-01: Serial Number is: 22000528",
          "Instance instance-29: Serial Number is: 22000578"
        ]
      }
    ],
    "identifier": 72978043,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Essentially, I want to retrieve the **ID** of an employee above age *25* that is on the *Payments* team that has spent the *most* **years** working at their **primary** office **in-state** and is *open* to remote work.",
      "I'm missing the final command which ties everything together which will basically run the second command for each id returned by the first command, and then filter the output by checking whether that employee is open to work and has the most years spent working at their in-state primary state compared to any other employee in the list, and then return that employee's ID at the end.",
      "You could do all of this in just one jq call. Load one file with `-s` or `--slurp` (as you already do), and the other one with `--slurpfile` to have it accessible as a variable. Then, use `JOIN` to combine by their IDs (one is a number, the other one a string, so use `tostring` or string interpolation `\"\\()\"` to convert). Finally, employ `max_by` to filter by your \"most years\" criteria, and output the ID:",
      "zcat employees.gz | jq --slurpfile a <(zcat addresses.gz) -s '\n  map(select(.age > 25 and .team == \"Payments\"))\n  | JOIN(INDEX($a[]; .id); \"\\(.id)\")\n  | map(.[1] | select(.openToRemoteWork == \"yes\"))\n  | max_by(.workHistoryInState.primaryOffice).id\n'"
    ],
    "utterance": "Find the ID of the employee over age 25 on the Payments team who is open to remote work and has the most years spent at their in-state primary office.",
    "expressions": [
      "map(select(.age > 25 and .team == \"Payments\"))\n| JOIN(INDEX($a[]; .id); \"\\(.id)\")\n| map(.[1] | select(.openToRemoteWork == \"yes\"))\n| max_by(.workHistoryInState.primaryOffice.address.YEARSSPENT).id"
    ],
    "data": [
      {
        "input": {
          "employees": [
            {
              "name": "Billy",
              "age": 43,
              "team": "Payments",
              "id": 12345
            },
            {
              "name": "Matt",
              "age": 29,
              "team": "Payments",
              "id": 54321
            },
            {
              "name": "Jose",
              "age": 23,
              "team": "Data Center",
              "id": 89110
            }
          ],
          "addresses": [
            {
              "id": "12345",
              "name": "Billy",
              "age": 43,
              "workHistoryInState": {
                "primaryOffice": {
                  "address": {
                    "STREET": "1329 Prescott",
                    "STATE": "CA",
                    "ZIP": 97780,
                    "YEARSSPENT": 19
                  }
                }
              },
              "openToRemoteWork": "no"
            },
            {
              "id": "54321",
              "name": "Matt",
              "count": 29,
              "workHistoryInState": {
                "primaryOffice": {
                  "address": {
                    "STREET": "920 Rainier",
                    "STATE": "WA",
                    "ZIP": 98004,
                    "YEARSSPENT": 3
                  }
                }
              },
              "openToRemoteWork": "yes"
            }
          ]
        },
        "output": "54321"
      }
    ],
    "identifier": 72979374,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "for an array of arrays with 2 elements as strings, how would jq multiply element 1 * element 2 in each array? and from that generate a new array...",
      "and for that sum the total across all?",
      ".pricexquant|map(map(tonumber)|.[0] * .[1])|add",
      "yields",
      "379.5797119"
    ],
    "utterance": "Sum all products of the pairs in the 'pricexquant' array, where each pair contains two numeric strings to be multiplied.",
    "expressions": [
      ".pricexquant|map(map(tonumber)|.[0] * .[1])|add"
    ],
    "data": [
      {
        "input": {
          "pricexquant": [
            [
              "20233.54000000",
              "0.00877000"
            ],
            [
              "20233.39000000",
              "0.00999000"
            ]
          ]
        },
        "output": 379.5797119
      }
    ],
    "identifier": 72988688,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "which i need to generate a json list of all the iterated api query results in json format, for which i can run some other queries with jq, like sorting etc.",
      "but this is currently just outputting a bunch of these objects one after the other, without comma separation.",
      "what's a better approach here? each result straight into a json array? post processing after the fact? or?",
      "for p in ${param} ; \n    do\n        curl -s -g GET '/api/query?param='[${p}]\n    done | jq -s ."
    ],
    "utterance": "Combine multiple object results from several API queries into a single array, ensuring the output is a properly formatted list suitable for further processing such as sorting.",
    "expressions": [
      "jq -s ."
    ],
    "identifier": 72989210,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input format (e.g., are the objects on separate lines, in separate files, or in a single array?), nor does it provide example input or output, making it unclear how to perform the transformation.",
      "other_environment": null
    }
  },
  {
    "context": [
      "My script processes ~30 lines per second and uses just one CPU core.",
      "I have 16 (or more, if makes sense) cores (vCPUs on GCP) and want to run this process in parallel. I know, hadoop is the way to go. But it's a one-time thing, how do I speed up the process to ~600 lines per second simply?",
      "Lines ordering is not important.",
      "parallel -j16 --spreadstdin '(transformation)' < input.json > output.json",
      "parallel -j0 --spreadstdin '(transformation)' < input.json > output.json"
    ],
    "utterance": "Process each line from a large newline-delimited input, applying a transformation using multiple CPU cores to achieve higher throughput without preserving input order.",
    "expressions": [
      "parallel -j16 --spreadstdin 'jq -c {some-tansfomration-logic}' < input.json > output.json",
      "parallel -j0 --spreadstdin 'jq -c {some-tansfomration-logic}' < input.json > output.json"
    ],
    "identifier": 72990490,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": true,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "From this Json file I am fetching the value with the type as `docker-image` and redirecting the values to a file `output.txt`",
      "jq -r '.dependencies[] | select(.type == \"docker-image\").path' packages.json > output.txt",
      "But I am trying to append this output to a file like: \n\ndocker-virtual.artifactory.com/redis:6.0.10-alpine3.12",
      "How about:\n\n    jq -r '.dependencies[] | select(.type == \"docker-image\").path' packages.json |  sed 's/.*/docker-virtual.artifactory.com\\/&/' > output.txt"
    ],
    "utterance": "Extract all entries with type equal to docker-image and prepend docker-virtual.artifactory.com/ to their path values.",
    "expressions": [
      ".dependencies[] | select(.type == \"docker-image\") | \"docker-virtual.artifactory.com/\" + .path"
    ],
    "data": [
      {
        "input": {
          "dependencies": [
            {
              "name": "powershell-core",
              "type": "zip",
              "path": "Dependencies",
              "filename": "PowerShell-Core-6.2.3-win-x64.msi"
            },
            {
              "name": "redis",
              "type": "docker-image",
              "path": "redis:6.0.10-alpine3.12",
              "filename": ""
            },
            {
              "name": "keycloak",
              "type": "docker-image",
              "path": "jboss/keycloak:12.0.1",
              "filename": ""
            }
          ]
        },
        "output": [
          "docker-virtual.artifactory.com/redis:6.0.10-alpine3.12",
          "docker-virtual.artifactory.com/jboss/keycloak:12.0.1"
        ]
      }
    ],
    "identifier": 72992497,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Goal is a CSV with:",
      "en,2275.28",
      "fa,327.14",
      "...",
      "cat cv-corpus-10.0-2022-07-04.json| jq -r '.locales | keys[] as $k | \"\\($k),\\(.[$k] | .validHrs)\"'",
      "$ jq -r '.locales | to_entries[] | \"\\(.key),\\(.value.validHrs)\"' cv.json"
    ],
    "utterance": "Output a CSV with each language code and its validHrs value from all locales.",
    "expressions": [
      ".locales | keys[] as $k | \"\u0000($k),\u0000(.[$k] | .validHrs)\"",
      ".locales | to_entries[] | \"\u0000(.key),\u0000(.value.validHrs)\""
    ],
    "data": [
      {
        "input": {
          "locales": {
            "en": {
              "validHrs": 2275.28
            },
            "fa": {
              "validHrs": 327.14
            }
          }
        },
        "output": [
          "en,2275.28",
          "fa,327.14"
        ]
      }
    ],
    "identifier": 72994933,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How to merge two json streams/lists into a single stream of jsons like the following.",
      "file 1:\n```\n{\"key1\": 1}\n{\"key1\": 2}\n{\"key1\": 3}\n\n```",
      "file 2:\n```\n{\"key2\": -1}\n{\"key2\": -2}\n{\"key2\": -3}\n\n```",
      "Expected output:\n\n```\n{\"key1\": 1, \"key2\": -1}\n{\"key1\": 2, \"key2\": -2}\n{\"key1\": 3, \"key2\": -3}\n```",
      "Here is a \"streaming\" solution that is more efficient (both memory-wise and otherwise) than one that requires that both files be read in their entirety before producing any output:",
      "< file1.json jq -nc --slurpfile file2 file2.json '\n  # For each item $s in the stream s,\n  # emit [$s, $t] where $t is the corresponding item in the input array\n  def zip(s):\n    . as $in\n    | foreach s as $s (-1; .+1; [$s, $in[.]]);\n\n  $file2 | zip(inputs) | add\n'",
      "If your jq does not support the `--slurpfile` command-line option, then use `--argfile` instead.",
      "As a one-liner\n\n    foreach inputs as $s (-1; .+1; $s + $file2[.])"
    ],
    "utterance": "Combine objects from two files element-wise so each result merges the nth object from file 1 with the nth object from file 2.",
    "expressions": [
      "$file2 | zip(inputs) | add",
      "foreach inputs as $s (-1; .+1; $s + $file2[.])"
    ],
    "data": [
      {
        "input": [
          "{\"key1\": 1}",
          "{\"key1\": 2}",
          "{\"key1\": 3}"
        ],
        "output": [
          "{\"key1\": 1, \"key2\": -1}",
          "{\"key1\": 2, \"key2\": -2}",
          "{\"key1\": 3, \"key2\": -3}"
        ]
      }
    ],
    "identifier": 73000167,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to get the env1 by using height and weight as key\nso the output can look like:\n\nenv1:height:150\nenv1:weight:80\nenv2:height:160\nenv2:weight:70\nenv3:height:xxx\n.",
      "My shell command jq .env1.height... id.json tried can only get the output by using env1, env2 as key, but it cannot handle env3, env4.",
      "Using your data as initially presented, the following jq program:\n\n    keys_unsorted[] as $k \n    | select($k|startswith(\"env\"))\n    | .[$k] | to_entries[]\n    | select(.key|IN(\"height\",\"weight\"))\n    | [$k, .key, .value]\n    | join(\":\")\n\nproduces\n\nenv1:height:150\nenv1:weight:80\nenv2:height:160\nenv2:weight:70",
      "According to one interpretation of the supplementary question,\na solution would be:\n```\nkeys_unsorted[] as $k \n| .[$k]\n| objects\n| select(.height and .weight)\n| to_entries[]\n| select(.key|IN(\"height\",\"weight\"))\n| [$k, .key, .value]\n| join(\":\")\n```"
    ],
    "utterance": "List all environment keys whose values have height and weight fields, outputting lines in the format envKey:height:value and envKey:weight:value.",
    "expressions": [
      "keys_unsorted[] as $k | select($k|startswith(\"env\")) | .[$k] | to_entries[] | select(.key|IN(\"height\",\"weight\")) | [$k, .key, .value] | join(\":\")",
      "keys_unsorted[] as $k | .[$k] | objects | select(.height and .weight) | to_entries[] | select(.key|IN(\"height\",\"weight\")) | [$k, .key, .value] | join(\":\")",
      "[\"height\",\"weight\"] as $hw | keys_unsorted[] as $k | .[$k] | objects | . as $object | select(all($hw[]; $object[.])) | $hw[] | [$k, ., $object[.]] | join(\":\")"
    ],
    "data": [
      {
        "input": {
          "name": "peter",
          "path": "desktop/name",
          "description": "male",
          "env1": {
            "school": "AAA",
            "height": "150",
            "weight": "80"
          },
          "env2": {
            "school": "BBB",
            "height": "160",
            "weight": "70"
          }
        },
        "output": [
          "env1:height:150",
          "env1:weight:80",
          "env2:height:160",
          "env2:weight:70"
        ]
      }
    ],
    "identifier": 73013675,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "1. I want to find the `parent1key` and `parent2key` and if found one or both, then replace them with a variable that&#39;s generated during runtime. If none of the key: values are found just ignore it rather than troughing up an error.",
      "`jq --arg parent1keyarg $MyVAL1 --arg parent2keyarg $MyVAL2 ...`",
      "walk(if type==\"object\"\n     then if has(\"parent1key\") then .parent1key = $a\n          else . end \n          | if has(\"parent2key\") then .parent2key = $b\n            else . end\n     else . end)",
      "def replace(k; v):\n   walk(if type == \"object\" and has(k) then .[k] = v else . end);",
      "replace(\"parent1key\"; \"foo\") | replace(\"parent2key\"; \"bar\")"
    ],
    "utterance": "Replace all occurrences of parent1key and parent2key with specified variables if they exist; ignore if not found.",
    "expressions": [
      "walk(if type==\"object\" then if has(\"parent1key\") then .parent1key = $parent1keyarg else . end | if has(\"parent2key\") then .parent2key = $parent2keyarg else . end else . end)",
      "def replace(k; v): walk(if type == \"object\" and has(k) then .[k] = v else . end); replace(\"parent1key\"; $parent1keyarg) | replace(\"parent2key\"; $parent2keyarg)",
      "( .parent1.parent1key | select( . ) ) |= $parent1keyarg | ( .parent2.parent2key | select( . ) ) |= $parent2keyarg"
    ],
    "data": [
      {
        "input": {
          "parent1": {
            "parent1key": "parent1value"
          },
          "parent2": {
            "parent2key": "parent2value"
          }
        }
      }
    ],
    "identifier": 73018197,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to write a script to see if `AGREEMENT_OWNING_ORG` contains `ABC` and `WAREHOUSE` contains `XYZ`",
      "Please, please, don't use `grep` for such tasks.  Rather, consider something along these lines:",
      "< jk_raw jq '.. | objects | select(has(\"java.util.HashMap\")).\"java.util.HashMap\"\n   | (.AGREEMENT_OWNING_ORG | test(\"ABC\")) and (.WAREHOUSE | test(\"XYZ\"))'"
    ],
    "utterance": "Check whether AGREEMENT_OWNING_ORG contains ABC and WAREHOUSE contains XYZ in an object where both fields are under the key java.util.HashMap, which itself is nested under map and com.mytest.map.",
    "expressions": [
      ".. | objects | select(has(\"java.util.HashMap\")).\"java.util.HashMap\" | (.AGREEMENT_OWNING_ORG | test(\"ABC\")) and (.WAREHOUSE | test(\"XYZ\"))"
    ],
    "data": [
      {
        "input": {
          "fundingMoved": {
            "com.mytest.map": {
              "map": {
                "java.util.HashMap": {
                  "AGREEMENT_OWNING_ORG": "ABC 22",
                  "WAREHOUSE": "XYZ 6"
                }
              }
            }
          }
        },
        "output": true
      }
    ],
    "identifier": 73028125,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to get the following results: \r\n\r\n    \"Rolo\"\r\n    \"Ernest\"\r\n    \"Juls\"",
      ".users is an array, so after fixing the obvious errors, you&#39;ll find:\r\n\r\n    jq '.users[].Firstname'\r\n\r\nproduces the results you expect.",
      "This command then works:\r\n```bash\r\ncat test.json | jq '.users[].Firstname'\r\n```"
    ],
    "utterance": "Extract all Firstname values from the users array at the top level.",
    "expressions": [
      ".users[].Firstname"
    ],
    "data": [
      {
        "input": {
          "users": [
            {
              "Firstname": "Rolo",
              "Lastname": "Car",
              "Hobbies": [
                "Metalcore",
                "concerts",
                "beach"
              ],
              "friends": [
                {
                  "Firstname": "Nitsuki",
                  "Lastname": "Saixx"
                },
                {
                  "Firstname": "Confesora",
                  "Lastname": "Kahlan"
                }
              ]
            },
            {
              "Firstname": "Ernest",
              "Lastname": "Vast",
              "Hobbies": [
                "Barista",
                "Strategy games",
                "RPG games"
              ],
              "friends": [
                {
                  "Firstname": "Diana",
                  "Lastname": "Vast"
                },
                {
                  "Firstname": "Pat",
                  "Lastname": "Vanberg"
                }
              ]
            },
            {
              "Firstname": "Juls",
              "Lastname": "Santana",
              "Hobbies": [
                "Sew",
                "Watch Gossip shows",
                "Foodie"
              ],
              "friends": [
                {
                  "Firstname": "Laura",
                  "Lastname": "Reed"
                },
                {
                  "Firstname": "Sandy",
                  "Lastname": "Vast"
                }
              ]
            }
          ]
        },
        "output": [
          "Rolo",
          "Ernest",
          "Juls"
        ]
      }
    ],
    "identifier": 73029136,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "if state is not free then display nodename : {comment = \"Why is the node down\"}",
      "jq '.nodes | map_values(select(.state != \"free\") | {comment})'"
    ],
    "utterance": "For each node whose state is not 'free', display the nodename as the key with a value containing only its comment.",
    "expressions": [
      ".nodes | map_values(select(.state != \"free\") | {comment})"
    ],
    "data": [
      {
        "input": {
          "timestamp": 1658186185,
          "nodes": {
            "x3006c0s13b1n0": {
              "Mom": "x3006c0s13b1n0.hsn.cm",
              "Port": 15002,
              "state": "free",
              "pcpus": 64,
              "resources_available": {
                "arch": "linux",
                "gputype": "A100",
                "host": "x3006c0s13b1n0",
                "mem": "527672488kb",
                "ncpus": 64,
                "ngpus": 4,
                "system": "polaris",
                "tier0": "x3006-g1",
                "tier1": "g1",
                "vnode": "x3006c0s13b1n0"
              },
              "resources_assigned": {},
              "comment": "CHC- Offlined due to node health check failure",
              "resv_enable": "True",
              "sharing": "default_shared",
              "license": "l",
              "last_state_change_time": 1658175652,
              "last_used_time": 1658175652
            },
            "x3006c0s13b1n1": {
              "state": "down",
              "comment": "Maintenance"
            }
          }
        },
        "output": {
          "x3006c0s13b1n1": {
            "comment": "Maintenance"
          }
        }
      }
    ],
    "identifier": 73029872,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  }
]