[
  {
    "context": [
      "I have a json file with a sub-array and I want to get each sub-array with the same id in separate lines.",
      "I try to get:\r\n123456789, yes , null, \"2021-01-16 11:32:25.922\"\r\n123456789, null, null, \"2021-01-16 11:32:38.179\"\r\n987654321, yes, null, \"2021-01-01 08:17:54.055\"\r\n987654321, something, \"meta entry\", \"2021-01-01 08:18:11.028\"\r\n987654321, no, null, \"2021-01-01 08:18:24.057\"",
      "For each element of the `result` array, create an object with just the `id` field, and for each element of the `snippets` sub-array, add it:",
      ".result[] | {id} + (.snippets[] | {transcipt, matched_entry, start})"
    ],
    "utterance": "For each item, output a line with id, transcript, matched_entry, and start from every snippet, grouped by the same id.",
    "expressions": [
      ".result[] | {id} + (.snippets[] | {transcript, matched_entry, start})"
    ],
    "data": [
      {
        "input": {
          "success": true,
          "status": {
            "http": {
              "code": 200,
              "message": "OK"
            }
          },
          "result": [
            {
              "id": "123456789",
              "start_date": "2021-01-01 08:17:39.989",
              "snippets": [
                {
                  "transcript": "yes",
                  "matched_entry": null,
                  "start": "2021-01-16 11:32:25.922"
                },
                {
                  "transcript": null,
                  "matched_entry": null,
                  "start": "2021-01-16 11:32:38.179"
                }
              ]
            },
            {
              "id": "987654321",
              "start_date": "2021-01-01 08:17:39.989",
              "duration_total": 301,
              "snippets": [
                {
                  "transcript": "yes",
                  "matched_entry": null,
                  "start": "2021-01-01 08:17:54.055"
                },
                {
                  "transcript": "something",
                  "matched_entry": " meta entry",
                  "start": "2021-01-01 08:18:11.028"
                },
                {
                  "transcript": "no",
                  "matched_entry": null,
                  "start": "2021-01-01 08:18:24.057"
                }
              ]
            }
          ]
        },
        "output": [
          {
            "id": "123456789",
            "transcript": "yes",
            "matched_entry": null,
            "start": "2021-01-16 11:32:25.922"
          },
          {
            "id": "123456789",
            "transcript": null,
            "matched_entry": null,
            "start": "2021-01-16 11:32:38.179"
          },
          {
            "id": "987654321",
            "transcript": "yes",
            "matched_entry": null,
            "start": "2021-01-01 08:17:54.055"
          },
          {
            "id": "987654321",
            "transcript": "something",
            "matched_entry": " meta entry",
            "start": "2021-01-01 08:18:11.028"
          },
          {
            "id": "987654321",
            "transcript": "no",
            "matched_entry": null,
            "start": "2021-01-01 08:18:24.057"
          }
        ]
      }
    ],
    "identifier": 69372441,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "GET \"/_cat/indices?h=docs.count&format=json\" | jq `.[].\"docs.count\"`",
      "The result I get is like the following:\n\n```\n\"188\"\n\"363\"\n\"346224\"\n```",
      "I would like to summarize these ones into one big number but I only get some kind of fault since they are not numbers.",
      "You can tell `jq` to do the maths for you:\n\n    jq '[ .[].docs.count | tonumber ] | add'"
    ],
    "utterance": "Sum all values of the docs.count field, which are returned as strings, and output the total.",
    "expressions": [
      "[ .[].docs.count | tonumber ] | add"
    ],
    "data": [
      {
        "input": [
          {
            "docs.count": "188"
          },
          {
            "docs.count": "363"
          },
          {
            "docs.count": "346224"
          }
        ],
        "output": 346775
      }
    ],
    "identifier": 69375872,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Specifically: I would like to delete \"type\": \"Link\" and \"linkType\": \"Entry\" everywhere (in all parts of the hierarchy, where ever they appear).",
      "Ideal result:\n\n    {\n      \"entries\": [\n        {\n          \"sys\": {\n            \"id\": \"vcLKKhJ3mZNfGMvVZZi07\",\n            \"contentType\": {\n              \"sys\": {\n                \"id\": \"page\"\n              }\n            }\n          },\n          \"fields\": {\n            \"title\": {\n              \"de-DE\": \"Startseite\",\n              \"en-US\": \"Home\"\n            },\n            \"description\": {\n              \"en-US\": \"foo\"\n            },\n            \"keywords\": {\n              \"en-US\": \"bar\"\n            },\n            \"stageModules\": {\n              \"en-US\": [\n                {\n                  \"sys\": {\n                    \"id\": \"11AfBBuNK8bx3EygAS3WTY\"\n                  }\n                }\n              ]\n            }\n          }\n        }\n      ]\n    }",
      "Use this to recursively go through all items, check your conditions and use `del` if necessary:\n\n    walk(\n      if type == \"object\" and .type == \"Link\" then del(.type) else . end\n    )",
      "Edit: If \"matching is less important\", as you state, then just use `del` on any object:\n\n    walk(if type == \"object\" then del(.type, .linkType) else . end)"
    ],
    "utterance": "Remove all key-value pairs where the key is \"type\" with value \"Link\" and key is \"linkType\" with value \"Entry\" at any depth in the data structure.",
    "expressions": [
      "walk(if type == \"object\" and .type == \"Link\" then del(.type) else . end)",
      "walk(if type == \"object\" then del(.type, .linkType) else . end)"
    ],
    "data": [
      {
        "input": {
          "entries": [
            {
              "sys": {
                "id": "vcLKKhJ3mZNfGMvVZZi07",
                "contentType": {
                  "sys": {
                    "id": "page"
                  }
                }
              },
              "fields": {
                "title": {
                  "de-DE": "Startseite",
                  "en-US": "Home"
                },
                "description": {
                  "en-US": "foo"
                },
                "keywords": {
                  "en-US": "bar"
                },
                "stageModules": {
                  "en-US": [
                    {
                      "sys": {
                        "type": "Link",
                        "linkType": "Entry",
                        "id": "11AfBBuNK8bx3EygAS3WTY"
                      }
                    }
                  ]
                }
              }
            }
          ]
        },
        "output": {
          "entries": [
            {
              "sys": {
                "id": "vcLKKhJ3mZNfGMvVZZi07",
                "contentType": {
                  "sys": {
                    "id": "page"
                  }
                }
              },
              "fields": {
                "title": {
                  "de-DE": "Startseite",
                  "en-US": "Home"
                },
                "description": {
                  "en-US": "foo"
                },
                "keywords": {
                  "en-US": "bar"
                },
                "stageModules": {
                  "en-US": [
                    {
                      "sys": {
                        "id": "11AfBBuNK8bx3EygAS3WTY"
                      }
                    }
                  ]
                }
              }
            }
          ]
        }
      }
    ],
    "identifier": 69378337,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Anyone know how I can also display one of the tags by key name, let's say I also want the Status tag displayed on the same line as the Id and SubnetId.",
      "jq '.Subnets[] | \"\\(.Id) \\(.SubnetId) \\(.Tags | from_entries | .Status)\"' subnets.json"
    ],
    "utterance": "For each subnet in the list, output the Id, SubnetId, and the Value associated with the tag where Key is Status, on the same line.",
    "expressions": [
      ".Subnets[] | \"\\(.Id) \\(.SubnetId) \\(.Tags | from_entries | .Status)\""
    ],
    "data": [
      {
        "input": {
          "Subnets": [
            {
              "SubnetId": "foo1",
              "Id": "bar1",
              "Tags": [
                {
                  "Key": "Name",
                  "Value": "foo"
                },
                {
                  "Key": "Status",
                  "Value": "dev"
                }
              ]
            },
            {
              "SubnetId": "foo2",
              "Id": "bar2",
              "Tags": [
                {
                  "Key": "Name",
                  "Value": "foo"
                },
                {
                  "Key": "Status",
                  "Value": "dev"
                }
              ]
            }
          ]
        },
        "output": [
          "bar1 foo1 dev",
          "bar2 foo2 dev"
        ]
      }
    ],
    "identifier": 69379954,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'm trying to select comments where the attachments are not empty.",
      "I need the parent created_at for disambiguation.",
      ".comments[] | .created_at as $date | .attachments[] | {id, $date, name: .file_name, url: .content_url, size}"
    ],
    "utterance": "Select each attachment from comments along with its parent comment's created_at, only for comments that have non-empty attachments.",
    "expressions": [
      ".comments[] | .created_at as $date | .attachments[] | {id, date: $date, name: .file_name, url: .content_url, size}"
    ],
    "identifier": 69380073,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "But it fails because of `join(\",\")`. I want to force `jq` to always return members in an array. Is this possible?",
      "if type == \"string\" then [.] else . end | join(\",\")",
      "Which will ensure `members` is always an array whether there is one instance or more than one."
    ],
    "utterance": "Ensure that the 'members' field is always treated as an array, even when there is only a single value.",
    "expressions": [
      "if type == \"string\" then [.] else . end | join(\",\")"
    ],
    "data": [
      {
        "input": {
          "Package": {
            "@xmlns": "http://soap.sforce.com/2006/04/metadata",
            "types": {
              "members": "UserTriggerHandasdler_Test",
              "name": "ApexClass"
            },
            "version": "52.0"
          }
        },
        "output": "UserTriggerHandasdler_Test"
      },
      {
        "input": {
          "Package": {
            "@xmlns": "http://soap.sforce.com/2006/04/metadata",
            "types": {
              "members": [
                "UserTriggerHandler_Test",
                "UserTriggerHandasdler_Test"
              ],
              "name": "ApexClass"
            },
            "version": "52.0"
          }
        },
        "output": "UserTriggerHandler_Test,UserTriggerHandasdler_Test"
      }
    ],
    "identifier": 69380683,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task is missing a clear description of the expected output structure: should the output always be an array, or a comma-separated string, or something else? The examples are inconsistent (one is a string, the other is a comma-separated string), so the transformation is underspecified.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to extract the .content element along with other things, but only when the fieldTag associated with it is \"v\". Only some records contain a fieldTag \"v\".",
      "When I try to parse using (.varFields[] |select(.fieldTag==\"v\") | \"\\(.content)\") // \"\" it works fine so long as v is present. However, when it is not present, I get jq: error (at <stdin>:353953): Cannot iterate over null (null)",
      "Take a look at the [error suppression operator ?][1] that works a bit like the new ?. nullable chaining operator in Javascript.",
      "The ? operator, used as EXP?, is shorthand for try EXP."
    ],
    "utterance": "Return the content value from an array element where fieldTag is \"v\", and omit or produce null if no such element exists, without causing an error when missing.",
    "expressions": [
      ".varFields?[]? | select(.fieldTag == \"v\") | .content"
    ],
    "data": [
      {
        "input": {
          "id": "1000001",
          "updatedDate": "2018-12-21T01:52:00Z",
          "createdDate": "1993-11-30T02:59:25Z",
          "varFields": [
            {
              "fieldTag": "b",
              "content": "1000679727"
            },
            {
              "fieldTag": "v",
              "content": "v.1"
            }
          ]
        },
        "output": "v.1"
      },
      {
        "input": {
          "id": "1000002",
          "updatedDate": "2018-12-22T01:52:00Z",
          "createdDate": "1994-11-30T02:59:25Z",
          "varFields": [
            {
              "fieldTag": "b",
              "content": "1000679728"
            }
          ]
        },
        "output": null
      }
    ],
    "identifier": 69384375,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "jq \\\n    --arg pet \"$pet\" \\\n    --arg name \"$name\" \\\n    --arg toy1 \"$toy1\" \\\n    --arg toy2 \"$toy2\" \\\n    --arg toy3 \"$toy3\" \\\n    '.pets[$pet] += {\n        PetInfo: {name: $name},\n        toys: [\"\\($toy1)-postfix\", \"\\($toy2)-postfix\", \"\\($toy3)-postfix\"]\n    }' \\\n    myfile.json",
      "jq --arg pet \"${PET}\" \\\n  --arg name \"${NAME}\" \\\n  --arg toy1 \"${toy1}-postfix\" \\\n  --arg toy2 \"${toy2}-postfix\" \\\n  --arg toy3 \"${toy3}-postfix\" \\\n  '\n    \n    .pets[$pet] += {PetInfo: {$name, toys: [$toy1,$toy2,$toy3]}}\n    \n  ' myfile.json"
    ],
    "utterance": "Add or update the pets object by inserting or updating a key using the value of a shell variable, and set PetInfo.name and a toys array derived from additional variables that may include a suffix.",
    "expressions": [
      ".pets[$pet] += {PetInfo: {name: $name}, toys: [\"\u0002($toy1)-postfix\", \"\u0002($toy2)-postfix\", \"\u0002($toy3)-postfix\"]}",
      ".pets[$pet] += {PetInfo: {$name, toys: [$toy1,$toy2,$toy3]}}"
    ],
    "data": [
      {
        "input": {
          "pets": {
            "some-new-pet": {
              "PetInfo": {
                "name": "my-brand-new-pet",
                "toys": [
                  "toy1-postfix",
                  "toy2-postfix",
                  "toy3-postfix"
                ]
              }
            }
          }
        },
        "output": {
          "pets": {
            "some-new-pet": {
              "PetInfo": {
                "name": "my-brand-new-pet",
                "toys": [
                  "toy1-postfix",
                  "toy2-postfix",
                  "toy3-postfix"
                ]
              }
            },
            "some-second-pet": {
              "PetInfo": {
                "name": "my-even-newer-pet"
              },
              "toys": [
                "toy1-postfix",
                "toy2-postfix",
                "toy3-postfix"
              ]
            }
          }
        }
      }
    ],
    "identifier": 69407077,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "So the problem is that `jq` formats its output on multiple lines by default. A simple fix would be to pass `--compact-output / -c` to `jq` so that your JSON string fits on a single line.",
      "I get an error on the following step,",
      "I am trying to create a JSON string that I can use in a later step."
    ],
    "utterance": "Generate a one-line JSON string with keys image=\"foo\", region=\"us-west-2\", secret=\"mysecret\", env=\"prod\" using object structure with flask, for use in an environment variable.",
    "expressions": [
      "jq -n --compact-output --arg image \"foo\" --arg region \"us-west-2\" --arg secret \"mysecret\" --arg env \"prod\" '{\"flask\": {\"image\": $image, \"ports\": {\"5000\":\"HTTP\"}, \"environment\": {\"FLASK_ENV\": $env, \"AWS_SECRET_NAME\": $secret, \"AWS_REGION_NAME\": $region}}}'"
    ],
    "data": [
      {
        "input": {},
        "output": "{\"flask\":{\"image\":\"foo\",\"ports\":{\"5000\":\"HTTP\"},\"environment\":{\"FLASK_ENV\":\"prod\",\"AWS_SECRET_NAME\":\"mysecret\",\"AWS_REGION_NAME\":\"us-west-2\"}}}"
      }
    ],
    "identifier": 69410346,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Will generate two separate objects (which can be processed with tools that support various more or less informal \"JSON streaming\" formats, including jq) whereas I'd want a single object, something like\n\n{ \"foo\": SOMETHING, \"bar\": SOMETHING_ELSE }\n\nbut I can't do that with multiple jq calls as jq will complain that the incomplete JSON is malformed.",
      "You can save and process intermediary JSON for the next jq command:\n\n#!/usr/bin/env bash\n\nread -r foo <a.txt\n\njson=\"$(jq -n --arg f \"$foo\" '{foo: $f}')\"\n\n\nbar=\"$(pwd)\"\njq -n --arg b \"$bar\" \"$json'+{bar: $b}'\n\n# or alternatively\njq --arg b \"$bar\" '.bar=$b' <<<$json",
      "jq -n --arg f \"$foo\" --arg b \"$bar\" '.foo = $f | .bar = $b' "
    ],
    "utterance": "Combine multiple key-value pairs collected at different stages into a single object with keys 'foo' and 'bar', assigning the values of shell variables.",
    "expressions": [
      "jq -n --arg f \"$foo\" --arg b \"$bar\" '.foo = $f | .bar = $b'",
      "jq -n --arg b \"$bar\" \"$json'+{bar: $b}'",
      "jq --arg b \"$bar\" '.bar=$b' <<<$json"
    ],
    "identifier": 69410606,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "echo '[{\"name\": \"bhanu\",\"dept\":\"test\"}]' | jq  -r '.[0].name'",
      "echo '[{name: bhanu,dept:test}]' | sed  's/[a-zA-Z]\\+/\"&\"/g' | jq -r '.[0].name'",
      "testValue='[{name: \"bhanu\", dept: \"test\"}]'",
      "jq -rn \"$testValue | .[0].name\""
    ],
    "utterance": "Extract the value of the 'name' property from the first object in an array where keys and string values are unquoted.",
    "expressions": [
      "echo '[{name: bhanu,dept:test}]' | sed  's/[a-zA-Z]\\+/\"&\"/g' | jq -r '.[0].name'",
      "jq -rn '[{name: \"bhanu\", dept: \"test\"}] | .[0].name'"
    ],
    "data": [
      {
        "input": "[{name: bhanu,dept:test}]",
        "output": "bhanu"
      },
      {
        "input": "[{name: \"bhanu\", dept: \"test\"}]",
        "output": "bhanu"
      }
    ],
    "identifier": 69414451,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": "The task requires handling non-standard JSON (unquoted keys and string values), which jq does not natively support. This is a limitation of the jq environment, not the task description itself.",
      "other_environment": "jq expects valid JSON input; unquoted keys/values are not valid JSON and will cause jq to fail to parse the input."
    }
  },
  {
    "context": [
      "I would like to split the string/stream/input by new lines and find (using the regexp `path\\+file\\:(.*)\\)`) the file path on each line.",
      "You can extract the paths using just `jq`, with its regular expression based `match` filter:",
      "$ cargo metadata | jq -r '.workspace_members[] | match(\"(?<=path\\\\+file:)[^)]+\") | .string'"
    ],
    "utterance": "Extract the file path from each workspace_members string where the path appears after 'path+file:' and before the closing parenthesis.",
    "expressions": [
      ".workspace_members[] | match(\"(?<=path\\+file:)[^)]+\") | .string"
    ],
    "data": [
      {
        "input": [
          "module1 0.1.0 (path+file:///workspace/module1)",
          "module2 0.1.0 (path+file:///workspace/nested/module2)",
          "module3 0.1.0 (path+file:///workspace/nested/module3)"
        ],
        "output": [
          "///workspace/module1",
          "///workspace/nested/module2",
          "///workspace/nested/module3"
        ]
      }
    ],
    "identifier": 69416175,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Currently, what I'm doing to get the data I need is something like this:",
      ".data[] | select(.type == \"poc_receipts_v1\") | select(.challengee != null) | select(.challengee == $a)",
      ".data[] | select(.type == \"poc_receipts_v1\") | select(.path[].challengee != null) | select(.path[].challengee == $a)",
      "It would be nice to do this in a single JQ query so I dont need to re-read the file on the second query.",
      "you could factor them out, simplifying it to",
      ".data[]\n| select(.type == \"poc_receipts_v1\")\n| select((.,.path[]).challengee != null)\n| select((.,.path[]).challengee == $a)"
    ],
    "utterance": "Extract all entries where type equals poc_receipts_v1 and challengee equals a specific value, whether challengee is at the top level or inside any path[].",
    "expressions": [
      ".data[] | select(.type == \"poc_receipts_v1\") | select((., .path[]).challengee == $a)"
    ],
    "identifier": 69418875,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "But I could not find how to get count by user (*group by*) for a title, to get a result like:\r\n\r\n    {\r\n      \"id\": 123,\r\n      \"title\": \"foo\",\r\n      \"count\": 2\r\n    }\r\n    {\r\n      \"id\": 456,\r\n      \"title\": \"foo\",\r\n      \"count\": 1\r\n    }",
      "jq  'map(\r\n         (.member_id) as $m\r\n        | .loans[].media[]\r\n        | select(.title==\"foo\")\r\n        | {id: $m, title: .title}\r\n        )\r\n    |group_by(.id)[]\r\n    |.[0] + { count: length }\r\n' input-file",
      "def count(s): reduce s as $x (null; .+1);\r\n\r\n\"foo\" as $title | .[] | {\r\n  id: .member_id,\r\n  $title,\r\n  count: count(.loans[].media[] | select(.title == $title))\r\n}"
    ],
    "utterance": "Count how many times the title \"foo\" appears for each member_id, outputting id, title, and count per member.",
    "expressions": [
      "map((.member_id) as $m | .loans[].media[] | select(.title==\"foo\") | {id: $m, title: .title}) | group_by(.id)[] | .[0] + { count: length }",
      "\"foo\" as $title | .[] | { id: .member_id, $title, count: (reduce .loans[].media[] as $m (0; . + (if $m.title == $title then 1 else 0 end))) }",
      "def count(s): reduce s as $x (null; .+1); \"foo\" as $title | .[] | { id: .member_id, $title, count: count(.loans[].media[] | select(.title == $title)) }"
    ],
    "data": [
      {
        "input": [
          {
            "member_id": 123,
            "loans": [
              {
                "date": "123",
                "media": [
                  {
                    "title": "foo"
                  },
                  {
                    "title": "bar"
                  }
                ]
              },
              {
                "date": "456",
                "media": [
                  {
                    "title": "foo"
                  }
                ]
              }
            ]
          },
          {
            "member_id": 456,
            "loans": [
              {
                "date": "789",
                "media": [
                  {
                    "title": "foo"
                  }
                ]
              }
            ]
          }
        ],
        "output": [
          {
            "id": 123,
            "title": "foo",
            "count": 2
          },
          {
            "id": 456,
            "title": "foo",
            "count": 1
          }
        ]
      }
    ],
    "identifier": 69423726,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am having trouble deleting all empty arrays from a JSON file with `jq`.",
      "I've tried:\r\n\r\n    walk(if type == \"array\" then map(select(length > 0)) else . end)\r\n\r\nwhich removes empty strings, but the arrays stay in the document. Is there a way to remove the arrays completely?",
      "You could walk through using `..`, select everything that is an empty array `[]` and set it to `empty`",
      "(.. | select(. == [])) |= empty",
      "oguz ismail uses `del()` instead of `|= empty` which is the preferred approach if the empty arrays to be deleted may be located not only in an object `{\"a\":[]}` but also in a superordinate array (`[[]]`).",
      "del(.. | select(. == []))"
    ],
    "utterance": "Remove all empty arrays from the data, regardless of where they are nested.",
    "expressions": [
      "(.. | select(. == [])) |= empty",
      "del(.. | select(. == []))"
    ],
    "identifier": 69434407,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "while read -r idx url; do",
      "done < <( jq -j '.image, \" \", .url, \"\\n\"' \"${1:-images.json}\" )",
      "jq -j '.image, \" \", .url, \"\\n\"' \"${1:-images.json}\""
    ],
    "utterance": "Extract both the 'image' field and 'url' field from each line, separated by a space, for use in a download script.",
    "expressions": [
      ".image, \" \", .url, \"\\n\"",
      "-j '.image, \" \", .url, \"\\n\"'"
    ],
    "data": [
      {
        "input": {
          "image": 136,
          "url": "https://enkistroy.ru/800/600/https/avatars.mds.yandex.net/get-zen_doc/1852570/pub_5dbb15dfba281e00b14b4174_5dbb2300e3062c00b072ecce/scale_1200"
        },
        "output": "136 https://enkistroy.ru/800/600/https/avatars.mds.yandex.net/get-zen_doc/1852570/pub_5dbb15dfba281e00b14b4174_5dbb2300e3062c00b072ecce/scale_1200\n"
      }
    ],
    "identifier": 69439635,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "On the command line for one run of the script a command like this one works:",
      "`./script_that_produces_json --intput=value | jq '.[] | select(.name==\"objecName\") | .fieldIWant'`",
      "output=$(./script_that_produces_json --input=$entry | jq '.[] | select(.name==\"$entry\") | .fieldIWant)'",
      "This yields the variable substituted, but there are no quotes around it, which fails when passed to jq.",
      "How do I quote the variable/string correctly such that both the variable is replaced and the quotes are passed?"
    ],
    "utterance": "Select objects where the name field matches a variable's value and extract a specific field.",
    "expressions": [
      ".[] | select(.name==\"$entry\") | .fieldIWant",
      ".[] | select(.name==$v) | .fieldIWant"
    ],
    "identifier": 69442309,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to select only the object ID if the following conditions matches:\r\n- status == success\r\n- name == build_job\r\n- artifacts.size > 0 where file_type == archive",
      "For the last condition, take the array `.artifacts`, reduce it to those elements matching your criteria `map(select(.file_type == \"archive\"))` and test the resulting array's length `length > 0`.",
      "all(.artifacts[] | select(.file_type == \"archive\");\r\n        .size > 0)",
      ".[0] | select(\r\n  .name == \"build_job\" and\r\n  .status == \"success\" and (\r\n    (.artifacts | map(select(.file_type == \"archive\"))) | length > 0\r\n  )\r\n)\r\n| .id"
    ],
    "utterance": "Select the id of the first object in the array where status is success, name is build_job, and there is at least one artifact with file_type equal to archive and size greater than 0.",
    "expressions": [
      ".[0] | select(.name == \"build_job\" and .status == \"success\" and (.artifacts | map(select(.file_type == \"archive\" and .size > 0)) | length > 0)) | .id",
      ".[0] | select(.name == \"build_job\" and .status == \"success\" and (all(.artifacts[] | select(.file_type == \"archive\"); .size > 0))) | .id",
      ".[0] | select(.name == \"build_job\" and .status == \"success\" and any(.artifacts[]; .file_type == \"archive\" and .size > 0)) | .id"
    ],
    "data": [
      {
        "input": [
          {
            "id": 125650,
            "status": "success",
            "name": "build_job",
            "artifacts": [
              {
                "file_type": "archive",
                "size": 72720116,
                "filename": "artifacts.zip",
                "file_format": "zip"
              },
              {
                "file_type": "metadata",
                "size": 1406,
                "filename": "metadata.gz",
                "file_format": "gzip"
              }
            ]
          }
        ],
        "output": 125650
      }
    ],
    "identifier": 69448191,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Can jq return the same json but with fields that I choose on multiple levels?",
      "Here, I have selected the `string` field at the top level, and the `a` field within the `object` field.",
      "Yes, it is possible. I wrote a function for that once and posted [here](/a/61655940/10248678). You can use it like so:",
      "def pick(paths):\n  . as $in\n  | reduce path(paths) as $path (null;\n    setpath($path; $in | getpath($path))\n  );\n\npick(.object.a, .string)"
    ],
    "utterance": "Select specific fields, such as 'string' at the top level and 'a' within 'object', and return an object with only those fields preserved at their respective nesting levels.",
    "expressions": [
      "def pick(paths):\n  . as $in\n  | reduce path(paths) as $path (null;\n    setpath($path; $in | getpath($path))\n  );\n\npick(.object.a, .string)"
    ],
    "data": [
      {
        "input": {
          "array": [
            1,
            2,
            3
          ],
          "boolean": true,
          "color": "gold",
          "null": null,
          "number": 123,
          "object": {
            "a": "b",
            "c": "d"
          },
          "string": "Hello World"
        },
        "output": {
          "object": {
            "a": "b"
          },
          "string": "Hello World"
        }
      }
    ],
    "identifier": 69453999,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "From a composer.lock I am creating json output of some fields, like this:\n\n```js\n <composer.lock jq  '.packages[] | {site: \"foo\", name: .name, version: .version, type: .type}' | jq -s\n```\nIt works fine:",
      "but now I want to add a generated value, using uuidgen, but I can't figure out how to do that.",
      "End result would be something like:\n\n```json\n  {\n    \"site\": \"foo\",\n    \"name\": \"webmozart/path-util\",\n    \"version\": \"2.3.0\",\n    \"type\": \"library\",\n    \"uuid\": \"c4e97c3c-147d-4360-a601-6b4f6f5e71bb\"\n  },\n  {\n    \"site\": \"foo\",\n    \"name\": \"webonyx/graphql-php\",\n    \"version\": \"v14.9.0\",\n    \"type\": \"library\",\n    \"uuid\": \"6fbe472b-49fe-4064-93f0-09a18a7e1c24\"\n  }\n```",
      "Here's one possibility that does not require your shell to support \"process substitution\" but which entails two passes, the first for determining the number of UUIDs that are required:",
      "generate $( <composer.lock jq '.packages|length') | \n  jq -nR --argfile input composer.lock  '\n    ($input \n    | .packages[]\n    | {site: \"foo\", name: .name, version: .version, type: .type,\n       id: input})'",
      "If your shell supports process substitution, you can avoid a two-pass solution as follows:",
      "jq -nR  --argfile input composer.lock '\n  $input \n  | .packages[]  \n  | {site: \"foo\", name: .name, version: .version, type: .type, id: input}\n ' < <(while true; do uuidgen; done)"
    ],
    "utterance": "Output each package's site, name, version, type, and a unique UUID generated at runtime.",
    "expressions": [
      "jq -nR --argfile input composer.lock '($input | .packages[] | {site: \"foo\", name: .name, version: .version, type: .type, uuid: input})'",
      "jq -nR --argfile input composer.lock '$input | .packages[] | {site: \"foo\", name: .name, version: .version, type: .type, uuid: input}' < <(while true; do uuidgen; done)"
    ],
    "data": [
      {
        "input": {
          "packages": [
            {
              "name": "webmozart/path-util",
              "version": "2.3.0",
              "type": "library"
            },
            {
              "name": "webonyx/graphql-php",
              "version": "v14.9.0",
              "type": "library"
            }
          ]
        },
        "output": [
          {
            "site": "foo",
            "name": "webmozart/path-util",
            "version": "2.3.0",
            "type": "library",
            "uuid": "c4e97c3c-147d-4360-a601-6b4f6f5e71bb"
          },
          {
            "site": "foo",
            "name": "webonyx/graphql-php",
            "version": "v14.9.0",
            "type": "library",
            "uuid": "6fbe472b-49fe-4064-93f0-09a18a7e1c24"
          }
        ]
      }
    ],
    "identifier": 69457618,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "$ jq -r '.records[] | \"id: \\(.id) name: \\(.name)\"' afile.json",
      "How can I insert new lines so that I could get this:\n```\nid: 1\nname: hello\n\nid: 2\nname: world\n```",
      "One of many ways would be simply to add `\\n` to the end of the string specification.",
      "Another option is to use [`map()`][1] combined with [`join(\"\\n\\n\")`][2] to insert an `\\n` between each iteration. This way there won't be a trailing newline:\n\n```\njq -rj '.records | map(\"id: \\(.id)\\nname: \\(.name)\") | join(\"\\n\\n\")' input.json\n```",
      "I recommend this: I just splitted the output string in parts and added an extra  empty filter \"\" to get an empty line.\n\n```\njq -r '.records[] |  \"id: \\(.id)\", \"name: \\(.name)\", \"\"' afile.json\n```"
    ],
    "utterance": "For each object in the records array, print its id and name on separate lines, with a blank line between records.",
    "expressions": [
      ".records[] | \"id: \\(.id)\\nname: \\(.name)\\n\"",
      ".records | map(\"id: \\(.id)\\nname: \\(.name)\") | join(\"\\n\\n\")",
      ".records[] | \"id: \\(.id)\", \"name: \\(.name)\", \"\""
    ],
    "data": [
      {
        "input": {
          "records": [
            {
              "id": 1,
              "name": "hello"
            },
            {
              "id": 2,
              "name": "world"
            }
          ]
        },
        "output": "id: 1\nname: hello\n\nid: 2\nname: world\n"
      }
    ],
    "identifier": 69468778,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "The desired output is the name of the object with the highest build number in the rhel folder:",
      "$ jq -r 'some_program' file.json \nrhel/app-3.11.4.629600.txt",
      "[.objects[].name|select(test(\"rhel/.\"))]\n| max_by(split(\".\")[-2]|tonumber)"
    ],
    "utterance": "Find the name of the file in the 'rhel' folder with the highest build number (the last numeric segment before '.txt').",
    "expressions": [
      "[.objects[].name | select(test(\"rhel/.\"))] | max_by(split(\".\")[-2] | tonumber)"
    ],
    "data": [
      {
        "input": {
          "objects": [
            {
              "name": "rhel/"
            },
            {
              "name": "rhel/app-3.9.6.629089.txt"
            },
            {
              "name": "rhel/app-3.11.4.629600.txt"
            }
          ]
        },
        "output": "rhel/app-3.11.4.629600.txt"
      }
    ],
    "identifier": 69472830,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like `jq` to produce the following output:\r\ng1\r\nv11 v12\r\nv21 v22\r\n\r\ng2\r\nv31 v32",
      "jq --raw-output 'group_by(.g) | .[] | {g: .[0].g, v: .} | \"\\(.g)\\n\\(.v[].k1) \\(.v[].k2)\\n\"'",
      "group_by(.g)[] | .[0].g, (.[] | \"\\(.k1) \\(.k2)\"), \"\"",
      "jq --raw-output 'group_by(.g)|\n        map(.[0].g as $g | del(.[].g) |\n             [$g] + map(\n                 [.[]] | join(\" \")\n             ) | join(\"\\n\")\n        ) |\n        join(\"\\n\\n\")'"
    ],
    "utterance": "Group the array by the value of key g and output, for each group, g followed by each object's k1 and k2 values on one line, separated by a space, with groups separated by blank lines.",
    "expressions": [
      "group_by(.g)[] | .[0].g, (.[] | \"\\(.k1) \\(.k2)\"), \"\"",
      "group_by(.g)| map(.[0].g as $g | del(.[].g) | [$g] + map([.[]] | join(\" \")) | join(\"\\n\")) | join(\"\\n\\n\")"
    ],
    "data": [
      {
        "input": [
          {
            "g": "g1",
            "k1": "v11",
            "k2": "v12"
          },
          {
            "g": "g1",
            "k1": "v21",
            "k2": "v22"
          },
          {
            "g": "g2",
            "k1": "v31",
            "k2": "v32"
          }
        ],
        "output": "g1\nv11 v12\nv21 v22\n\ng2\nv31 v32\n"
      }
    ],
    "identifier": 69475661,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to monitor highest CpuUsage from this result.",
      "My desired result is a collection of CpuUsage values like \r\n`0.45 0.73 0.6...`",
      "You\u2019ll need to first parse the `MonitorHistory` json string to get it into a workable form.",
      "Just pull the values from each of the entries.",
      "just spit out the values.",
      "```\r\n.MonitorHistory | fromjson[][] | tonumber\r\n```"
    ],
    "utterance": "Extract all CpuUsage values as numbers from within MonitorHistory's time-based entries.",
    "expressions": [
      ".MonitorHistory | fromjson[][] | tonumber"
    ],
    "data": [
      {
        "input": {
          "MonitorHistory": "{\"2021-10-06T00:00:00Z\":{\"CpuUsage\":\"0.45\"},\"2021-10-06T00:01:00Z\":{\"CpuUsage\":\"0.57\"},\"2021-10-06T00:02:00Z\":{\"CpuUsage\":\"0.73\"},\"2021-10-06T00:03:00Z\":{\"CpuUsage\":\"0.6\"}}"
        },
        "output": [
          0.45,
          0.57,
          0.73,
          0.6
        ]
      }
    ],
    "identifier": 69477790,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have the following kind of array:\n{\n  \"name\": [\n    \"Paul\",\n    \"Alex\",\n    \"John\"\n  ],\n  \"age\": [\n    \"22\",\n    \"31\",\n    \"56\"\n  ],\n  \"town\": [\n    \"New York\",\n    \"Los Angeles\",\n    \"Detroit\"\n  ]\n}",
      "From this I want to create new arrays with the relevant information combined into one large array called \"items\".",
      "{ \"items\": [\n    {\n        \"name\": \"Paul\",\n        \"age\": \"22\",\n        \"town\": \"New York\"\n    },\n    {\n        \"name\": \"Alex\",\n        \"age\": \"31\",\n        \"town\": \"Los Angeles\"\n    },\n    {\n        \"name\": \"John\",\n        \"age\": \"56\",\n        \"town\": \"Detroit\"\n    }\n]}",
      "Here's a straightforward solution, which would be easy to adapt to handle the renaming of keys:\n[.[]]\n| transpose\n| {items: map( {name: .[0], age: .[1], town: .[2] } )}",
      "Here's a solution that makes it easy to handle the key names without explicitly mentioning any of them:\n\ndef objectify($keys):\n  with_entries( .key |= $keys[.] );\n\nkeys_unsorted as $keys\n| [.[]]\n| transpose\n| {items: map( objectify($keys) )}"
    ],
    "utterance": "Combine corresponding elements from the 'name', 'age', and 'town' arrays into a single array of objects under 'items', so each object has its matching name, age, and town.",
    "expressions": [
      "[.[]] | transpose | {items: map( {name: .[0], age: .[1], town: .[2] } )}",
      "def objectify($keys): with_entries( .key |= $keys[.] ); keys_unsorted as $keys | [.[]] | transpose | {items: map( objectify($keys) )}"
    ],
    "data": [
      {
        "input": {
          "name": [
            "Paul",
            "Alex",
            "John"
          ],
          "age": [
            "22",
            "31",
            "56"
          ],
          "town": [
            "New York",
            "Los Angeles",
            "Detroit"
          ]
        },
        "output": {
          "items": [
            {
              "name": "Paul",
              "age": "22",
              "town": "New York"
            },
            {
              "name": "Alex",
              "age": "31",
              "town": "Los Angeles"
            },
            {
              "name": "John",
              "age": "56",
              "town": "Detroit"
            }
          ]
        }
      }
    ],
    "identifier": 69479140,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have to get value from the last array in ```values```.",
      "How can I get value from the last array in ```values```?",
      "You can use a negative index to enumerate from the back of an array: to get the last element of the last array in the `values` of the last element of `result`:",
      "```\n.data.result[-1].values[-1][-1]\n```",
      "filter: \n.data.result[].values[1][1]?\n",
      "output:\n\"65.51666666666667\"\n"
    ],
    "utterance": "Get the second value from the last array in the values property of each result.",
    "expressions": [
      ".data.result[].values[-1][1]",
      ".data.result[-1].values[-1][1]",
      ".data.result[-1].values[-1][-1]"
    ],
    "data": [
      {
        "input": {
          "status": "success",
          "data": {
            "resultType": "matrix",
            "result": [
              {
                "metric": {
                  "myMetric": "ABC"
                },
                "values": [
                  [
                    1633597734,
                    "64.54166666666667"
                  ],
                  [
                    1633598034,
                    "65.51666666666667"
                  ]
                ]
              }
            ]
          }
        },
        "output": "65.51666666666667"
      }
    ],
    "identifier": 69480708,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'm currently selecting the `_name` when `sppci_device_type` is equal to `sppci_nvme`:",
      "$ system_profiler -json SPPCIDataType | jq -r '.SPPCIDataType[] | select(.sppci_device_type == \"sppci_nvme\")._name'",
      "What is the correct `select` statement for a key matching any one of multiple values? Something like:",
      "select(.sppci_device_type EQUAL TO (\"sppci_audiodevice\" OR \"sppci_nvme\"))._name",
      "select(.sppci_device_type | . == \"sppci_audiodevice\" or . == \"sppci_nvme\") ._name",
      "select(.sppci_device_type | IN(\"sppci_audiodevice\", \"sppci_nvme\")) ._name"
    ],
    "utterance": "Select all objects where sppci_device_type is either 'sppci_audiodevice' or 'sppci_nvme', and output the value of _name.",
    "expressions": [
      ".SPPCIDataType[] | select(.sppci_device_type == \"sppci_audiodevice\" or .sppci_device_type == \"sppci_nvme\")._name",
      ".SPPCIDataType[] | select(.sppci_device_type | IN(\"sppci_audiodevice\", \"sppci_nvme\"))._name"
    ],
    "data": [
      {
        "input": {
          "SPPCIDataType": [
            {
              "_name": "Radeon RX 580",
              "sppci_device_type": "sppci_displaycontroller"
            },
            {
              "_name": "pci1002,aaf0",
              "sppci_device_type": "sppci_audiodevice"
            },
            {
              "_name": "PXS3",
              "sppci_device_type": "sppci_usbxhci"
            },
            {
              "_name": "pci144d,a808",
              "sppci_device_type": "sppci_nvme"
            },
            {
              "_name": "pci144d,a808",
              "sppci_device_type": "sppci_nvme"
            }
          ]
        },
        "output": [
          "pci1002,aaf0",
          "pci144d,a808",
          "pci144d,a808"
        ]
      }
    ],
    "identifier": 69481933,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to retrieve unique values based on a certain key.",
      "I would like to grab the last duplicate key `name` value and display that in the final output.",
      "Is it possible to do this with `unique_by` or what would be the best approach?",
      "If you want the last unique item, simply `reverse` the array first",
      "jq 'reverse | unique_by( {name} )'",
      "And if you want to retain the original order, `reverse` back again afterwards",
      "jq 'reverse | unique_by( {name} ) | reverse'"
    ],
    "utterance": "Get objects with unique 'name', keeping only the last occurrence of each.",
    "expressions": [
      "reverse | unique_by(.name)",
      "reverse | unique_by(.name) | reverse"
    ],
    "data": [
      {
        "input": [
          {
            "name": "app-fastly",
            "tag": "20210825-95-448f024",
            "image": "docker.io/repoxy/app-fastly:20210825-95-448f024"
          },
          {
            "name": "app-lovely",
            "tag": "20211004-2101-b6a256c",
            "image": "ghcr.io/repox/app-lovely:20211004-2101-b6a256c"
          },
          {
            "name": "app-lovely",
            "tag": "20211007-6622-b3fooba",
            "image": "ghcr.io/repoxy/app-lovely:20211007-6622-b3fooba"
          },
          {
            "name": "app-dogwood",
            "tag": "20210325-36-2a349e9",
            "image": "docker.io/repoxy/app-dogwood:20210325-36-2a349e9"
          }
        ],
        "output": [
          {
            "name": "app-dogwood",
            "tag": "20210325-36-2a349e9",
            "image": "docker.io/repoxy/app-dogwood:20210325-36-2a349e9"
          },
          {
            "name": "app-fastly",
            "tag": "20210825-95-448f024",
            "image": "docker.io/repoxy/app-fastly:20210825-95-448f024"
          },
          {
            "name": "app-lovely",
            "tag": "20211007-6622-b3fooba",
            "image": "ghcr.io/repoxy/app-lovely:20211007-6622-b3fooba"
          }
        ]
      }
    ],
    "identifier": 69485161,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have json like this",
      "{\n   \"Subject\": {\n       \"Data\": \"Value\",\n       \"Charset\": \"UTF-8\"\n   },\n   \"Body\": {\n       \"Text\": {\n           \"Data\": \"Value\",\n           \"Charset\": \"UTF-8\"\n       },\n       \"Html\": {\n           \"Data\": \"Data example\",\n           \"Charset\": \"UTF-8\"\n       }\n   }\n}",
      "And I need pass into this variables from bash script like this:",
      "{\n   \"Subject\": {\n       \"Data\": \"Value $value1\",\n       \"Charset\": \"UTF-8\"\n   },\n   \"Body\": {\n       \"Text\": {\n           \"Data\": \"Value $value2\",\n           \"Charset\": \"UTF-8\"\n       },\n       \"Html\": {\n           \"Data\": \"Data example\",\n           \"Charset\": \"UTF-8\"\n       }\n   }\n}",
      "I tried with this:",
      "jq -n --arg value1 test --arg value2 -f message.jq \nbut this ignore my $variables from message.jq, when I delete \"\" (quotes in message.jq close to $variable then working by json is destroyed I need quotes with variables."
    ],
    "utterance": "Insert environment variable values into the 'Subject.Data' and 'Body.Text.Data' fields so their values become 'Value <value1>' and 'Value <value2>'.",
    "expressions": [
      "{\n  \"Subject\": {\n    \"Data\": \"Value \"+$value1,\n    \"Charset\": \"UTF-8\"\n  },\n  \"Body\": {\n    \"Text\": {\n      \"Data\": \"Value \"+$value2,\n      \"Charset\": \"UTF-8\"\n    },\n    \"Html\": {\n      \"Data\": \"Data example\",\n      \"Charset\": \"UTF-8\"\n    }\n  }\n}"
    ],
    "data": [
      {
        "input": {},
        "output": {
          "Subject": {
            "Data": "Value test",
            "Charset": "UTF-8"
          },
          "Body": {
            "Text": {
              "Data": "Value <passed_value2>",
              "Charset": "UTF-8"
            },
            "Html": {
              "Data": "Data example",
              "Charset": "UTF-8"
            }
          }
        }
      }
    ],
    "identifier": 69494382,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have some JSON data that looks like this:",
      "I've tried running `jq '.entries[].timestamp = .entries[].timestamp/1000.0' data.json` but the outcome has lost precision on the values.",
      "Use the update operator `|=` instead. In this case, you could even go with `/=`.",
      "jq '.entries[].timestamp /= 1000.0' data.json",
      "jq '.entries[] |= ( .timestamp = .timestamp/1000.0 )'",
      "jq '.entries[] |= ( .timestamp /= 1000.0 )'",
      "jq '.entries[].timestamp /= 1000.0'"
    ],
    "utterance": "Divide the timestamp field of each entry by 1000.0 and update the result in place, keeping full floating-point precision.",
    "expressions": [
      ".entries[].timestamp /= 1000.0",
      ".entries[] |= (.timestamp = .timestamp/1000.0)",
      ".entries[] |= (.timestamp /= 1000.0)"
    ],
    "data": [
      {
        "input": {
          "entries": [
            {
              "frame": 1,
              "timestamp": 0,
              "r": 130,
              "g": 105,
              "b": 99
            },
            {
              "frame": 2,
              "timestamp": 99.96299999999974,
              "r": 129,
              "g": 105,
              "b": 99
            }
          ]
        },
        "output": {
          "entries": [
            {
              "frame": 1,
              "timestamp": 0,
              "r": 130,
              "g": 105,
              "b": 99
            },
            {
              "frame": 2,
              "timestamp": 0.09996299999999975,
              "r": 129,
              "g": 105,
              "b": 99
            }
          ]
        }
      }
    ],
    "identifier": 69495568,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am using jq to get the value for a specific key",
      "echo {{job.message}} | jq -r '.[0] | .status'",
      "Also you can use `jq -n` to build the JSON object:",
      "jq -n --arg text $job_message '{\"text\": ($text | fromjson)[0].status}'"
    ],
    "utterance": "Extract the value of the 'status' key from the first element in a list and output it as the 'text' field of a new object.",
    "expressions": [
      "jq -n --arg text \"$job_message\" '{\"text\": ($text | fromjson)[0].status}'"
    ],
    "data": [
      {
        "input": [
          {
            "status": "SUCCESS",
            "other_key": "value1"
          },
          {
            "status": "FAIL",
            "other_key": "value2"
          }
        ],
        "output": {
          "text": "SUCCESS"
        }
      }
    ],
    "identifier": 69499499,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Inside my `sample2.json` file, I have:\n\n    {\n            \"a\": 86,\n            \"b\": 99\n    }",
      "Inside my `json2csv` file, I have:\n\n    .[\"a\"]",
      "When I do:\n\n    cat sample.json | jq -L json2csv\n\nThe output is the whole sample.json file, like this:\n\nbut I would expect the output is only `86`because:",
      "How do I change the content of the `json2csv` file so I can only get the `86` in the output?",
      "Use `jq -f json2csv` instead.",
      "From `man jq`:\n\n> -Ldirectory / -L directory:\n>\n> Prepend directory to the search list for modules. If this option is used then no builtin search list is used.",
      "Compare this to:\n\n> -f filename / --from-file filename:\n>\n>Read filter from the file rather than from a command line, like awk\u00b4s -f option. You can also use # to make comments."
    ],
    "utterance": "Extract the value associated with the key \"a\" from an input where the top-level object contains the keys \"a\" and \"b\".",
    "expressions": [
      ".[\"a\"]"
    ],
    "data": [
      {
        "input": {
          "a": 86,
          "b": 99
        },
        "output": 86
      }
    ],
    "identifier": 69499767,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "but how do I insert the `\"k0\": 0` before the `\"k1\": 1`?",
      "I would like the result to be:\n{\n    \"k0\": 0,\n    \"k1\": 1\n}",
      "One option is creating an object with only k0-0 pair and adding the input to it like so:\n```\n{k0: 0} + .\n```"
    ],
    "utterance": "Insert a key-value pair with key 'k0' and value 0 at the beginning of an object that already contains 'k1': 1.",
    "expressions": [
      "{k0: 0} + ."
    ],
    "data": [
      {
        "input": {
          "k1": 1
        },
        "output": {
          "k0": 0,
          "k1": 1
        }
      }
    ],
    "identifier": 69500591,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "oci compute instance list-vnics --all --compartment-id xxxxxx  --profile myprof |C:\\myfiles\\jq-win64.exe \".data[]|select(.\"hostname-label\"==\"test1\" )\"",
      "Double quotes inside single quotes need to be escaped :",
      "'.data[]|select(.\\\"hostname-label\\\"==\\\"test1\\\")'"
    ],
    "utterance": "Filter all objects where the hostname-label field is equal to test1.",
    "expressions": [
      ".data[] | select(.\"hostname-label\" == \"test1\")"
    ],
    "identifier": 69505903,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have 2 json files that I wish to compare with each other and end up with a file that has just the unique objects from the second file. A unique object is not determined by the full object but rather by the value of one key within it called `Car ID`.",
      "only the second object from the second file should remain, as so:\n\n    {\n        \"User Data\": [\n        {\"First Name\": \"Jeremy\", \"Last Name\": \"Blake\", \"DOB\": \"1965-09-21\", \"Car ID\": \"UJ345IE2\"}\n        ]\n        }",
      "Read in both files as variables using `--argfile`, then `map` both to just their `Car ID`, then subtract the first array from the second resulting in an array of `Car ID`s you want to keep, and finally use the keys to `select` the right elements from the second input file.",
      "jq -n \\\n  --argfile file1 file1.json \\\n  --argfile file2 file2.json \\\n  '\n    # Take both input files and create an array $ids with all\n    # Car IDs from the second file that are not part of the first one\n    [ $file1, $file2 ]\n    | map(.[\"User Data\"] | map(.[\"Car ID\"]))\n    | (.[1] - .[0]) as $ids\n    \n    # Take the second input file and select only those entries\n    # whose Car ID is part of the previously stored array of Car IDs\n    | $file2\n    | .[\"User Data\"] |= map(select(.[\"Car ID\"] == $ids[]))\n  '"
    ],
    "utterance": "Return all objects in the second file where Car ID does not appear in the first file.",
    "expressions": [
      "[ $file1, $file2 ]\n| map(.[\"User Data\"] | map(.[\"Car ID\"]))\n| (.[1] - .[0]) as $ids\n| $file2\n| .[\"User Data\"] |= map(select(.[\"Car ID\"] == $ids[]))"
    ],
    "data": [
      {
        "input": {
          "file1": {
            "User Data": [
              {
                "First Name": "Tony",
                "Last Name": "Evans",
                "DOB": "1987-02-01",
                "Car ID": "UJ928JD9"
              },
              {
                "First Name": "John",
                "Last Name": "Smith",
                "DOB": "1972-11-27",
                "Car ID": "UJ235UW8"
              },
              {
                "First Name": "Kirsty",
                "Last Name": "Morgan",
                "DOB": "1991-06-08",
                "Car ID": "UJ424KL2"
              }
            ]
          },
          "file2": {
            "User Data": [
              {
                "First Name": "Harry",
                "Last Name": "Jones",
                "DOB": "1983-03-09",
                "Car ID": "UJ928JD9"
              },
              {
                "First Name": "Jeremy",
                "Last Name": "Blake",
                "DOB": "1965-09-21",
                "Car ID": "UJ345IE2"
              },
              {
                "First Name": "Jason",
                "Last Name": "Roberts",
                "DOB": "1972-10-18",
                "Car ID": "UJ424KL2"
              }
            ]
          }
        },
        "output": {
          "User Data": [
            {
              "First Name": "Jeremy",
              "Last Name": "Blake",
              "DOB": "1965-09-21",
              "Car ID": "UJ345IE2"
            }
          ]
        }
      }
    ],
    "identifier": 69509122,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "My json file is basically the same as this except it's all on one line. My goal is to obtain a .csv file like the possible output listed below.",
      "Input:",
      "[",
      "    {\"code\": \"NSW\", \"name\": \"New South Wales\", \"level\":\"state\", \"country\": \"AU\"},",
      "    {\"code\": \"AB\", \"name\": \"Alberta\", \"level\":\"province\", \"country\": \"CA\"},",
      "    {\"code\": \"ABD\", \"name\": \"Aberdeenshire\", \"level\":\"council area\", \"country\": \"GB\"},",
      "    {\"code\": \"AK\", \"name\": \"Alaska\", \"level\":\"state\", \"country\": \"US\"}",
      "]",
      "Possible output:",
      "code,name,level,country",
      "NSW,New South Wales,state,AU",
      "AB,Alberta,province,CA",
      "ABD,Aberdeenshire,council area,GB",
      "AK,Alaska,state,US"
    ],
    "utterance": "Convert an array of objects with keys code, name, level, and country into a CSV string with a header row followed by rows for each object's values in that order.",
    "expressions": [
      "[\"code\",\"name\",\"level\",\"country\"], (.[] | [.code, .name, .level, .country]) | @csv"
    ],
    "data": [
      {
        "input": [
          {
            "code": "NSW",
            "name": "New South Wales",
            "level": "state",
            "country": "AU"
          },
          {
            "code": "AB",
            "name": "Alberta",
            "level": "province",
            "country": "CA"
          },
          {
            "code": "ABD",
            "name": "Aberdeenshire",
            "level": "council area",
            "country": "GB"
          },
          {
            "code": "AK",
            "name": "Alaska",
            "level": "state",
            "country": "US"
          }
        ],
        "output": "code,name,level,country\n\"NSW\",\"New South Wales\",\"state\",\"AU\"\n\"AB\",\"Alberta\",\"province\",\"CA\"\n\"ABD\",\"Aberdeenshire\",\"council area\",\"GB\"\n\"AK\",\"Alaska\",\"state\",\"US\""
      }
    ],
    "identifier": 69519035,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "When there are multiple columns and one column is JSON, how to print the processing result of the JSON column alone with other columns literally? In the following example, how to print the first column and the JSON processing result of the 2nd column?",
      "To solve this entirely in `jq` you have to read it as non-JSON first and interpret the second column as JSON using jq's `fromjson`",
      "jq -Rr './\"\\t\" | .[1] |= (fromjson | .a) | @tsv' <<< $'A\\t{\"a\": 1}'$'\\nB\\t{\"a\": 2}'",
      "jq --raw-input --raw-output --slurp 'split(\"\\n\") | map(split(\"\\t\")) | map(select(length>0)) | .[] | {\"p\":.[0], \"j\":.[1] | fromjson} | [.p, .j.a] | @tsv' <<< $'A\\t{\"a\": 1}'$'\\nB\\t{\"a\": 2}'"
    ],
    "utterance": "Extract the first column as-is and the value of .a from the JSON in the second column for each row in a two-column TSV file.",
    "expressions": [
      "jq -Rr 'split(\"\\t\") | .[0] + \"\\t\" + ((.[1] | fromjson | .a) | tostring)'",
      "jq --raw-input --raw-output --slurp 'split(\"\\n\") | map(split(\"\\t\")) | map(select(length>0)) | .[] | {p:.[0], j:.[1] | fromjson} | [.p, .j.a] | @tsv'"
    ],
    "data": [
      {
        "input": "A\t{\"a\": 1}\nB\t{\"a\": 2}",
        "output": "A\t1\nB\t2"
      }
    ],
    "identifier": 69519816,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "within each object I need the value for three keys: \"STK_NUM\":\"1004251 \", \"DLR_COST\":40.32 , \"RTL_AMT\":9.99. How would I write the jq program to get the key names as headers in my csv file: STK_NUM, DLR_COST,RTL_AMT and the values of these keys from all objects: 1004251,40.32,9.99 ? These three keys are in every object.",
      "Desired csv:",
      "STK_NUM,DLR_COST,RTL_AMT",
      "1004251,40.32,9.99",
      "1012658,29.99,4.69",
      "1232556,18.89,2.49",
      "Try\n\n    jq -r '\n      \n      # Headers\n      [\"STK_NUM\", \"DLR_COST\", \"RTL_AMT\"],\n      \n      # Rows\n      (.[] | [.STK_NUM, .DLR_COST, .RTL_AMT])\n      \n      # Output Format\n      | @csv\n      \n    ' HSEitm.json",
      "In the spirit of DRY:\n```\njq -r '\n  [\"STK_NUM\", \"DLR_COST\", \"RTL_AMT\"] as $headers\n  | $headers, \n    # Rows:\n    map(.[$headers[]]) \n  | @csv\n'\n```"
    ],
    "utterance": "Output a CSV with headers STK_NUM, DLR_COST, RTL_AMT and the corresponding values for these keys from all objects in the array.",
    "expressions": [
      "[\"STK_NUM\", \"DLR_COST\", \"RTL_AMT\"], (.[] | [.STK_NUM, .DLR_COST, .RTL_AMT]) | @csv",
      "[\"STK_NUM\", \"DLR_COST\", \"RTL_AMT\"] as $headers | $headers, map(.[$headers[]]) | @csv"
    ],
    "data": [
      {
        "input": [
          {
            "STK_NUM": "1004251",
            "DLR_COST": 40.32,
            "RTL_AMT": 9.99
          },
          {
            "STK_NUM": "1012658",
            "DLR_COST": 29.99,
            "RTL_AMT": 4.69
          },
          {
            "STK_NUM": "1232556",
            "DLR_COST": 18.89,
            "RTL_AMT": 2.49
          }
        ],
        "output": "STK_NUM,DLR_COST,RTL_AMT\n1004251,40.32,9.99\n1012658,29.99,4.69\n1232556,18.89,2.49"
      }
    ],
    "identifier": 69520350,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "with JQ I need to extract the **resourceName** value for value = **mail@mail1.com**",
      "So in my case, the result should be **name_1**",
      "jq '.connections[] | . as $parent | .emailAddresses // empty | .[] | select(.value == \"mail@mail1.com\") | $parent.resourceName' file.json",
      ".connections[] | select(any(.emailAddresses[];.value == \"mail@mail1.com\")).resourceName"
    ],
    "utterance": "Get the resourceName for the connection whose emailAddresses array contains a value equal to mail@mail1.com.",
    "expressions": [
      ".connections[] | . as $parent | .emailAddresses // empty | .[] | select(.value == \"mail@mail1.com\") | $parent.resourceName",
      ".connections[] | select(any(.emailAddresses[];.value == \"mail@mail1.com\")).resourceName"
    ],
    "data": [
      {
        "input": {
          "connections": [
            {
              "resourceName": "name_1",
              "etag": "123456789",
              "emailAddresses": [
                {
                  "metadata": {
                    "primary": true,
                    "source": {
                      "type": "CONTACT",
                      "id": "123456"
                    }
                  },
                  "value": "mail@mail1.com"
                }
              ]
            },
            {
              "resourceName": "name_2",
              "etag": "987654321",
              "emailAddresses": [
                {
                  "metadata": {
                    "primary": true,
                    "source": {
                      "type": "CONTACT",
                      "id": "654321"
                    },
                    "sourcePrimary": true
                  },
                  "value": "mail@mail2.com"
                }
              ]
            }
          ],
          "totalPeople": 187,
          "totalItems": 187
        },
        "output": "name_1"
      }
    ],
    "identifier": 69531760,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I would like to use jq to \"abbreviate\" values longer than `n` characters. Each value should be converted to a string `s`, and if the result is longer than `n`, it should be replaced with `s[:n] + \" ...\"`. If the string is not too long, it should ideally be left as the original value.",
      "$ jq --argjson n 10 '.[] |= (\n  if type == \"number\" then . else\n    tostring | if length > $n then .[:$n] + \"...\" else . end\n  end\n)' j.json",
      "To prevent the numbers from being converted to strings, we can wrap the simple case with another `if` condition checking the item's `type`"
    ],
    "utterance": "Abbreviate each top-level value to at most 10 characters by converting it to a string and truncating with '...' if necessary, but leave numbers unaltered.",
    "expressions": [
      ".[] |= (if type == \"number\" then . else tostring | if length > 10 then .[:10] + \"...\" else . end end)"
    ],
    "data": [
      {
        "input": {
          "a": "hi",
          "b": 4213,
          "c": 23154646789132456,
          "d": "a very long string that should be shortened",
          "e": {
            "x": "some value",
            "y": {
              "alpha": "foo"
            }
          }
        },
        "output": {
          "a": "hi",
          "b": 4213,
          "c": 23154646789132456,
          "d": "a very lon...",
          "e": "{\"x\":\"some..."
        }
      }
    ],
    "identifier": 69532143,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to filter only those id's whose last four-digit matches with the input list which looks like this",
      "Select objects from data that match the input",
      "match.jq",
      ".data[]\n| (.id|split(\"\")|reverse|.[0:4]|reverse|join(\"\")) as $l4\n| { orig: ., uid: $users[$l4] }\n| select(.uid != null)\n| \"\\( .uid ) \\( .orig.id )\""
    ],
    "utterance": "Extract all items where the last four characters of the id field match any value from a provided list and output the mapped key and id.",
    "expressions": [
      ".data[]\n| (.id|split(\"\")|reverse|.[0:4]|reverse|join(\"\")) as $l4\n| { orig: ., uid: $users[$l4] }\n| select(.uid != null)\n| \"\\( .uid ) \\( .orig.id )\""
    ],
    "data": [
      {
        "input": {
          "data": [
            {
              "id": "dkfjgh48sdfg",
              "abc": "sdfdsf",
              "pqr": "sdkfj"
            },
            {
              "id": "sdfg742sdfg2",
              "abc": "aksjdfj",
              "pqr": "alkejf"
            },
            {
              "id": "sdfjk7df3dfg",
              "abc": "smdfb",
              "pqr": "sdjkfhakj"
            },
            {
              "id": "xxxxxxxxxxxx",
              "abc": "smdfb",
              "pqr": "sdjkfhakj"
            },
            {
              "id": "nope........",
              "abc": "smdfb",
              "pqr": "sdjkfhakj"
            },
            {
              "id": "____________",
              "abc": "smdfb",
              "pqr": "sdjkfhakj"
            },
            {
              "id": "pltrsg2rt4tl",
              "abc": "artgrre",
              "pqr": "sfeewrwg"
            }
          ]
        },
        "output": [
          "44 dkfjgh48sdfg",
          "66 sdfg742sdfg2",
          "12 sdfjk7df3dfg",
          "69 pltrsg2rt4tl"
        ]
      }
    ],
    "identifier": 69533089,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "NOTE: I added `EWR` (from `.reservationSummary.pickLoc.locationCode`to the 1st column of each row.",
      "`@csv` requires an array as input but `.reservationSummary.pickLoc.locationCode` only produces a string. Instead, make it an array by wrapping it into `[]` and then add the two arrays together using `+`:",
      "[.reservationSummary.pickLoc.locationCode] + (\n  .vehicleSummaryList[]\n  | [.carClass, .carGroup, .carAvailability, .payLaterRate.totalRateAmount]\n)\n| @csv"
    ],
    "utterance": "For each element in vehicleSummaryList, produce a row where the first column is reservationSummary.pickLoc.locationCode followed by carClass, carGroup, carAvailability, and payLaterRate.totalRateAmount.",
    "expressions": [
      "[.reservationSummary.pickLoc.locationCode] + (.vehicleSummaryList[] | [.carClass, .carGroup, .carAvailability, .payLaterRate.totalRateAmount]) | @csv"
    ],
    "data": [
      {
        "input": {
          "reservationSummary": {
            "pickLoc": {
              "locationCode": "EWR"
            }
          },
          "vehicleSummaryList": [
            {
              "carGroup": "Economy",
              "carClass": "A",
              "payNowRate": {
                "amount": "2327.49",
                "totalRateAmount": "3387.19"
              },
              "payLaterRate": {
                "amount": "2449.99",
                "totalRateAmount": "3540.79"
              }
            },
            {
              "carGroup": "Compact",
              "carClass": "B",
              "payNowRate": {
                "amount": "2327.49",
                "totalRateAmount": "3387.19"
              },
              "payLaterRate": {
                "amount": "2449.99",
                "totalRateAmount": "3540.79"
              }
            }
          ]
        },
        "output": [
          "\"EWR\",\"A\",\"Economy\",null,\"3540.79\"",
          "\"EWR\",\"B\",\"Compact\",null,\"3540.79\""
        ]
      }
    ],
    "identifier": 69534972,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Would it be possible to collapse it into a single `jq` command?",
      "jq -nr '[inputs | [.[\"@graph\"][][\"rdfs:label\"]]] | .[0]-.[1] | .[]' {9.0,13.0}/schemaorg-all-http.jsonld",
      "These arrays contain the value of `rdfs:label` and have the same number of strings as `@graph` had objects.",
      "The first operator will take this array of arrays and take the first and second elements of it and calculate the difference.",
      "Finally, instead of outputting this array, we want to output one thing after the other which will allow `-r` to strip the quotes. The `.[]` means to iterate over the difference array and output each element. It's the opposite operation to the `[]` array constructor."
    ],
    "utterance": "List all values in the rdfs:label fields found in the @graph array from the first input file that are not present in the second input file.",
    "expressions": [
      "jq -nr '[inputs | [.[\"@graph\"][][\"rdfs:label\"]]] | .[0]-.[1] | .[]' {9.0,13.0}/schemaorg-all-http.jsonld"
    ],
    "identifier": 69536572,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "`jq -r '&#39;.&quot;@graph&quot;[][&quot;rdfs:label&quot;]&#39;' 9.0/schemaorg-all-http.jsonld` works but `jq -r '&#39;.&quot;@graph&quot;[].[&quot;rdfs:label&quot;]&#39;' 9.0/schemaorg-all-http.jsonld` does not and I don't understand why `.[&quot;rdfs:label&quot;]` does _not_ need the dot.",
      "the expression under consideration '.\"@graph\"[][\"rdfs:label\"]' can be viewed as an abbreviated form of the pipeline:\n\n    .[\"@graph\"] | .[] | .[\"rdfs:label\"]",
      "One of the most important abbreviation rules is:\n\n    E | .[] #=> E[]\n\nAnother is:\n\n    .[\"<string>\"] #=> .\"<string>\""
    ],
    "utterance": "Extract all values of the 'rdfs:label' key from every object within the '@graph' array.",
    "expressions": [
      ".\"@graph\"[] [\"rdfs:label\"]",
      ".[\"@graph\"][] [\"rdfs:label\"]"
    ],
    "identifier": 69537349,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I\u2019d like to print the path to all keys named \"ENTITY\" with a value \"TEXT\".",
      "paths( objects | .ENTITY == \"TEXT\" )",
      "path( .. | select( type == \"object\" and .ENTITY == \"TEXT\" ) )",
      "jq --arg key \"ENTITY\" --arg value \"TEXT\" '\n  [[leaf_paths],[getpath(leaf_paths)]]\n  | transpose\n  | map(select(.[0][-1] == $key and .[1] == $value))[][0]\n'"
    ],
    "utterance": "Find all paths to keys named ENTITY whose value is TEXT, regardless of nesting level.",
    "expressions": [
      "paths(objects | .ENTITY == \"TEXT\")",
      "path(.. | select(type == \"object\" and .ENTITY == \"TEXT\"))",
      "jq --arg key \"ENTITY\" --arg value \"TEXT\" '[[leaf_paths],[getpath(leaf_paths)]] | transpose | map(select(.[0][-1] == $key and .[1] == $value))[][0]'"
    ],
    "data": [
      {
        "input": {
          "foo": {
            "ENTITY": "TEXT",
            "bar": [
              {
                "ENTITY": "OTHERVALUE"
              },
              {
                "ENTITY": "TEXT"
              }
            ]
          },
          "baz": {
            "something": {
              "ENTITY": "TEXT"
            }
          }
        },
        "output": [
          [
            "foo",
            "ENTITY"
          ],
          [
            "foo",
            "bar",
            1,
            "ENTITY"
          ],
          [
            "baz",
            "something",
            "ENTITY"
          ]
        ]
      }
    ],
    "identifier": 69544555,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I could get what I want with select(.type==\"mammal\" or .type==\"reptile\"). But this is simplified version, as you can expect, this could be too verbose easily.",
      "So I want to do the same with more succinct way like select(.type in [\"mammal\", \"reptile\"]).",
      "Is this what you are looking for?\n\n    select(.type | IN(\"mammal\",\"reptile\"))"
    ],
    "utterance": "Filter objects whose type is either \"mammal\" or \"reptile\".",
    "expressions": [
      "select(.type | IN(\"mammal\",\"reptile\"))"
    ],
    "data": [
      {
        "input": [
          {
            "type": "mammal",
            "value": "lion"
          },
          {
            "type": "reptile",
            "value": "snake"
          },
          {
            "type": "insect",
            "value": "fly"
          },
          {
            "type": "mammal",
            "value": "chicken"
          }
        ],
        "output": [
          {
            "type": "mammal",
            "value": "lion"
          },
          {
            "type": "reptile",
            "value": "snake"
          },
          {
            "type": "mammal",
            "value": "chicken"
          }
        ]
      }
    ],
    "identifier": 69545347,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to pull out of this list the `ip_prefix` based on where region contains `us-west-` and list them in csv.",
      "You have the filters in the wrong order. Try:",
      "jq '.prefixes[] | select(.region|startswith(\"us-west\")) | .ip_prefix' <your json file>",
      "What this is doing is getting the `prefixes` array, filtering for elements where the region starts with \"us-west\" and then selecting the `ip_prefix`."
    ],
    "utterance": "List all ip_prefix values where region starts with 'us-west'.",
    "expressions": [
      ".prefixes[] | select(.region|startswith(\"us-west\")) | .ip_prefix"
    ],
    "data": [
      {
        "input": {
          "syncToken": "15985471",
          "createDate": "2021-10-08-20-13-16",
          "prefixes": [
            {
              "ip_prefix": "3.5.140.0/22",
              "region": "ap-northeast-2",
              "service": "AMAZON",
              "network_border_group": "ap-northeast-2"
            },
            {
              "ip_prefix": "13.34.37.64/27",
              "region": "ap-southeast-4",
              "service": "AMAZON",
              "network_border_group": "ap-southeast-4"
            },
            {
              "ip_prefix": "35.180.0.0/16",
              "region": "eu-west-3",
              "service": "AMAZON",
              "network_border_group": "eu-west-3"
            },
            {
              "ip_prefix": "52.93.153.170/32",
              "region": "eu-west-2",
              "service": "AMAZON",
              "network_border_group": "eu-west-2"
            },
            {
              "ip_prefix": "52.93.178.234/32",
              "region": "us-west-1",
              "service": "AMAZON",
              "network_border_group": "us-west-1"
            }
          ]
        },
        "output": [
          "52.93.178.234/32"
        ]
      }
    ],
    "identifier": 69545895,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "However I&#39;m not sure what to do if I want to merge two additional objects, say I had:\r\n\r\n## food.json\r\n\r\n```json\r\n{\r\n  &quot;food&quot;: [\r\n    &quot;meat&quot;,\r\n    &quot;fish&quot;,\r\n    &quot;vegetables&quot;\r\n  ]\r\n}\r\n```\r\n\r\nas well that I want at the top, resulting in:\r\n\r\n```json\r\n{\r\n  &quot;owners&quot;: [\r\n    &quot;peter&quot;,\r\n    &quot;william&quot;,\r\n    &quot;sally&quot;\r\n  ],\r\n  &quot;food&quot;: [\r\n    &quot;meat&quot;,\r\n    &quot;fish&quot;,\r\n    &quot;vegetables&quot;\r\n  ],\r\n  &quot;animals&quot;: [ ... ]\r\n}\r\n```",
      "Just read in `food.json` using parameter `--slurpfile food food.json` and modify `program.jq` to include it: `input as $owners | {$owners, food: $food[0].food, animals: [inputs]}`."
    ],
    "utterance": "Combine owners from one file, food from another, and group cats, dogs, and snakes as animals into a single top-level object containing owners, food, and animals keys.",
    "expressions": [
      "input as $owners | {$owners, food: $food[0].food, animals: [inputs]}"
    ],
    "data": [
      {
        "input": {
          "owners.json": {
            "owners": [
              "peter",
              "william",
              "sally"
            ]
          },
          "food.json": {
            "food": [
              "meat",
              "fish",
              "vegetables"
            ]
          },
          "cats.json": {
            "cats": [
              {
                "name": "fluffles",
                "age": 10,
                "color": "white"
              }
            ]
          },
          "dogs.json": {
            "dogs": [
              {
                "name": "sam",
                "age": 5,
                "color": "black and white"
              },
              {
                "name": "rover",
                "age": 2,
                "color": "brown and white"
              }
            ]
          },
          "snakes.json": {
            "snakes": [
              {
                "name": "noodles",
                "age": 10,
                "color": "green"
              }
            ]
          }
        },
        "output": {
          "owners": [
            "peter",
            "william",
            "sally"
          ],
          "food": [
            "meat",
            "fish",
            "vegetables"
          ],
          "animals": [
            {
              "cats": [
                {
                  "name": "fluffles",
                  "age": 10,
                  "color": "white"
                }
              ]
            },
            {
              "dogs": [
                {
                  "name": "sam",
                  "age": 5,
                  "color": "black and white"
                },
                {
                  "name": "rover",
                  "age": 2,
                  "color": "brown and white"
                }
              ]
            },
            {
              "snakes": [
                {
                  "name": "noodles",
                  "age": 10,
                  "color": "green"
                }
              ]
            }
          ]
        }
      }
    ],
    "identifier": 69555673,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "And my condition is \"type\" should be \"secondary\". It should display the field having type secondary.",
      "For this specific case to filter on a condition, you need a `select` statement",
      "[ \"color\", \"category\", \"type\" ], \n( .colors[] | select( .type == \"secondary\" ) | [ .color, .category, .type] ) | @csv"
    ],
    "utterance": "Select all elements whose type is \"secondary\" in the colors array and output their color, category, and type fields as CSV.",
    "expressions": [
      "[\"color\", \"category\", \"type\"], (.colors[] | select(.type == \"secondary\") | [.color, .category, .type]) | @csv"
    ],
    "data": [
      {
        "input": {
          "colors": [
            {
              "color": "black",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  255,
                  255,
                  1
                ],
                "hex": "#000"
              }
            },
            {
              "color": "white",
              "category": "value",
              "code": {
                "rgba": [
                  0,
                  0,
                  0,
                  1
                ],
                "hex": "#FFF"
              }
            },
            {
              "color": "red",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  0,
                  0,
                  1
                ],
                "hex": "#FF0"
              }
            },
            {
              "color": "blue",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  0,
                  0,
                  255,
                  1
                ],
                "hex": "#00F"
              }
            },
            {
              "color": "yellow",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  255,
                  0,
                  1
                ],
                "hex": "#FF0"
              }
            },
            {
              "color": "green",
              "category": "hue",
              "type": "secondary",
              "code": {
                "rgba": [
                  0,
                  255,
                  0,
                  1
                ],
                "hex": "#0F0"
              }
            }
          ]
        },
        "output": "\"color\",\"category\",\"type\"\n\"green\",\"hue\",\"secondary\""
      }
    ],
    "identifier": 69561522,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to concatenate both the port and address and also prepend and append some text achieving solely with jq.",
      "For example I want the new output to be:",
      "{ \"Address\": \"myaddress1.com\", \"Port\": 6379, \"FullAddress\": \"redis://myaddress1.com:6379\" },",
      "Assuming that your input file is actually an array of objects, then the following might work for you:",
      "jq 'map(. + { \"FullAddress\": \"redis://\\(.Address):\\(.Port)\" })' input.json"
    ],
    "utterance": "Add a new field FullAddress to each object by concatenating 'redis://', the Address, a colon, and the Port fields.",
    "expressions": [
      "map(. + { \"FullAddress\": \"redis://\\(.Address):\\(.Port)\" })"
    ],
    "data": [
      {
        "input": [
          {
            "Address": "myaddress1.com",
            "Port": 6379
          },
          {
            "Address": "myaddress2.com",
            "Port": 6379
          }
        ],
        "output": [
          {
            "Address": "myaddress1.com",
            "Port": 6379,
            "FullAddress": "redis://myaddress1.com:6379"
          },
          {
            "Address": "myaddress2.com",
            "Port": 6379,
            "FullAddress": "redis://myaddress2.com:6379"
          }
        ]
      }
    ],
    "identifier": 69570802,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Working command",
      "    oci --profile $PROF bastion session list --all --bastion-id $BSTN_OCID  |jq -r '.data[].id'",
      "Failing command",
      "    oci --profile $PROF bastion session list --all --bastion-id $BSTN_OCID --output table |jq -r '.data[].id'",
      "Error",
      "    parse error: Invalid numeric literal at line 2, column 0",
      "As updated by Botje - JQ can not process as its not JSON. I used --query and it worked.",
      "    oci bastion session list  --bastion-id  $BSTN_OCID --output table --profile $PROF --all --query 'data[].id'"
    ],
    "utterance": "Extract all id fields from the data array in the response from the bastion session list command.",
    "expressions": [
      ".data[].id"
    ],
    "identifier": 69571099,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not provide example input or output, nor does it specify the structure of the 'response' object. It is unclear what the exact input JSON looks like, so the transformation cannot be precisely defined.",
      "other_environment": null
    }
  },
  {
    "context": [
      "method=$(cat settings.json | ./jq --arg URL ${url} '.settings[] | select(.pathPattern==$URL|tostring).httpOperations[] | select(.method == \"PUT\" or .method == \"POST\").method')",
      "This filter is equivalent to:\n\nselect((.pathPattern==$URL)|tostring)",
      "So these filters will actually select all the settings every time!",
      "I managed to make it work by replacing the single quote to double quotes in my JQ filter.",
      "method=$(cat settings.json | ./jq \".settings[] | select(.pathPattern==$URL|tostring).httpOperations[] | select(.method == \\\"PUT\\\" or .method == \\\"POST\\\").method\")"
    ],
    "utterance": "Output all method values from httpOperations where pathPattern equals the specified URL and method is either PUT or POST",
    "expressions": [
      ".settings[] | select(.pathPattern==$URL|tostring).httpOperations[] | select(.method == \"PUT\" or .method == \"POST\").method"
    ],
    "identifier": 69572458,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "First the `category_id` is looked up in following categories data set to extract out a category.key which is a translation tag (i.e. category_id 113 is `electronicsComputers`)",
      "Next the translation tag obtained from step #1 (i.e. `electronicsComputers`) is looked up in a file called `en_CA.json` (or `fr_CA.json` if using the site in French), this gives the desired natural language description: `Electronics & Computers`",
      "\"jq --argfile cat_lookup categories.json --argfile cat_word en_CA.json '\n  .category_key = $cat_lookup[.category_id | tostring].key \n  | .category_desc = $cat_word.global.transactionsCategories[.category_key]\n' ${START_YEAR}-transaction.json\"",
      "\"jq --argfile cat_lookup categories.json --argfile cat_word en_CA.json '\n  .category_desc = $cat_word.global.transactionsCategories[\n    $cat_lookup[.category_id | tostring].key\n  ]\n' ${START_YEAR}-transaction.json\""
    ],
    "utterance": "For each transaction, retrieve the natural language category description corresponding to its category_id by looking up the key in categories data and then using that key in the language file.",
    "expressions": [
      ".category_key = $cat_lookup[.category_id | tostring].key | .category_desc = $cat_word.global.transactionsCategories[.category_key]",
      ".category_desc = $cat_word.global.transactionsCategories[$cat_lookup[.category_id | tostring].key]"
    ],
    "data": [
      {
        "input": {
          "transaction_date": "2020-01-01",
          "id": 365707178,
          "category_id": 113,
          "description": "PAYPAL *JOES DC"
        },
        "output": {
          "transaction_date": "2020-01-01",
          "id": 365707178,
          "category_id": 113,
          "description": "PAYPAL *JOES DC",
          "category_key": "electronicsComputers",
          "category_desc": "Electronics & Computers"
        }
      }
    ],
    "identifier": 69579355,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task refers to 'categories data' and a 'language file', but does not specify their structure or how they are provided as input to jq. It is unclear how these lookups are to be performed without more information.",
      "other_environment": null
    }
  },
  {
    "context": [
      "So the question is, how would I remove *only* the Thumbnail entries containing or starting with the string `data:image`?",
      "You could add another constraint `startswith(\"data:image\") | not` and `select` to keep only the elements whose `.key` does not match `or` whose `.value` does not start that way, resulting in: `select((.key | test(\"Thumbnail\") | not) or (.value | startswith(\"data:image\") | not))`.",
      "Then simply call\n```\njq 'map(del(.meta_info.Thumbnail | select(startswith(\"data:image\"))))' Bookmarks\n```",
      "Something like this should do the trick:\n```\ndel(recurse | objects | select(has(\"Thumbnail\")) .Thumbnail | select(startswith(\"data:image\")))\n```"
    ],
    "utterance": "Remove all Thumbnail entries whose value starts with 'data:image', without affecting other Thumbnail entries.",
    "expressions": [
      "map(del(.meta_info.Thumbnail | select(startswith(\"data:image\"))))",
      "del(recurse | objects | select(has(\"Thumbnail\")) .Thumbnail | select(startswith(\"data:image\")))"
    ],
    "data": [
      {
        "input": {
          "Bookmarks": [
            {
              "meta_info": {
                "Thumbnail": "data:image/jpeg;base64,AAAA..."
              },
              "name": "A"
            },
            {
              "meta_info": {
                "Thumbnail": "chrome://vivaldi-data/local-image/aa0d87..."
              },
              "name": "B"
            },
            {
              "meta_info": {
                "Thumb": "data:image/png;base64,BBBB..."
              },
              "name": "C"
            }
          ]
        },
        "output": [
          {
            "meta_info": {},
            "name": "A"
          },
          {
            "meta_info": {
              "Thumbnail": "chrome://vivaldi-data/local-image/aa0d87..."
            },
            "name": "B"
          },
          {
            "meta_info": {
              "Thumb": "data:image/png;base64,BBBB..."
            },
            "name": "C"
          }
        ]
      }
    ],
    "identifier": 69595774,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Goal is to filter only timestamp and price of \"aa\", without giving any care about the \"success\": true key/value pair.",
      "You need to select the `data` field first: ```jq .data.aa[]```"
    ],
    "utterance": "Extract the array of objects under the 'aa' key from within the 'data' property, accessing each object's timestamp and price.",
    "expressions": [
      ".data.aa[]"
    ],
    "data": [
      {
        "input": {
          "success": true,
          "data": {
            "aa": [
              {
                "timestamp": 123456,
                "price": 1
              },
              {
                "timestamp": 123457,
                "price": 2
              }
            ],
            "bb": [
              {
                "timestamp": 123456,
                "price": 3
              },
              {
                "timestamp": 123457,
                "price": 4
              }
            ]
          }
        },
        "output": [
          {
            "timestamp": 123456,
            "price": 1
          },
          {
            "timestamp": 123457,
            "price": 2
          }
        ]
      }
    ],
    "identifier": 69599755,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "how to get the output in the indicated format without selected groups, e.g. group_3. All data is to be selected from each selected group: name_x.",
      "Ouptut:\n\n    name: name_1, group: group_1, field_A_value: value_111, field_B_value: value_112\n    name: name_2, group: group_1, field_A_value: value_121, field_B_value: value_122 \n    name: name_3, group: group_2, field_A_value: value_231, field_B_value: value_232",
      "jq --raw-output '\n\n  .groups | to_entries[] | select(.key | IN(\"group_3\") | not)\n  | [.key] + (.value | to_entries[] | [.key, .value[]])\n  | \"name: \\(.[1]), group: \\(.[0]), field_A_value: \\(.[2]), field_B_value: \\(.[3])\"\n\n' input.json"
    ],
    "utterance": "Return a line for each entry in all groups except group_3, displaying the name, group, first two field values as field_A_value and field_B_value.",
    "expressions": [
      ".groups | to_entries[] | select(.key | IN(\"group_3\") | not) | [.key] + (.value | to_entries[] | [.key, .value[]]) | \"name: \\(.[1]), group: \\(.[0]), field_A_value: \\(.[2]), field_B_value: \\(.[3])\""
    ],
    "data": [
      {
        "input": {
          "groups": {
            "group_1": {
              "name_1": {
                "field_111": "value_111",
                "field_112": "value_112",
                "field_11n": "value_11n"
              },
              "name_2": {
                "field_121": "value_121",
                "field_122": "value_122",
                "field_12n": "value_12n"
              }
            },
            "group_2": {
              "name_3": {
                "field_231": "value_231",
                "field_232": "value_232",
                "field_23n": "value_23n"
              }
            },
            "group_3": {
              "name_4": {
                "field_341": "value_341",
                "field_342": "value_342",
                "field_34n": "value_34n"
              }
            }
          }
        },
        "output": "name: name_1, group: group_1, field_A_value: value_111, field_B_value: value_112\nname: name_2, group: group_1, field_A_value: value_121, field_B_value: value_122\nname: name_3, group: group_2, field_A_value: value_231, field_B_value: value_232"
      }
    ],
    "identifier": 69609625,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have a text file with data like below:\r\n\r\n~~~\r\n6111119268639|22|65024:3|2000225350|Samsung|ADD|234534643645|REMOVE|5645657|65067:3|Apple|ADD|234534643645|REMOVE|3432523|65023:3\r\n6111119268639|22|65024:3|2000225350|Apple|ADD|234534643645|REMOVE|3432523|65023:3\r\n6111119268639|22|65024:3|2000225350|Samsung|ADD|234534643645|REMOVE|3432523|65023:3\r\n~~~",
      "I want want json output like this below:\r\n\r\n~~~\r\n[{\r\n\t\"ExternalId\": \"6111119268639\",\r\n\t\"ExternalIdType\": \"22\",\r\n\t\"RPPI\": \"65024:3\",\r\n\t\"NewPrimaryOfferId\": \"2000225350\",\r\n\t\"Samsung\": [{\r\n\t\t\t\"Action\": \"ADD\",\r\n\t\t\t\"NewSecondaryOfferId\": \"234534643645\"\r\n\t\t},\r\n\t\t{\r\n\t\t\t\"Action\": \"REMOVE\",\r\n\t\t\t\"SecondaryProductOfferId\": \"5645657\",\r\n\t\t\t\"RemoveSecondaryProductInstance\": \"65067:3\"\r\n\t\t}\r\n\t],\r\n\t\"Apple\": [\r\n\t{\r\n\t\t\t\"Action\": \"ADD\",\r\n\t\t\t\"NewComponentOfferId\": \"234534643645\"\r\n\t\t},\r\n\t\t{\r\n\t\t\t\"Action\": \"REMOVE\",\r\n\t\t\t\"ComponentOfferId\": \"3432523\",\r\n\t\t\t\"RemoveAddOnProductInstance\": \"65023:3\"\r\n\t\t}\r\n\t]\r\n}, \r\n...",
      "Here **ExternalId,ExternalIdType,RPPI,NewPrimaryOfferId** are constant and will be there in every line.But **Samsung** and **Apple** can vary accordingly means there could be only 'Samsung' in one line or there could be only 'Apple' in one line or there could be both as shown in sample text.",
      "This seems to work on your test data:\r\n\r\n```sh\r\njq -nR '\r\n   def offer:\r\n     . as $data |\r\n     [[], 0] | until([$data[.[1]]] | inside([\"ADD\", \"REMOVE\"]) | not;\r\n       if $data[.[1]] == \"ADD\" then\r\n        [ .[0] + [{ Action: \"ADD\", NewComponentOfferId: $data[.[1] + 1] }], .[1] + 2 ]\r\n       else\r\n        [ .[0] + [{ Action: \"REMOVE\", ComponentOfferId: $data[.[1] + 1], \r\n          RemoveAddOnProductInstance: $data[.[1] + 2] }], .[1] + 3 ]\r\n       end);\r\n\r\n   def build:\r\n       (. / \"|\") as $data | ($data | length) as $len |\r\n       [ { ExternalId: $data[0], ExternalIdType: $data[1], RPPI: $data[2],\r\n         NewPrimaryOfferId: $data[3] }, 4 ] |\r\n       until(.[1] >= $len;\r\n             ($data[.[1]+1:] | offer) as $off |\r\n             [ .[0] + { ($data[.[1]]): $off[0] }, .[1] + 1 + $off[1] ]) |\r\n      .[0];\r\n\r\n  [ inputs | build ]' data.txt\r\n```",
      "Another approach:\r\n\r\n```\r\njq -Rn '\r\n  [\r\n    inputs / \"|\" | reduce (.[4:] | while(. != [];.[6:])) as $prod (\r\n      .[:4] | with_entries(.key |= [\"ExternalId\",\"ExternalIdType\",\"RPPI\",\"NewPrimaryOfferId\"][.]);\r\n      .[$prod[0]] = [\r\n        {Action:\"ADD\", NewComponentOfferId:$prod[2]},\r\n        {Action:\"REMOVE\", ComponentOfferId:$prod[4], RemoveAddOnProductInstance:$prod[5]}\r\n      ]\r\n    )\r\n  ]\r\n' data.txt\r\n```\r\n[Demo](https://jqplay.org/s/SfG7bQ_8cI)\r\n"
    ],
    "utterance": "Convert lines of delimited text containing fields like ExternalId, ExternalIdType, RPPI, NewPrimaryOfferId, and a variable sequence of product actions into objects where each top-level key-value pair represents a field or a product key containing an array of action objects as shown in the sample output.",
    "expressions": [
      "jq -nR '\n   def offer:\n     . as $data |\n     [[], 0] | until([$data[.[1]]] | inside([\"ADD\", \"REMOVE\"]) | not;\n       if $data[.[1]] == \"ADD\" then\n        [ .[0] + [{ Action: \"ADD\", NewComponentOfferId: $data[.[1] + 1] }], .[1] + 2 ]\n       else\n        [ .[0] + [{ Action: \"REMOVE\", ComponentOfferId: $data[.[1] + 1], \n          RemoveAddOnProductInstance: $data[.[1] + 2] }], .[1] + 3 ]\n       end);\n\n   def build:\n       (. / \"|\") as $data | ($data | length) as $len |\n       [ { ExternalId: $data[0], ExternalIdType: $data[1], RPPI: $data[2],\n         NewPrimaryOfferId: $data[3] }, 4 ] |\n       until(.[1] >= $len;\n             ($data[.[1]+1:] | offer) as $off |\n             [ .[0] + { ($data[.[1]]): $off[0] }, .[1] + 1 + $off[1] ]) |\n      .[0];\n\n  [ inputs | build ]'",
      "jq -Rn '\n  [\n    inputs / \"|\" | reduce (.[4:] | while(. != [];.[6:])) as $prod (\n      .[:4] | with_entries(.key |= [\"ExternalId\",\"ExternalIdType\",\"RPPI\",\"NewPrimaryOfferId\"][.]);\n      .[$prod[0]] = [\n        {Action:\"ADD\", NewComponentOfferId:$prod[2]},\n        {Action:\"REMOVE\", ComponentOfferId:$prod[4], RemoveAddOnProductInstance:$prod[5]}\n      ]\n    )\n  ]\n'"
    ],
    "data": [
      {
        "input": "6111119268639|22|65024:3|2000225350|Samsung|ADD|234534643645|REMOVE|5645657|65067:3|Apple|ADD|234534643645|REMOVE|3432523|65023:3\n6111119268639|22|65024:3|2000225350|Apple|ADD|234534643645|REMOVE|3432523|65023:3\n6111119268639|22|65024:3|2000225350|Samsung|ADD|234534643645|REMOVE|3432523|65023:3",
        "output": [
          {
            "ExternalId": "6111119268639",
            "ExternalIdType": "22",
            "RPPI": "65024:3",
            "NewPrimaryOfferId": "2000225350",
            "Samsung": [
              {
                "Action": "ADD",
                "NewSecondaryOfferId": "234534643645"
              },
              {
                "Action": "REMOVE",
                "SecondaryProductOfferId": "5645657",
                "RemoveSecondaryProductInstance": "65067:3"
              }
            ],
            "Apple": [
              {
                "Action": "ADD",
                "NewComponentOfferId": "234534643645"
              },
              {
                "Action": "REMOVE",
                "ComponentOfferId": "3432523",
                "RemoveAddOnProductInstance": "65023:3"
              }
            ]
          },
          {
            "ExternalId": "6111119268639",
            "ExternalIdType": "22",
            "RPPI": "65024:3",
            "NewPrimaryOfferId": "2000225350",
            "Apple": [
              {
                "Action": "ADD",
                "NewComponentOfferId": "234534643645"
              },
              {
                "Action": "REMOVE",
                "ComponentOfferId": "3432523",
                "RemoveAddOnProductInstance": "65023:3"
              }
            ]
          },
          {
            "ExternalId": "6111119268639",
            "ExternalIdType": "22",
            "RPPI": "65024:3",
            "NewPrimaryOfferId": "2000225350",
            "Samsung": [
              {
                "Action": "ADD",
                "NewSecondaryOfferId": "234534643645"
              },
              {
                "Action": "REMOVE",
                "SecondaryProductOfferId": "3432523",
                "RemoveSecondaryProductInstance": "65023:3"
              }
            ]
          }
        ]
      }
    ],
    "identifier": 69611614,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The mapping from delimited fields to output object keys is not fully specified, especially for the variable product actions and their mapping to output field names (e.g., how 'ADD' and 'REMOVE' map to 'NewSecondaryOfferId', 'SecondaryProductOfferId', etc.). The rules for mapping product action fields are not described, making the transformation ambiguous.",
      "other_environment": null
    }
  },
  {
    "context": [
      "As a final alternative, you can use jq to convert the JSON into a CSV file and use the far simpler SQLite [.import command](https://sqlite.org/cli.html#importing_csv_files).",
      "jq -r \".[] | [ .col1, .col2, .col3, .col4 ] | @csv\" somedata.json > somedata.csv"
    ],
    "utterance": "Extract all objects' col1, col2, col3, and col4 fields as CSV rows with correct CSV formatting.",
    "expressions": [
      ".[] | [ .col1, .col2, .col3, .col4 ] | @csv"
    ],
    "data": [
      {
        "input": [
          {
            "col1": 1,
            "col2": "a",
            "col3": true,
            "col4": null
          },
          {
            "col1": 2,
            "col2": "b",
            "col3": false,
            "col4": "x"
          }
        ],
        "output": [
          "1,\"a\",true,null",
          "2,\"b\",false,\"x\""
        ]
      }
    ],
    "identifier": 69614594,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "kubectl get pv -o=json | jq -c '.items[] | {name: .metadata.name, claimName: .spec | if has(\"claimRef\") then .claimRef.name else empty end }'",
      "Trying to get the PV's which have PVC and if there are no PVC can be null of empty.",
      "Put parentheses around the value of the `claimName` field:",
      "{\n  name: .metadata.name,\n  claimName: (.spec | if has(\"claimRef\") then .claimRef.name else empty end)\n}",
      "Depending on how much you simplified your example, you may consider reducing the the `if` statement to:",
      "{\n  name: .metadata.name,\n  claimName: (.spec.claimRef.name? // empty)\n}"
    ],
    "utterance": "Extract the name and claimRef.name fields from each item in the items array, leaving claimName empty or null if claimRef is missing.",
    "expressions": [
      ".items[] | {name: .metadata.name, claimName: (.spec | if has(\"claimRef\") then .claimRef.name else empty end)}",
      ".items[] | {name: .metadata.name, claimName: (.spec.claimRef.name? // empty)}"
    ],
    "data": [
      {
        "input": {
          "apiVersion": "v1",
          "items": [
            {
              "apiVersion": "v1",
              "kind": "PersistentVolume",
              "metadata": {
                "annotations": {
                  "meta.helm.sh/release-name": "",
                  "meta.helm.sh/release-namespace": ""
                },
                "creationTimestamp": "",
                "finalizers": [
                  ""
                ],
                "labels": {
                  "app.kubernetes.io/managed-by": ""
                },
                "name": "",
                "resourceVersion": "",
                "uid": ""
              },
              "spec": {
                "accessModes": [
                  "ReadWriteMany"
                ],
                "capacity": {
                  "storage": ""
                },
                "claimRef": {
                  "apiVersion": "",
                  "kind": "",
                  "name": "",
                  "namespace": "",
                  "resourceVersion": "",
                  "uid": ""
                },
                "csi": {
                  "driver": "",
                  "volumeHandle": ""
                },
                "persistentVolumeReclaimPolicy": "",
                "volumeMode": ""
              },
              "status": {
                "phase": ""
              }
            },
            {
              "apiVersion": "v1",
              "kind": "PersistentVolume",
              "metadata": {
                "annotations": {
                  "meta.helm.sh/release-name": "",
                  "meta.helm.sh/release-namespace": ""
                },
                "creationTimestamp": "",
                "finalizers": [
                  ""
                ],
                "labels": {
                  "app.kubernetes.io/managed-by": ""
                },
                "name": "",
                "resourceVersion": "",
                "uid": ""
              },
              "spec": {
                "accessModes": [
                  "ReadWriteMany"
                ],
                "capacity": {
                  "storage": ""
                },
                "claimRef": {
                  "apiVersion": "",
                  "kind": "",
                  "name": "",
                  "namespace": "",
                  "resourceVersion": "",
                  "uid": ""
                },
                "csi": {
                  "driver": "",
                  "volumeHandle": ""
                },
                "persistentVolumeReclaimPolicy": "",
                "volumeMode": ""
              },
              "status": {
                "phase": ""
              }
            }
          ],
          "kind": "List",
          "metadata": {
            "resourceVersion": "",
            "selfLink": ""
          }
        }
      }
    ],
    "identifier": 69616871,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Im trying to replace the values starting with CONNECTOR using jq.",
      "jq 'with_entries( .value = ($ENV.\"\\(.value)\" // .value ) )' device-connectorbody.json"
    ],
    "utterance": "Replace all values whose value starts with CONNECTOR with the corresponding environment variable value if it exists, otherwise keep the original value.",
    "expressions": [
      "with_entries(.value = ($ENV[.value] // .value))"
    ],
    "data": [
      {
        "input": {
          "connector.class": "com.mongodb.kafka.connect.MongoSinkConnector",
          "value.converter.schema.registry.url": "CONNECTOR_SCHEMA_REGISTRY_URL",
          "topics": "CONNECTOR_TOPICS",
          "name": "CONNECTOR_NAME",
          "connection.uri": "CONNECTOR_MONGODB_CONNECTION_URI"
        },
        "output": {
          "connector.class": "com.mongodb.kafka.connect.MongoSinkConnector",
          "value.converter.schema.registry.url": "http://schema-registry:8081",
          "topics": "my_topic1,my_topic2",
          "name": "custom_connector",
          "connection.uri": "mongodb://admin:password@mongo:27017"
        }
      }
    ],
    "identifier": 69619558,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "jq --arg TS_SEC_START 1534574204 --arg SVC \"${SERVICE_NAME}.${RELEASE_NAMESPACE}\" -s 'map(select(.ts | (split(\".\")[0] + \"Z\") | fromdateiso8601 > ($TS_SEC_START | tonumber)) | select(.canary == $SVC))'",
      "However, when I try to pass `mysvc.prod` as an arg, I receive 0 responses.",
      "I've tried a few different options here, e.g. `.canary == \"$SVC\"` and `.canary == \"$SERVICE_NAME.$RELEASE_NAMESPACE\" but I can't get it to work."
    ],
    "utterance": "Filter objects where the top-level field canary equals the shell variable SVC, passed as an argument.",
    "expressions": [
      "map(select(.ts | (split(\".\")[0] + \"Z\") | fromdateiso8601 > ($TS_SEC_START | tonumber)) | select(.canary == $SVC))"
    ],
    "data": [
      {
        "input": [
          {
            "level": "error",
            "ts": "2021-08-16T02:41:34.128Z",
            "caller": "controller/scheduler.go:163",
            "msg": "Canary mysvc.prod not found",
            "canary": "mysvc.prod",
            "stacktrace": "github.com/fluxcd/flagger/pkg/controller.(*Controller).advanceCanary\n\t/workspace/pkg/controller/scheduler.go:163\ngithub.com/fluxcd/flagger/pkg/controller.CanaryJob.Start.func1\n\t/workspace/pkg/controller/job.go:39"
          }
        ],
        "output": [
          {
            "level": "error",
            "ts": "2021-08-16T02:41:34.128Z",
            "caller": "controller/scheduler.go:163",
            "msg": "Canary mysvc.prod not found",
            "canary": "mysvc.prod",
            "stacktrace": "github.com/fluxcd/flagger/pkg/controller.(*Controller).advanceCanary\n\t/workspace/pkg/controller/scheduler.go:163\ngithub.com/fluxcd/flagger/pkg/controller.CanaryJob.Start.func1\n\t/workspace/pkg/controller/job.go:39"
          }
        ]
      }
    ],
    "identifier": 69619865,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "So I need to remove endpoint1, endpoint2 and endpoint3 and preserve their children as array.",
      "Once you\u2019ve fixed all the typos in your input (several commas and some curly braces were missing), this should work:",
      "jq 'map(.endpoints |= [.[][]])'"
    ],
    "utterance": "Flatten all child arrays from the 'endpoints' object into a single array for each object, removing the endpoint keys.",
    "expressions": [
      "map(.endpoints |= [.[][]])"
    ],
    "data": [
      {
        "input": [
          {
            "serviceid": "service1",
            "endpoints": {
              "endpoint1": [
                "example1",
                "example2",
                "example3",
                "example4"
              ],
              "endpoint2": [
                "example3",
                "example4",
                "example5",
                "example6"
              ]
            },
            "version": "1.0"
          },
          {
            "serviceid": "service2",
            "endpoints": {
              "endpoint3": [
                "example7",
                "example8",
                "example9",
                "example10"
              ]
            },
            "version": "2.0"
          }
        ],
        "output": [
          {
            "serviceid": "service1",
            "endpoints": [
              "example1",
              "example2",
              "example3",
              "example4",
              "example3",
              "example4",
              "example5",
              "example6"
            ],
            "version": "1.0"
          },
          {
            "serviceid": "service2",
            "endpoints": [
              "example7",
              "example8",
              "example9",
              "example10"
            ],
            "version": "2.0"
          }
        ]
      }
    ],
    "identifier": 69619968,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "to get all unique keys and lists of matching values for these keys and then wrap them in {}",
      "The json looks like:",
      "{ \"sepalLength\": 5, \"sepalWidth\": 3.3, \"petalLength\": 1.4, \"petalWidth\": 0.2, \"species\": \"setosa\" }, { \"sepalLength\": 7, \"sepalWidth\": 3.2, \"petalLength\": 4.7, \"petalWidth\": 1.4, \"species\": \"versicolor\" }, { \"sepalLength\": 6.4, \"sepalWidth\": 3.2, \"petalLength\": 4.5, \"petalWidth\": 1.5, \"species\": \"versicolor\" }, ...",
      "And the result must look this way:",
      "{ \"species\": [\"setosa\", \"setosa\", ...], \"petalWidth\": [1.2, ...], ... }",
      "The following",
      "jq '[(map(keys[]) | unique[]) as $key | {($key): map(.[$key])} ] | add'",
      "if fed with your sample data will produce something like",
      "{ \"petalLength\": [1.4,4.7,4.5], \"petalWidth\": [0.2,1.4,1.5], \"sepalLength\": [5,7,6.4], \"sepalWidth\": [3.3,3.2,3.2], \"species\": [\"setosa\",\"versicolor\",\"versicolor\"] }"
    ],
    "utterance": "For an array of objects with potentially overlapping keys, collect all unique keys and create an object mapping each key to a list of its values from the array.",
    "expressions": [
      "[(map(keys[]) | unique[]) as $key | {($key): map(.[$key])} ] | add"
    ],
    "data": [
      {
        "input": [
          {
            "sepalLength": 5,
            "sepalWidth": 3.3,
            "petalLength": 1.4,
            "petalWidth": 0.2,
            "species": "setosa"
          },
          {
            "sepalLength": 7,
            "sepalWidth": 3.2,
            "petalLength": 4.7,
            "petalWidth": 1.4,
            "species": "versicolor"
          },
          {
            "sepalLength": 6.4,
            "sepalWidth": 3.2,
            "petalLength": 4.5,
            "petalWidth": 1.5,
            "species": "versicolor"
          }
        ],
        "output": {
          "petalLength": [
            1.4,
            4.7,
            4.5
          ],
          "petalWidth": [
            0.2,
            1.4,
            1.5
          ],
          "sepalLength": [
            5,
            7,
            6.4
          ],
          "sepalWidth": [
            3.3,
            3.2,
            3.2
          ],
          "species": [
            "setosa",
            "versicolor",
            "versicolor"
          ]
        }
      }
    ],
    "identifier": 69621147,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "So that it matches on \"that\" and removes the key and value both in that grouping, leaving json like this:\n\n{\n\"id\": \"whatever\",\n\"attributes\": [\n    {\n      \"key\": \"this\",\n      \"value\": \"A\"\n    },\n    {\n      \"key\": \"other\",\n      \"value\": \"C\"\n    }\n  ]\n}",
      "Try this\n\n```\n.attributes |= map(select(.key != \"that\"))\n```",
      "jq 'del(.attributes[] | select(.key == \"that\"))' test.json | sponge test.json"
    ],
    "utterance": "Remove the object from the 'attributes' array where 'key' has the value 'that'.",
    "expressions": [
      ".attributes |= map(select(.key != \"that\"))",
      "del(.attributes[] | select(.key == \"that\"))"
    ],
    "data": [
      {
        "input": {
          "id": "whatever",
          "attributes": [
            {
              "key": "this",
              "value": "A"
            },
            {
              "key": "that",
              "value": "B"
            },
            {
              "key": "other",
              "value": "C"
            }
          ]
        },
        "output": {
          "id": "whatever",
          "attributes": [
            {
              "key": "this",
              "value": "A"
            },
            {
              "key": "other",
              "value": "C"
            }
          ]
        }
      }
    ],
    "identifier": 69639335,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to get the Cartesian product of 2 lists, and tried following way:",
      "echo [[1,2,3],[4,5,6]] | jq '[.[0][],.[1][]]'",
      "I expected to get [[1,4],[1,5],[1,6],[2,4],[2,5],....], but what I really get is [1,2,3,4,5,6]",
      "You can still generate cartesian product on the arrays, when you do this",
      ".[0][] as $x | .[1][] as $y | [$x,$y]",
      "and put the whole thing into an array if you put the above filter inside `[..]` as",
      "[.[0][] as $x | .[1][] as $y | [$x,$y]]"
    ],
    "utterance": "Generate an array of all pairs consisting of one element from the first list and one from the second list, forming the Cartesian product.",
    "expressions": [
      "[.[0][] as $x | .[1][] as $y | [$x,$y]]"
    ],
    "data": [
      {
        "input": [
          [
            1,
            2,
            3
          ],
          [
            4,
            5,
            6
          ]
        ],
        "output": [
          [
            1,
            4
          ],
          [
            1,
            5
          ],
          [
            1,
            6
          ],
          [
            2,
            4
          ],
          [
            2,
            5
          ],
          [
            2,
            6
          ],
          [
            3,
            4
          ],
          [
            3,
            5
          ],
          [
            3,
            6
          ]
        ]
      }
    ],
    "identifier": 69639938,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I wish to make a formatting decision depending on the value of a json bool in the previous iteration of the array.",
      "I need to have a MARK on the first instance of igroup=true and then all subsequent igroup=true to be SPACE.",
      "To do this I need to know what the previous value of igroup was.",
      "foreach .result[].segments[].lines[] as {$igroup,$products} ( [null,false];    # init: pair of previous and current value [.[1],$igroup];  # update: update values [ \"line format\", if .[0] and .[0] == $igroup then \"SPACE\" else \"MARK\" end, ([$products[].name] | join(\",\")), \"end line format\" ] # extract: the collection of values to output ), [\"END\",0] | join(\"|\")",
      "( if $l > 0 then ( if .[$c].segments[0].lines[$l].igroup then ( if .[$c].segments[0].lines[$l - 1].igroup then \"SPACE\" else \"MARK\" end ) else \"MARK\" end ) else \"MARK\" end )"
    ],
    "utterance": "Format each line so that the first occurrence where igroup is true is labeled 'MARK', and subsequent consecutive igroup true entries are labeled 'SPACE', with labeling restarting to 'MARK' after igroup switches to false.",
    "expressions": [
      "foreach .result[].segments[].lines[] as {$igroup,$products} (\n  [null,false];\n  [.[1],$igroup];\n  [\n    \"line format\",\n    if .[0] and .[0] == $igroup then \"SPACE\" else \"MARK\" end,\n    ([$products[].name] | join(\",\")),\n    \"end line format\"\n  ]\n), [\"END\",0] | join(\"|\")",
      ".result\n| keys[] as $c\n|\n  (\n      (.[ $c ].segments[0].lines | keys[]) as $l |\n          [\n            ( $l ),\n            ( if $l > 0\n              then\n                ( if .[$c].segments[0].lines[$l].igroup\n                  then\n                    ( if .[$c].segments[0].lines[$l - 1].igroup\n                        then \"SPACE\"\n                        else \"MARK\" end\n                    )\n                  else \"MARK\" end\n                )\n              else \"MARK\" end\n            ),\n            \"line format\",\n            ( [ .[$c].segments[0].lines[$l].products[].name ] | join(\",\")),\n            \"end line format\"\n          ]\n  ),\n  [\"END\",\"0\"]\n| join(\"|\")"
    ],
    "data": [
      {
        "input": {
          "result": [
            {
              "cn": "abc",
              "segments": [
                {
                  "lines": [
                    {
                      "igroup": false,
                      "products": [
                        {
                          "name": "Should be MARK"
                        }
                      ]
                    },
                    {
                      "igroup": false,
                      "products": [
                        {
                          "name": "Should be MARK"
                        },
                        {
                          "name": "Addon to MARK"
                        }
                      ]
                    },
                    {
                      "igroup": true,
                      "products": [
                        {
                          "name": "Should be MARK First igroup=true BROKEN!!!"
                        }
                      ]
                    },
                    {
                      "igroup": true,
                      "products": [
                        {
                          "name": "Should be SPACE !! After first igroup=true"
                        }
                      ]
                    },
                    {
                      "igroup": true,
                      "products": [
                        {
                          "name": "Should be SPACE until next igroup=false"
                        }
                      ]
                    },
                    {
                      "igroup": false,
                      "products": [
                        {
                          "name": "Should be MARK"
                        }
                      ]
                    }
                  ]
                }
              ]
            }
          ],
          "id": 1
        },
        "output": [
          "line format|MARK|Should be MARK|end line format",
          "line format|MARK|Should be MARK,Addon to MARK|end line format",
          "line format|MARK|Should be MARK First igroup=true BROKEN!!!|end line format",
          "line format|SPACE|Should be SPACE !! After first igroup=true|end line format",
          "line format|SPACE|Should be SPACE until next igroup=false|end line format",
          "line format|MARK|Should be MARK|end line format",
          "END|0"
        ]
      }
    ],
    "identifier": 69640189,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Need name, weight and State information extracted from the above Json format and get a comma separated file for analysis like this:",
      "jq -r '.name as $name | .details[] | [$name, .weight, .State] | @csv' foo.json",
      "\"johnDoe\",\"150\",\"NJ\"",
      "\"johnDoe\",\"160\",\"NC\"",
      "\"DanLee\",\"180\",\"NJ\"",
      "\"DanLee\",\"190\",\"NC\"",
      "\"DanLee\",\"170\",\"NY\""
    ],
    "utterance": "Extract name, weight, and State fields from each details entry and generate comma-separated rows.",
    "expressions": [
      ".name as $name | .details[] | [$name, .weight, .State] | @csv"
    ],
    "data": [
      {
        "input": [
          {
            "name": "johnDoe",
            "dept": 45,
            "details": [
              {
                "salary": "76566",
                "weight": "150",
                "height": "160",
                "sex": "male",
                "country": "Usa",
                "State": "NJ"
              },
              {
                "salary": "76560",
                "weight": "160",
                "height": "180",
                "sex": "male",
                "country": "Usa",
                "State": "NC"
              }
            ]
          },
          {
            "name": "DanLee",
            "dept": 46,
            "details": [
              {
                "salary": "76566",
                "weight": "180",
                "height": "160",
                "sex": "male",
                "country": "Usa",
                "State": "NJ"
              },
              {
                "salary": "76560",
                "weight": "190",
                "height": "180",
                "sex": "male",
                "country": "Usa",
                "State": "NC"
              },
              {
                "salary": "87888",
                "weight": "170",
                "height": "160",
                "sex": "male",
                "country": "Usa",
                "State": "NY"
              }
            ]
          }
        ],
        "output": [
          "\"johnDoe\",\"150\",\"NJ\"",
          "\"johnDoe\",\"160\",\"NC\"",
          "\"DanLee\",\"180\",\"NJ\"",
          "\"DanLee\",\"190\",\"NC\"",
          "\"DanLee\",\"170\",\"NY\""
        ]
      }
    ],
    "identifier": 69640429,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "now I want to replace for example \"ab/any-dashboard\" to only any-dashboard, which means above command should print like below",
      "any-dashboard",
      "many-dashboard",
      "too-dashboard",
      "maximum-dashboard",
      "minimum-dashboard",
      "and I dont have knowledge in JQ."
    ],
    "utterance": "Display only the part of each uri field after the last slash for all uri fields in the data.",
    "expressions": [
      ".[] | .uri | split(\"/\") | last"
    ],
    "data": [
      {
        "input": [
          {
            "uri": "ab/any-dashboard"
          },
          {
            "uri": "ab/many-dashboard"
          },
          {
            "uri": "ab/too-dashboard"
          },
          {
            "uri": "ab/maximum-dashboard"
          },
          {
            "uri": "ab/minimum-dashboard"
          }
        ],
        "output": [
          "any-dashboard",
          "many-dashboard",
          "too-dashboard",
          "maximum-dashboard",
          "minimum-dashboard"
        ]
      }
    ],
    "identifier": 69647247,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to convert numbers in the `setpath` param to create arrays.",
      "The **desired output is the same as the original data** in `nested.json`",
      "And decode it with",
      "jq 'reduce to_entries[] as $item (null; setpath($item.key / \".\" | map(tonumber? // .); $item.value))' flat.json",
      "[",
      "  {",
      "    \"a\": \"a\",",
      "    \"b\": [",
      "      {",
      "        \"c\": \"c\"",
      "      }",
      "    ]",
      "  },",
      "  {",
      "    \"a\": \"a\",",
      "    \"b\": [",
      "      {",
      "        \"c\": \"c\"",
      "      }",
      "    ]",
      "  }",
      "]"
    ],
    "utterance": "Reconstruct the original nested structure from an object with flattened keys using dot notation and convert numeric key segments into array indices.",
    "expressions": [
      "reduce to_entries[] as $item (null; setpath($item.key / \".\" | map(tonumber? // .); $item.value))"
    ],
    "data": [
      {
        "input": {
          "0.a": "a",
          "0.b.0.c": "c",
          "1.a": "a",
          "1.b.0.c": "c"
        },
        "output": [
          {
            "a": "a",
            "b": [
              {
                "c": "c"
              }
            ]
          },
          {
            "a": "a",
            "b": [
              {
                "c": "c"
              }
            ]
          }
        ]
      }
    ],
    "identifier": 69647260,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to retrieve the number of `host_name` (hosts) that has `wkn` in its hostname.",
      "as you \"need to retrieve the number of `host_name`\", simply finalize with `length` (which needs an array, therefore I also changed `.[]` to `map`).",
      "... | jq -r '.items | map(select(.Hosts.host_name | test(\"wkn\"))) | length'"
    ],
    "utterance": "Count the number of hosts whose host_name contains the substring 'wkn'.",
    "expressions": [
      ".items | map(select(.Hosts.host_name | test(\"wkn\"))) | length"
    ],
    "data": [
      {
        "input": {
          "items": [
            {
              "href": "http://10.10.0.10:8080",
              "Hosts": {
                "cluster_name": "test",
                "host_name": "testmasternode.example.test.com",
                "ip": "10.10.0.10"
              }
            },
            {
              "href": "http://10.10.0.10:8080",
              "Hosts": {
                "cluster_name": "test",
                "host_name": "testwkn01.example.test.com",
                "ip": "10.10.0.11"
              }
            },
            {
              "href": "http://10.10.0.10:8080",
              "Hosts": {
                "cluster_name": "test",
                "host_name": "testwkn02.example.test.com",
                "ip": "10.10.0.12"
              }
            }
          ]
        },
        "output": 2
      }
    ],
    "identifier": 69653118,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "So what i want for example is to delete the element {\r\n  \"Name\": \"OS-Command-Injection-01\"\r\n}",
      "I need to do it conditionally",
      "jq '\r\n   ( .[].Statement.ManagedRuleGroupStatement.ExcludedRules | arrays ) |=\r\n      map(select(.Name != \"OS-Command-Injection-01\"))\r\n'",
      "jq 'walk(if type==\"object\" and\r\n            (.Name|IN(\"OS-Command-Injection-01\",\"SQL-Injection-03\"))\r\n         then empty\r\n         else . end)' input-file"
    ],
    "utterance": "Delete all objects with Name equal to \"OS-Command-Injection-01\" from any ExcludedRules array under ManagedRuleGroupStatement.",
    "expressions": [
      "(.[] .Statement.ManagedRuleGroupStatement.ExcludedRules | arrays) |= map(select(.Name != \"OS-Command-Injection-01\"))",
      "walk(if type==\"object\" and (.Name|IN(\"OS-Command-Injection-01\")) then empty else . end)"
    ],
    "data": [
      {
        "input": [
          {
            "Name": "Fortinet-all_rules",
            "Priority": 0,
            "Statement": {
              "ManagedRuleGroupStatement": {
                "VendorName": "Fortinet",
                "Name": "all_rules",
                "ExcludedRules": [
                  {
                    "Name": "Database-Vulnerability-Exploit-01"
                  },
                  {
                    "Name": "OS-Command-Injection-01"
                  },
                  {
                    "Name": "SQL-Injection-03"
                  }
                ]
              }
            }
          }
        ],
        "output": [
          {
            "Name": "Fortinet-all_rules",
            "Priority": 0,
            "Statement": {
              "ManagedRuleGroupStatement": {
                "VendorName": "Fortinet",
                "Name": "all_rules",
                "ExcludedRules": [
                  {
                    "Name": "Database-Vulnerability-Exploit-01"
                  },
                  {
                    "Name": "SQL-Injection-03"
                  }
                ]
              }
            }
          }
        ]
      }
    ],
    "identifier": 69661364,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "This is the JSON output that I get from the curl command:",
      "\"scan_overview\":{\r\n          \"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\":{\r\n             \"complete_percent\":100,\r\n             \"duration\":8,\r\n             \"end_time\":\"2021-10-22T10:28:57.356Z\",\r\n             \"report_id\":\"e83854eb-2304-4c58-85c9-a3e0fd9067a8\",\r\n             \"scan_status\":\"Success\",\r\n             \"severity\":\"Critical\",\r\n             ...",
      "A more confident match would be achievable by using more quotes:\r\n\r\nRESULTS=$RESULTS | jq '.scan_overview .\"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\" .scan_status'",
      "Use square brackets and double quotes around the problematic key:\r\n\r\n    jq '.scan_overview[\"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\"].scan_status'",
      "Wrap the key in question inside barckets and quotes like so:\r\n\r\n.scan_overview[\"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\"].scan_status"
    ],
    "utterance": "Extract the value of scan_status from scan_overview where the key is 'application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0'.",
    "expressions": [
      ".scan_overview[\"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\"].scan_status",
      ".scan_overview.\"application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0\".scan_status"
    ],
    "data": [
      {
        "input": {
          "addition_links": {
            "build_history": {
              "absolute": false,
              "href": "..."
            },
            "vulnerabilities": {
              "absolute": false,
              "href": "...."
            }
          },
          "digest": "sha256:bcd665be2b7c6725b410029db385d7c6c71a9ce557427cbd0f54d01a9",
          "extra_attrs": {
            "architecture": "amd64",
            "author": null,
            "created": "2021-10-22T10:28:46.058276455Z",
            "os": "linux"
          },
          "icon": "sha256:0048162a053ee7518615bef084403614f8bca43b40ae2e762e11e06",
          "id": 362,
          "labels": null,
          "manifest_media_type": "application/vnd.docker.distribution.manifest.v2+json",
          "media_type": "application/vnd.docker.container.image.v1+json",
          "project_id": 3,
          "pull_time": "2021-10-22T10:28:55.305Z",
          "push_time": "2021-10-22T10:28:49.341Z",
          "references": null,
          "repository_id": 12,
          "scan_overview": {
            "application/vnd.scanner.adapter.vuln.report.harbor+json; version=1.0": {
              "complete_percent": 100,
              "duration": 8,
              "end_time": "2021-10-22T10:28:57.356Z",
              "report_id": "e83854eb-2304-4c58-85c9-a3e0fd9067a8",
              "scan_status": "Success",
              "severity": "Critical",
              "start_time": "2021-10-22T10:28:49.827Z",
              "summary": {
                "summary": {
                  "Critical": 7,
                  "High": 47,
                  "Low": 18,
                  "Medium": 47
                },
                "total": 119
              }
            }
          }
        },
        "output": "Success"
      }
    ],
    "identifier": 69678761,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to collect IPs and print `id` and `host` next to it but if `IP` is used multiple times then print multiple `id` and `host` next to instead of a new line. `IP` and `host` can be the same for multiple items but `id` is unique.",
      "So the final output should look like this;\n$ echo <something>\n192.168.1.1 cat cherry dog apple\n192.168.1.2 cat banana",
      "you only need one `jq` call:\n\n```\njq --raw-output 'group_by(.ip)[] | [first.ip, (.[] | .host, .id)] | join(\" \")' input.json\n```",
      "$  jq -r 'group_by(.ip)[] | [.[0].ip, map(.host, .id)[]] | @tsv' input.json\n192.168.1.1     cat     cherry  dog     apple\n192.168.1.2     cat     banana"
    ],
    "utterance": "Group objects by ip and print each ip followed by the host and id values for all records sharing that ip, all on the same line.",
    "expressions": [
      "group_by(.ip)[] | [first.ip, (.[] | .host, .id)] | join(\" \")",
      "group_by(.ip)[] | [.[0].ip, map(.host, .id)[]] | @tsv"
    ],
    "data": [
      {
        "input": [
          {
            "host": "cat",
            "ip": "192.168.1.1",
            "id": "cherry"
          },
          {
            "host": "dog",
            "ip": "192.168.1.1",
            "id": "apple"
          },
          {
            "host": "cat",
            "ip": "192.168.1.2",
            "id": "banana"
          }
        ],
        "output": [
          "192.168.1.1 cat cherry dog apple",
          "192.168.1.2 cat banana"
        ]
      }
    ],
    "identifier": 69691269,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am currently trying to get the content of a JSON without showing the names of the key values.",
      "So, how can I hide the key values and only show the content of the JSON after doing the filters I did?",
      "And this is what I want to show\nWomen\n--------------\n38,Female,3, 5.6\n38,Female,3, 5.6\n32,Female,3, 6\n31,Female,3, 6\n19,Female,3, 5.5\n38,Female,3, 7\n20,Female,3, 6.1\n32,Female,3, 7\n42,Female,3, 6.7\n\nMen\n--------------\n72,Male,3, 7.4\n60,Male,3, 6.3\n33,Male,3, 5.4\n60,Male,3, 6.8\n60,Male,3, 7.4\n60,Male,3, 7\n72,Male,3, 6.2",
      "jq --raw-output '\n  group_by(.V2)[]\n  | if first.V2 == \"Male\" then \"Men\" else \"Women\" end,\n    \"--------------\",\n    (\n      .[]\n      | select(.V9 == 3.3)    # this filters to matching records\n      | [.V1, .V2, .V9, .V8]\n      | join(\",\")\n    )\n' csvjson.json"
    ],
    "utterance": "Output all fields' values for records where V9 equals 3.3, displaying them as comma-separated values without key names, grouped and labeled by gender.",
    "expressions": [
      "group_by(.V2)[]\n| if first.V2 == \"Male\" then \"Men\" else \"Women\" end,\n  \"--------------\",\n  (\n    .[]\n    | select(.V9 == 3.3)\n    | [.V1, .V2, .V9, .V8]\n    | join(\",\")\n  )"
    ],
    "identifier": 69694427,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": "The task does not specify the input JSON structure (e.g., array of objects, object with gender keys, etc.), nor does it provide an example input or output. This makes it unclear how to group and label by gender, and how to extract the values for each record.",
      "other_environment": null
    }
  },
  {
    "context": [
      "I have the following object :",
      "and i wish to transform it into :",
      "jq  '.fields_selected = .fields_selected | map({name : .measurement , aggregation : {step: .step , function: .aggregation}})' config-it-client.json",
      "I was able to figure out what i was doing wrong:",
      "jq  '.fields_selected = (.fields_selected | map({name : .measurement , aggregation : {step: .step , function: .aggregation}}))' config-it-client.json",
      ".fields_selected[] |= { name: .measurement, aggregation: { function: .aggregation, step } }",
      ".fields_selected |= map({ name: .measurement, aggregation: { function: .aggregation, step: .step } })",
      ".fields_selected[] |= {\n   name: .measurement,\n   aggregation: {\n      function: .aggregation,\n      step: .step\n   }\n}",
      ".fields_selected[] |= {\n   name: .measurement,\n   aggregation: {\n      function: .aggregation,\n      step\n   }\n}"
    ],
    "utterance": "Change each object in 'fields_selected' by renaming 'measurement' to 'name', and creating an 'aggregation' object with 'step' and 'function' keys populated from 'step' and 'aggregation'.",
    "expressions": [
      ".fields_selected |= map({ name: .measurement, aggregation: { function: .aggregation, step: .step } })",
      ".fields_selected[] |= { name: .measurement, aggregation: { function: .aggregation, step: .step } }",
      ".fields_selected[] |= { name: .measurement, aggregation: { function: .aggregation, step } }"
    ],
    "data": [
      {
        "input": {
          "db_credentials": {
            "database": "greengrass",
            "host": "localhost",
            "password": "yZqXJzXHLUsLlPm",
            "port": 7086,
            "username": "greengrass"
          },
          "default_interval": 90000,
          "fields_selected": [
            {
              "measurement": "ABPLCGD-GD_AB1AirFlowCalc",
              "aggregation": "last",
              "step": "10m",
              "timeserie_physical": "null",
              "timeserie_type": "null",
              "timeserie_interpolation": "null",
              "timeserie_unit": "null",
              "timeserie_step": "10"
            },
            {
              "measurement": "ABPLCGD-GD_AB1InletAirTempAct",
              "aggregation": "last",
              "step": "10m",
              "timeserie_physical": "null",
              "timeserie_type": "null",
              "timeserie_interpolation": "null",
              "timeserie_unit": "null",
              "timeserie_step": "10"
            }
          ]
        },
        "output": {
          "db_credentials": {
            "database": "greengrass",
            "host": "localhost",
            "password": "yZqXJzXHLUsLlPm",
            "port": 7086,
            "username": "greengrass"
          },
          "default_interval": 90000,
          "fields_selected": [
            {
              "name": "ABPLCGD-GD_AB1AirFlowCalc",
              "aggregation": {
                "step": "10m",
                "function": "last"
              }
            },
            {
              "name": "ABPLCGD-GD_AB1InletAirTempAct",
              "aggregation": {
                "step": "10m",
                "function": "last"
              }
            }
          ]
        }
      }
    ],
    "identifier": 69710645,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "but I want to pass the initial part as an argument, but it doesn't work and gives a syntax error. How can i make it dynamic that an argument becomes an expression on runtime when passed:",
      "We could provide a path using a JSON array.",
      "But a dot-separated path is much nicer.",
      "getpath( $root_path | split(\".\") ) as $root |",
      "( $root | .name                  ) |= $_job_name |",
      "( $root | .schedule.pause_status ) |= $_parent",
      "or just",
      "getpath( $root_path | split(\".\") ) |= (",
      "   .name                  |= $_job_name |",
      "   .schedule.pause_status |= $_parent",
      ")"
    ],
    "utterance": "Update the .name and .schedule.pause_status fields at a variable object path given as a dot-separated string argument, assigning values from input variables.",
    "expressions": [
      "getpath($root_path | split(\".\")) |= (.name = $_job_name | .schedule.pause_status = $_parent)"
    ],
    "data": [
      {
        "input": {
          "email_notifications": {
            "on_failure": "_"
          },
          "name": "_",
          "schedule": {
            "quartz_cron_expression": "_",
            "timezone_id": "Europe/Amsterdam",
            "pause_status": "_"
          }
        },
        "output": {
          "email_notifications": {
            "on_failure": "_"
          },
          "name": "foo",
          "schedule": {
            "quartz_cron_expression": "_",
            "timezone_id": "Europe/Amsterdam",
            "pause_status": "bar"
          }
        }
      },
      {
        "input": {
          "job_id": "_",
          "new_settings": {
            "email_notifications": {
              "on_failure": "_"
            },
            "name": "_",
            "schedule": {
              "quartz_cron_expression": "_",
              "timezone_id": "Europe/Amsterdam",
              "pause_status": "_"
            }
          }
        },
        "output": {
          "job_id": "_",
          "new_settings": {
            "email_notifications": {
              "on_failure": "_"
            },
            "name": "foo",
            "schedule": {
              "quartz_cron_expression": "_",
              "timezone_id": "Europe/Amsterdam",
              "pause_status": "bar"
            }
          }
        }
      }
    ],
    "identifier": 69722281,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to append this array to this json string using jq. My result only adds the hello element",
      "`jq` expects the `--arg` values to be strings, so we need to join the array into a single string, and then split it within jq:",
      "Then, import it to your main `jq` script using `--argjson` instead of `--arg`",
      "Or you can pass your array as a `null`-delimited elements stream, and have `jq` read it into a string to be split on `null` delimiters into a JSON array like this:"
    ],
    "utterance": "Construct an object with property method as array containing '1234', and property values as array ['hello', 'world'].",
    "expressions": [
      "jq -n --arg method \"$method\" --arg array \"$(printf '%s\\n' \"${arraynew[@]}\")\" '{method: [$method], values: ($array | split(\"\\n\"))}'",
      "jq -Rn --arg method \"$method\" --argjson array \"$jsonarray\" '{method: [$method], values: $array}'",
      "jq -n '{method: $method, values: $ARGS.positional}' --arg method \"$method\" --args \"${arraynew[@]}\"",
      "printf '%s\\0' \"${arraynew[@]}\" | jq -Rs '{method: $method, values: split(\"\\u0000\")}' --arg method \"$method\""
    ],
    "data": [
      {
        "input": {
          "method": "1234",
          "arraynew": [
            "hello",
            "world"
          ]
        },
        "output": {
          "method": [
            "1234"
          ],
          "values": [
            "hello",
            "world"
          ]
        }
      }
    ],
    "identifier": 69760036,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How would use JQ to determine if it contained a certain IP address?",
      "I really just want one final answer as whether or not the array contains one instance. of my requested IP.",
      "`.IPSet.Addresses | any(. == $tgt)`"
    ],
    "utterance": "Determine if the 'Addresses' array inside 'IPSet' contains exactly the value '255.255.255.8'.",
    "expressions": [
      ".IPSet.Addresses | any(. == \"255.255.255.8\")"
    ],
    "data": [
      {
        "input": {
          "IPSet": {
            "Description": "",
            "Name": "testip",
            "IPAddressVersion": "IPV4",
            "Id": "a1b2c3d4-5678-90ab-cdef-EXAMPLE1111",
            "ARN": "arn:aws:wafv2:us-west-2:123456789012:regional/ipset/testip/a1b2c3d4-5678-90ab-cdef-EXAMPLE1111",
            "Addresses": [
              "192.0.2.0/16"
            ]
          },
          "LockToken": "447e55ac-2396-4c6d-b9f9-86b67c17f8b5"
        },
        "output": false
      }
    ],
    "identifier": 69760130,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "hexdump -ve '1/1 \"%u\\n\"' foo.bin | jq -s '{content: .}'",
      "jq -Rs '{content: explode}' foo.bin",
      "jq -n --arg bin \"$(cat foo.bin)\" '{content: $bin | explode}'"
    ],
    "utterance": "Create an object with a 'content' key containing an array of byte values from the contents of a binary file.",
    "expressions": [
      "hexdump -ve '1/1 \"%u\\n\"' foo.bin | jq -s '{content: .}'",
      "jq -Rs '{content: explode}' foo.bin",
      "jq -n --arg bin \"$(cat foo.bin)\" '{content: $bin | explode}'"
    ],
    "data": [
      {
        "input": "foo.bin with bytes: 97 98 99",
        "output": {
          "content": [
            97,
            98,
            99
          ]
        }
      }
    ],
    "identifier": 69761040,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I need to make a json file based on another 2 files.",
      "Here is file 1\n\n    {\n    \"name\": \"foo\",\n      \"key\": \"1\",\n      \"id\": \"x\"\n    }\n    {\n      \"name\": \"bar\",\n      \"key\": \"2\",\n      \"id\": \"x\"\n    }\n    {\n      \"name\": \"baz\",\n      \"key\": \"3\",\n      \"id\": \"y\"\n    }",
      "file 2\n\n    {\n    \"name\": \"a\",\n      \"key\": \"1\"\n    }\n    {\n      \"name\": \"b\",\n      \"key\": \"1\"\n    }\n    {\n      \"name\": \"c\",\n      \"key\": \"2\"\n    }\n    {\n      \"name\": \"d\",\n      \"key\": \"2\"\n    }\n    {\n      \"name\": \"e\",\n      \"key\": \"3\"\n    }",
      "Expected Result:\n\n    {\n    \"x\": {\n        \"foo\": [\n          \"a\",\n          \"b\"\n        ],\n        \"bar\": [\n          \"c\",\n          \"d\"\n        ]\n      },\n      \"y\": {\n        \"baz\": [\n          \"e\"\n        ]\n      }\n    }",
      "Use `reduce` on the first file's items (`$i`) to successively build up the result object using `setpath` with fields from the item and values as a matching map on the secondary dictionary file (`$d`).",
      "jq -s --slurpfile d file2 '\n  reduce .[] as $i ({}; setpath(\n    [$i.id, $i.name];\n    [$d[] | select(.key == $i.key).name]\n  ))\n' file1",
      "For efficiency, the following solution first constructs a \"dictionary\" based on file2; furthermore, it does so without having to \"slurp\" it.",
      "< file2 jq -nc --slurpfile file1 file1 '\n  (reduce inputs as {$name, $key} ({};\n      .[$key] += [$name])) as $dict\n  | reduce $file1[] as {$name, $key, $id} ({};\n      .[$id] += [ {($name): $dict[$key]} ] )\n'"
    ],
    "utterance": "For each object in the first file, group objects by their id, and for each name under that id, produce an array of names from the second file whose key matches, creating a nested object mapping id \u2192 name \u2192 corresponding names from the second file.",
    "expressions": [
      "jq -s --slurpfile d file2 '\n  reduce .[] as $i ({}; setpath(\n    [$i.id, $i.name];\n    [$d[] | select(.key == $i.key).name]\n  ))\n' file1",
      "< file2 jq -nc --slurpfile file1 file1 '\n  (reduce inputs as {$name, $key} ({};\n      .[$key] += [$name])) as $dict\n  | reduce $file1[] as {$name, $key, $id} ({};\n      .[$id] += [ {($name): $dict[$key]} ] )\n'"
    ],
    "data": [
      {
        "input": [
          [
            {
              "name": "foo",
              "key": "1",
              "id": "x"
            },
            {
              "name": "bar",
              "key": "2",
              "id": "x"
            },
            {
              "name": "baz",
              "key": "3",
              "id": "y"
            }
          ],
          [
            {
              "name": "a",
              "key": "1"
            },
            {
              "name": "b",
              "key": "1"
            },
            {
              "name": "c",
              "key": "2"
            },
            {
              "name": "d",
              "key": "2"
            },
            {
              "name": "e",
              "key": "3"
            }
          ]
        ],
        "output": {
          "x": {
            "foo": [
              "a",
              "b"
            ],
            "bar": [
              "c",
              "d"
            ]
          },
          "y": {
            "baz": [
              "e"
            ]
          }
        }
      }
    ],
    "identifier": 69779787,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I have json data like below:",
      "I\u2019d like to print the value `id` and `type` in one line per item from the `Items` array, e.g.",
      "I tried `cat file | jq  '.Items[].id.S, .Items[].type.S'` but it prints `id` and `type` in separate lines. How can I achieve it with jq?",
      "jq --raw-output '.Items[] | .id.S + \", \" + .type.S' file",
      "jq --raw-output '.Items[] | \"\\(.id.S), \\(.type.S)\"' file"
    ],
    "utterance": "Print the values of id.S and type.S from each element of the Items array together on one line, separated by a comma.",
    "expressions": [
      ".Items[] | .id.S + \", \" + .type.S",
      ".Items[] | \"\\(.id.S), \\(.type.S)\""
    ],
    "data": [
      {
        "input": {
          "Items": [
            {
              "id": {
                "S": "c921e4eb-5958-424a-ae3a-b9cada0d9481"
              },
              "type": {
                "S": "transaction.1612878877726"
              }
            },
            {
              "id": {
                "S": "355057f0-4327-49c7-979f-5a27410d81ba"
              },
              "type": {
                "S": "transaction.1612345630260"
              }
            },
            {
              "id": {
                "S": "664dc02f-0ad8-484a-98a5-a403beea775b"
              },
              "type": {
                "S": "transaction.1612164919232"
              }
            }
          ]
        },
        "output": [
          "c921e4eb-5958-424a-ae3a-b9cada0d9481, transaction.1612878877726",
          "355057f0-4327-49c7-979f-5a27410d81ba, transaction.1612345630260",
          "664dc02f-0ad8-484a-98a5-a403beea775b, transaction.1612164919232"
        ]
      }
    ],
    "identifier": 69791344,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'm interested in the object with guid `_0no-4e9woLW` ... Is it possible to get somehow full path up to `\"guid\": \"_0no-4e9woLW\"`? guid keys are uniq in JSON. My goal is to type something like `jq '[\"full path I've somehow previously obtained\"][\"children\"].title'` ... And I will obtain: css child override parent style css child parent inheritance",
      "( paths | select(.[-1] == \"guid\") ) as $p |   if getpath($p)  == \"_0no-4e9woLW\" then getpath($p[:-1] + [\"children\"])[].title else empty end",
      ".. | objects | select( .guid == \"_0no-4e9woLW\" ) | .children[].title",
      "getpath( [ \"foo\", \"bar\", \"zoo\" ] ).children[].title"
    ],
    "utterance": "List the titles of all children of the object whose guid equals \"_0no-4e9woLW\".",
    "expressions": [
      "( paths | select(.[-1] == \"guid\") ) as $p | if getpath($p) == \"_0no-4e9woLW\" then getpath($p[:-1] + [\"children\"])[].title else empty end",
      ".. | objects | select( .guid == \"_0no-4e9woLW\" ) | .children[].title",
      "getpath([\"foo\",\"bar\",\"zoo\"]).children[].title"
    ],
    "data": [
      {
        "input": {
          "foo": {
            "bar": {
              "zoo": {
                "guid": "_0no-4e9woLW",
                "title": "frontend",
                "index": 3,
                "dateAdded": 1614198626454000,
                "lastModified": 1619505016472000,
                "id": 16518,
                "typeCode": 2,
                "type": "text/x-moz-place-container",
                "children": [
                  {
                    "guid": "2L7ZsYTM2RX0",
                    "title": "css child override parent style",
                    "index": 0,
                    "dateAdded": 1614198648005000,
                    "lastModified": 1614198648005000,
                    "id": 16519,
                    "typeCode": 1,
                    "type": "text/x-moz-place",
                    "uri": "https://www.google.com/search?client=firefox-b-d&q=css+child+override+parent+style"
                  },
                  {
                    "guid": "x3JHTAezzyI8",
                    "title": "css child parent inheritance",
                    "index": 1,
                    "dateAdded": 1614198648096000,
                    "lastModified": 1614198648096000,
                    "id": 16520,
                    "typeCode": 1,
                    "type": "text/x-moz-place",
                    "uri": "https://www.google.com/search?client=firefox-b-d&q=css+child+parent+inheritance"
                  }
                ]
              }
            }
          }
        },
        "output": [
          "css child override parent style",
          "css child parent inheritance"
        ]
      }
    ],
    "identifier": 69801232,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to flatten the output into an array of objects similar to this. I want most data from the `warnings` array except `.location` and `.render_path`.",
      "With assumptions based on your description provided, I guess you are looking to do this",
      ".scan_info + ( .warnings[] | del(.location, .render_path) )",
      "If you want the entire results packed into an array, enclose the above filter within square brackets(`[ ]`)"
    ],
    "utterance": "Produce an array of objects containing all properties from each element in warnings except location and render_path, merged with all top-level properties from scan_info.",
    "expressions": [
      "[ .warnings[] | del(.location, .render_path) + .scan_info ]"
    ],
    "data": [
      {
        "input": {
          "scan_info": {
            "security_warnings": 9,
            "start_time": "2021-11-01 14:44:58 +1100",
            "end_time": "2021-11-01 14:45:13 +1100",
            "brakeman_version": "5.1.2"
          },
          "warnings": [
            {
              "warning_type": "Redirect",
              "warning_code": 18,
              "fingerprint": "cae2f2cfd8",
              "check_name": "Redirect",
              "message": "Possible unprotected redirect",
              "file": "app/controllers/my/controller.rb",
              "line": 24,
              "link": "https://brakemanscanner.org/docs/warning_types/redirect/",
              "code": "redirect_to....",
              "render_path": null,
              "location": {
                "type": "method",
                "class": "My::Controller",
                "method": "request_token"
              },
              "user_input": "my.base_url",
              "confidence": "High"
            }
          ],
          "ignored_warnings": [],
          "errors": [],
          "obsolete": [
            "xxx",
            "yyyy"
          ]
        },
        "output": [
          {
            "warning_type": "Redirect",
            "warning_code": 18,
            "fingerprint": "cae2f2cfd8",
            "check_name": "Redirect",
            "message": "Possible unprotected redirect",
            "file": "app/controllers/my/controller.rb",
            "line": 24,
            "link": "https://brakemanscanner.org/docs/warning_types/redirect/",
            "code": "redirect_to....",
            "user_input": "my.base_url",
            "confidence": "High",
            "security_warnings": 9,
            "start_time": "2021-11-01 14:44:58 +1100",
            "end_time": "2021-11-01 14:45:13 +1100",
            "brakeman_version": "5.1.2"
          }
        ]
      }
    ],
    "identifier": 69806317,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Basically, I\u2019m looking to be able to step through the JSON, find components where the version is a string other than ^ (which may or may not be a specific tag), and import the URL for the submodule.",
      "$ jq -r < /tmp/o '.components[] | select( .version != \"^\" ) | \"git submodule init \\(.URL):\\(.version)\"'",
      "git submodule init https://my.git.repo/Standard:X"
    ],
    "utterance": "List all components where version is not '^', outputting 'git submodule init' commands with their URL and version.",
    "expressions": [
      ".components[] | select(.version != \"^\") | \"git submodule init \\(.URL):\\(.version)\""
    ],
    "data": [
      {
        "input": {
          "name": "Custom Development Components",
          "description": "Reusable objects to increase the efficiency and quality of custom development",
          "components": [
            {
              "name": "stats",
              "description": "Statistics",
              "URL": "https://my.git.repo/Statistics",
              "folder-name": "STATS",
              "version": "^"
            },
            {
              "name": "standard",
              "description": "Standard",
              "URL": "https://my.git.repo/Standard",
              "folder-name": "STANDARD",
              "version": "X"
            },
            {
              "name": "helpers",
              "description": "Helper Objects and Scripts",
              "URL": "https://my.git.repo/Helpers",
              "folder-name": "HELPERS",
              "version": "^"
            }
          ]
        },
        "output": [
          "git submodule init https://my.git.repo/Standard:X"
        ]
      }
    ],
    "identifier": 69813151,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'd like to add the name of the json file, from which the event is taken. I've tried input_filename within the \"{}\" but had no success.",
      "Using `cat *` destroys any filename information that exists. Modify your pipeline to\njq -r '...' *\nand `input_filename` will evaluate to the current filename."
    ],
    "utterance": "Output each selected event including its id, index, player_id, striker_name, and the name of the file it was read from, for all files where shot.type.id equals 88.",
    "expressions": [
      ".[] | select(.shot.type.id==88) | {filename: input_filename, id: .id, index: .index, player_id: .player.id, striker_name: .player.name}"
    ],
    "identifier": 69822085,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "How can I get my wanted output? I can't figure out how to preserve the object names (keys).",
      "This is what I want: input and expected output: What is the jq #some-jq-magic-here?",
      "The jq #some-jq-magic-here is this: jq '{apples, cherries}'. From own answer. See below.",
      "echo '...' | jq 'with_entries(select([.key] | inside([\"apples\", \"cherries\"]))))'",
      "# Try jq '{apples, cherries}'",
      "* jq '{apples, cherries}' <= Works.",
      "Without the dots it works just fine:"
    ],
    "utterance": "Return a new object containing only the keys apples and cherries with their original values, preserving their object names.",
    "expressions": [
      "{apples, cherries}",
      "with_entries(select([.key] | inside([\"apples\", \"cherries\"])))"
    ],
    "data": [
      {
        "input": {
          "apples": {
            "color": "green",
            "count": 3
          },
          "bananas": {
            "color": "yellow",
            "count": 4
          },
          "cherries": {
            "color": "red"
          }
        },
        "output": {
          "apples": {
            "color": "green",
            "count": 3
          },
          "cherries": {
            "color": "red"
          }
        }
      }
    ],
    "identifier": 69823661,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I'd like the output of this command to be on a single line :",
      "$ lscpu -J | jq -r '.lscpu[] | select(.field==\"CPU(s):\").data + \" x \", select(.field==\"Model name:\").data'",
      "So I tried this but it outputs ... nothing :",
      "$ lscpu -J | jq -r '.lscpu[] | select(.field==\"CPU(s):\").data + \" x \" + select(.field==\"Model name:\").data'",
      "You can keep the two items in a list, then join them together :",
      "lscpu -J | jq -r '.lscpu | map(select(.field | IN(\"CPU(s):\",\"Model name:\")).data) | join(\" x \")'"
    ],
    "utterance": "Combine the values of the 'data' fields where 'field' equals 'CPU(s):' and 'Model name:', separated by ' x ', on one line.",
    "expressions": [
      ".lscpu | map(select(.field | IN(\"CPU(s):\",\"Model name:\")).data) | join(\" x \")"
    ],
    "data": [
      {
        "input": {
          "lscpu": [
            {
              "field": "Architecture:",
              "data": "x86_64"
            },
            {
              "field": "CPU(s):",
              "data": "4"
            },
            {
              "field": "Model name:",
              "data": "AMD A4-6210 APU with AMD Radeon R3 Graphics"
            }
          ]
        },
        "output": "4 x AMD A4-6210 APU with AMD Radeon R3 Graphics"
      }
    ],
    "identifier": 69832505,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "But I need a more universal jq rule to remove single-quotes, e.g. `&quot;&#39;mystring&#39;&quot;` => `&quot;mystring&quot;`",
      "sub(\"'\\'(?<s>[^'\\']+)'\\'\"; .s)"
    ],
    "utterance": "Remove outer single quotes from a string, returning only the inner content.",
    "expressions": [
      "sub(\"'(?<s>[^']+)'\"; .s)"
    ],
    "data": [
      {
        "input": "'mysql'",
        "output": "mysql"
      },
      {
        "input": "'whatever'",
        "output": "whatever"
      }
    ],
    "identifier": 69842607,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I got the list of inactive channels (defined as not updated in last 90 days) by checking the timestamp of the last message in the channel directory in my export and writing the list of inactive channels to an array as this data is not available in the json objects themselves.",
      "Only problem is I don't know how to arrange it so that each of the channels in my array are removed from the input before writing to a new output file.",
      "More efficient is to feed jq all your channels to delete at once, rather than one-at-a-time.",
      "jq --rawfile inactive_channel_stream <(printf '%s\\0' \"${inactive_channels[@]}\") '\n  # generate an object mapping keys to delete to a truthy value\n  INDEX($inactive_channel_stream | split(\"\\u0000\")[]; .) as $inactive_channels\n\n  | map(select(($inactive_channels[.name] // false) | not))\n' <in.json >out.json"
    ],
    "utterance": "Remove all objects whose name is in the list of inactive channels provided separately.",
    "expressions": [
      "jq --rawfile inactive_channel_stream <(printf '%s\\0' \"${inactive_channels[@]}\") '\n  INDEX($inactive_channel_stream | split(\"\\u0000\")[]; .) as $inactive_channels\n  | map(select(($inactive_channels[.name] // false) | not))\n'"
    ],
    "data": [
      {
        "input": [
          {
            "id": "",
            "name": "announcements",
            "created": 1500000000,
            "creator": "",
            "is_archived": false,
            "is_general": true,
            "members": [
              ""
            ],
            "pins": [
              {
                "id": "",
                "type": "C",
                "created": 1500000000,
                "user": "",
                "owner": ""
              }
            ],
            "topic": {
              "value": "",
              "creator": "",
              "last_set": 0
            },
            "purpose": {
              "value": "company wide announcements",
              "creator": "",
              "last_set": 1500000000
            }
          },
          {
            "id": "",
            "name": "general",
            "created": 1500000000,
            "creator": "",
            "is_archived": false,
            "is_general": true,
            "members": [
              ""
            ],
            "pins": [
              {
                "id": "",
                "type": "C",
                "created": 1500000000,
                "user": "",
                "owner": ""
              }
            ],
            "topic": {
              "value": "",
              "creator": "",
              "last_set": 0
            },
            "purpose": {
              "value": "general",
              "creator": "",
              "last_set": 1500000000
            }
          }
        ],
        "output": []
      }
    ],
    "identifier": 69842972,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "From type AM_STARTED 'json/index/row', I want *event.AMStarted.startTime*",
      "Ignore TASK_STARTED - I do not want anything that has 'TASK' in type",
      "From JOB_INITED, I want all fields",
      "From JOB_FINISHED, I want finishTime, and values of FILE_BYTES_READ & FILE_BYTES_WRITTEN",
      "Expected Output :  \n> 1636039435822, 1636039438569, 356, 100, 1636043211181, 1156766568459, 1720873364662",
      "jq --raw-output '\n  map(\n    select(.type == \"AM_STARTED\").event.AMStarted.startTime,\n    select(.type == \"JOB_INITED\").event.JobInited[],\n    (\n      select(.type == \"JOB_FINISHED\").event.JobFinished\n      | .finishTime,\n      (\n        .totalCounters.groups[].counts[] | (\n          select(.name == \"FILE_BYTES_READ\").value,\n          select(.name == \"FILE_BYTES_WRITTEN\").value\n        )\n      )\n    )\n  )\n  | @csv\n' "
    ],
    "utterance": "Extract event.AMStarted.startTime from objects with type AM_STARTED, all fields from event.JobInited when type is JOB_INITED, finishTime from event.JobFinished and values of FILE_BYTES_READ and FILE_BYTES_WRITTEN from nested structures when type is JOB_FINISHED; ignore entries where type contains 'TASK'; output as comma-separated values.",
    "expressions": [
      "map(\n    select(.type == \"AM_STARTED\").event.AMStarted.startTime,\n    select(.type == \"JOB_INITED\").event.JobInited[],\n    (\n      select(.type == \"JOB_FINISHED\").event.JobFinished\n      | .finishTime,\n      (\n        .totalCounters.groups[].counts[] | (\n          select(.name == \"FILE_BYTES_READ\").value,\n          select(.name == \"FILE_BYTES_WRITTEN\").value\n        )\n      )\n    )\n  ) | @csv"
    ],
    "data": [
      {
        "input": [
          {
            "type": "AM_STARTED",
            "event": {
              "AMStarted": {
                "startTime": 1636039435822,
                "nodeManagerPort": 8041
              }
            }
          },
          {
            "type": "TASK_STARTED",
            "event": {
              "TaskStarted": {
                "taskid": "task_1351",
                "jobStatus": "Started"
              }
            }
          },
          {
            "type": "JOB_INITED",
            "event": {
              "JobInited": {
                "launchTime": 1636039438569,
                "totalMaps": 356,
                "totalReduces": 100
              }
            }
          },
          {
            "type": "JOB_FINISHED",
            "event": {
              "JobFinished": {
                "jobid": "job_1636039105678_0061",
                "finishTime": 1636043211181,
                "totalCounters": {
                  "name": "TOTAL_COUNTERS",
                  "groups": [
                    {
                      "name": "org.apache.hadoop.mapreduce.FileSystemCounter",
                      "displayName": "File System Counters",
                      "counts": [
                        {
                          "name": "FILE_BYTES_READ",
                          "displayName": "FILE: Number of bytes read",
                          "value": 1156766568459
                        },
                        {
                          "name": "FILE_BYTES_WRITTEN",
                          "displayName": "FILE: Number of bytes written",
                          "value": 1720873364662
                        },
                        {
                          "name": "RANDOM_BYTES",
                          "displayName": "RANDOM: Bytes",
                          "value": 1720873311162
                        }
                      ]
                    }
                  ]
                }
              }
            }
          }
        ],
        "output": "1636039435822,1636039438569,356,100,1636043211181,1156766568459,1720873364662"
      }
    ],
    "identifier": 69846016,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to create an array of objects in bash given an array in bash using jq.",
      "My desired result:",
      "[",
      "   {",
      "     \"id\": \"baf3eca8-c4bd-4590-bf1f-9b1515d521ba\",",
      "     \"names\": [",
      "       \"bob\",",
      "       \"sally\"",
      "     ]",
      "   },",
      "   {",
      "     \"id\": \"ef2fa922-2038-445c-9d32-8c1f23511fe4\",",
      "     \"names\": [",
      "       \"bob\",",
      "       \"sally\"",
      "     ]",
      "   }",
      "]",
      "printf '%s\\0' \"${IDS[@]}\" | jq -Rs '",
      "  split(\"\\u0000\") | map({id:., names: [\"bob\", \"sally\"]})",
      "'",
      "printf \"%s\\n\" \"${IDS[@]}\" | jq -nR '[{id: inputs, names: [\"bob\", \"sally\"]}]'"
    ],
    "utterance": "Create an array of objects where each object has an id from the bash array and a names field containing [\"bob\", \"sally\"]",
    "expressions": [
      "split(\"\\u0000\") | map({id:., names: [\"bob\", \"sally\"]})",
      "[{id: inputs, names: [\"bob\", \"sally\"]}]"
    ],
    "data": [
      {
        "input": [
          "baf3eca8-c4bd-4590-bf1f-9b1515d521ba",
          "ef2fa922-2038-445c-9d32-8c1f23511fe4"
        ],
        "output": [
          {
            "id": "baf3eca8-c4bd-4590-bf1f-9b1515d521ba",
            "names": [
              "bob",
              "sally"
            ]
          },
          {
            "id": "ef2fa922-2038-445c-9d32-8c1f23511fe4",
            "names": [
              "bob",
              "sally"
            ]
          }
        ]
      }
    ],
    "identifier": 69847625,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I am trying to get *key@value* pairs of JSON file below using ***jq***",
      "The *key@value* pair should look like this:",
      "`THING_1@dsfkes4s34jlis4jsj4lis4sli3`",
      "Currently I have tried this on my own:",
      "jq -r ' .THINGS | keys[] as $k | \"($k)@(.[$k].revision)\" ' things.json",
      "But it does not give the resul that I really want.",
      "You need to escape `(` :",
      "jq -r ' .THINGS | keys[] as $k | \"\\($k)@\\(.[$k].revision)\" ' things.json"
    ],
    "utterance": "Produce a list of key@value pairs, where each key under THINGS is joined with its corresponding revision value using the @ character.",
    "expressions": [
      ".THINGS | keys[] as $k | \"\\($k)@\\(.[$k].revision)\""
    ],
    "data": [
      {
        "input": {
          "STUFF_RELATED1": "STUFF_RELATED1",
          "STUFF_RELATED2": "STUFF_RELATED2",
          "THINGS": {
            "THING_2": {
              "details": {
                "stuff_branch": "user/dev"
              },
              "repository": "path/to/repo",
              "branch": "master",
              "revision": "dsfkes4s34jlis4jsj4lis4sli3"
            },
            "THING_1": {
              "details": {
                "stuff_branch": "master"
              },
              "repository": "path/to/repo",
              "branch": "master",
              "revision": "dsfkes4s34jlis4jsj4lis4sli3"
            }
          },
          "STUFF": {
            "revision": "4u324i324iy32g",
            "branch": "master"
          }
        },
        "output": [
          "THING_2@dsfkes4s34jlis4jsj4lis4sli3",
          "THING_1@dsfkes4s34jlis4jsj4lis4sli3"
        ]
      }
    ],
    "identifier": 69852741,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "var=\"test-54342\"\necho \"$( jq '.context[\"customer-name\"] = \"$var\"' cdk.json )\" > cdk.json",
      "{\n  \"app\": \"mySuperApp\",\n  \"context\": {\n    \"customer-name\": \"text-I-want-to-modify\"\n  }\n}"
    ],
    "utterance": "Replace the value of context[\"customer-name\"] with the contents of a Bash variable whose value is test-54342.",
    "expressions": [
      "jq --arg var \"$var\" '.context[\"customer-name\"] = $var' cdk.json"
    ],
    "data": [
      {
        "input": {
          "app": "mySuperApp",
          "context": {
            "customer-name": "text-I-want-to-modify"
          }
        },
        "output": {
          "app": "mySuperApp",
          "context": {
            "customer-name": "test-54342"
          }
        }
      }
    ],
    "identifier": 69858213,
    "annotated": {
      "invalid": {
        "variables": true,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Using `jq` or `jtc`, I\u2019d like to take a list of json files as inputs and generate a single json file based on the keys to the arrays found in each input file.",
      "I can't use it directly because I don't know what keys I will get in each input file.",
      "jq -n 'reduce inputs as $in ({}; \n        reduce ($in|keys_unsorted)[] as $k (.;\n         .[$k] += $in[$k]))'  file1 file2 file3 # ..."
    ],
    "utterance": "Merge multiple input files so that for each key present in any file, the output contains that key with all corresponding arrays' elements concatenated.",
    "expressions": [
      "jq -n 'reduce inputs as $in ({}; reduce ($in|keys_unsorted)[] as $k (.; .[$k] += $in[$k]))' file1 file2 file3"
    ],
    "data": [
      {
        "input": [
          {
            "key1": [
              "key1-out1"
            ]
          },
          {
            "key1": [
              "key1-out2"
            ]
          },
          {
            "key2": [
              "key2-out1"
            ]
          }
        ],
        "output": {
          "key1": [
            "key1-out1",
            "key1-out2"
          ],
          "key2": [
            "key2-out1"
          ]
        }
      }
    ],
    "identifier": 69862320,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": true,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "Here I'm trying to convert json to csv. Also, I'm trying to ignore duplicate values of 'color' attribute in same command.",
      "jq -r '.colors | unique_by(.color)[] | [.color, .category, .type] | @csv'",
      "\"black\",\"hue\",\"primary\"\n\"red\",\"hue\",\"primary\"\n\"white\",\"value\",\n\"yellow\",\"hue\",\"primary\"",
      "jq -r '.colors | unique_by(.color)[] | [.color//\"\", .category//\"\", .type//\"\"] | @csv'",
      "\"black\",\"hue\",\"primary\"\n\"red\",\"hue\",\"primary\"\n\"white\",\"value\",\"\"\n\"yellow\",\"hue\",\"primary\""
    ],
    "utterance": "Convert an array of objects to CSV rows for the fields color, category, and type, including only one row per unique color value.",
    "expressions": [
      ".colors | unique_by(.color)[] | [.color, .category, .type] | @csv",
      ".colors | unique_by(.color)[] | [.color//\"\", .category//\"\", .type//\"\"] | @csv"
    ],
    "data": [
      {
        "input": {
          "colors": [
            {
              "color": "black",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  255,
                  255,
                  1
                ],
                "hex": "#000"
              }
            },
            {
              "color": "white",
              "category": "value",
              "code": {
                "rgba": [
                  0,
                  0,
                  0,
                  1
                ],
                "hex": "#FFF"
              }
            },
            {
              "color": "red",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  0,
                  0,
                  1
                ],
                "hex": "#FF0"
              }
            },
            {
              "color": "black",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  0,
                  0,
                  255,
                  1
                ],
                "hex": "#00F"
              }
            },
            {
              "color": "yellow",
              "category": "hue",
              "type": "primary",
              "code": {
                "rgba": [
                  255,
                  255,
                  0,
                  1
                ],
                "hex": "#FF0"
              }
            },
            {
              "color": "white",
              "category": "hue",
              "type": "secondary",
              "code": {
                "rgba": [
                  0,
                  255,
                  0,
                  1
                ],
                "hex": "#0F0"
              }
            }
          ]
        },
        "output": [
          "\"black\",\"hue\",\"primary\"",
          "\"red\",\"hue\",\"primary\"",
          "\"white\",\"value\",",
          "\"yellow\",\"hue\",\"primary\""
        ]
      }
    ],
    "identifier": 69889510,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": true
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "1) ignore duplicate values of 'address' attribute using jq cmd. 2) Also, I have to ignore the duplicate json [].",
      "my jq cmd, address attribute is getting ignored, *but duplicate json[] is not*.",
      "You can use slurp and flatten:",
      "jq -rs '[\"Name\",\"Address\",\"Status\"], (flatten(1) | unique_by(.address)[] | [.name, .address, .status]) | @csv' sample.json > sample.csv",
      "Another solution was suggested by [pmf](https://stackoverflow.com/users/2158479/pmf) in the comments: replace . with [inputs](https://stedolan.github.io/jq/manual/v1.6/#inputs):",
      "jq -r '[\"Name\",\"Address\",\"Status\"], (inputs | unique_by(.address)[] | [.name, .address, .status]) | @csv' sample.json > sample.csv"
    ],
    "utterance": "Output CSV rows containing unique addresses, ensuring duplicate arrays in the input are ignored, and include only the first occurrence per address.",
    "expressions": [
      "jq -rs '[\"Name\",\"Address\",\"Status\"], (flatten(1) | unique_by(.address)[] | [.name, .address, .status]) | @csv'",
      "jq -r '[\"Name\",\"Address\",\"Status\"], (inputs | unique_by(.address)[] | [.name, .address, .status]) | @csv'"
    ],
    "data": [
      {
        "input": [
          [
            {
              "id": 1,
              "name": "Ram",
              "address": "Kolkata",
              "status": "active"
            },
            {
              "id": 3,
              "name": "Rahul",
              "address": "Kolkata",
              "status": "inactive"
            },
            {
              "id": 2,
              "name": "Kishore",
              "address": "Hyderabad",
              "status": "active"
            },
            {
              "id": 4,
              "name": "Manu",
              "address": "Kerala",
              "status": "active"
            },
            {
              "id": 5,
              "name": "Adam",
              "address": "Bangalore",
              "status": "inactive"
            }
          ],
          [
            {
              "id": 1,
              "name": "Ram",
              "address": "Kolkata",
              "status": "active"
            },
            {
              "id": 3,
              "name": "Rahul",
              "address": "Kolkata",
              "status": "inactive"
            },
            {
              "id": 2,
              "name": "Kishore",
              "address": "Hyderabad",
              "status": "active"
            },
            {
              "id": 4,
              "name": "Manu",
              "address": "Kerala",
              "status": "active"
            },
            {
              "id": 5,
              "name": "Adam",
              "address": "Bangalore",
              "status": "inactive"
            }
          ]
        ],
        "output": [
          "\"Name\",\"Address\",\"Status\"",
          "\"Adam\",\"Bangalore\",\"inactive\"",
          "\"Kishore\",\"Hyderabad\",\"active\"",
          "\"Manu\",\"Kerala\",\"active\"",
          "\"Ram\",\"Kolkata\",\"active\""
        ]
      }
    ],
    "identifier": 69898423,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "PKG_VERSION=$(shell echo v$$(jq .version package.json))",
      "You can try this to remove \\r:\r\n\r\n```\r\nPKG_VERSION=$(shell echo v$$(jq .version package.json | tr -d $$\\'\\\\r\\'))\r\n```",
      "You may need to apply the pipe to LATEST_TAG as well."
    ],
    "utterance": "Extract the version value from package.json as a string suitable for shell variable assignment with a 'v' prefix.",
    "expressions": [
      ".version",
      "\"v\"+.version"
    ],
    "data": [
      {
        "input": {
          "version": "1.1.0"
        },
        "output": "v1.1.0"
      }
    ],
    "identifier": 69900533,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to fetch **InstanceId** and **PrivateIpAddress** from json. I am using below command to fetch value:",
      "file.json | jq -r .Instances[].InstanceId",
      "file.json | jq -r .Instances[].PrivateIpAddress",
      "I am expecting below response",
      "i-123548874",
      "10.245.247.21",
      "$ cat file.json | jq -r .Instances[].PrivateIpAddress",
      "10.245.247.21",
      "$ cat file.json | jq -r .Instances[].InstanceId",
      "i-123548874",
      "$ jq -r .Instances[].PrivateIpAddress file.json",
      "10.245.247.21",
      "$ jq -r .Instances[].InstanceId file.json",
      "i-123548874"
    ],
    "utterance": "Extract the InstanceId and PrivateIpAddress values for all instances present in the Instances array.",
    "expressions": [
      ".Instances[].InstanceId",
      ".Instances[].PrivateIpAddress"
    ],
    "data": [
      {
        "input": {
          "Groups": [],
          "Instances": [
            {
              "AmiLaunchIndex": 0,
              "ImageId": "ami-id",
              "InstanceId": "i-123548874",
              "InstanceType": "c5.4xlarge",
              "KeyName": "kay",
              "LaunchTime": "dateValue",
              "Monitoring": {
                "State": "disabled"
              },
              "Placement": {
                "AvailabilityZone": "zone value",
                "GroupName": "",
                "Tenancy": "default"
              },
              "PrivateDnsName": "ip address",
              "PrivateIpAddress": "10.245.247.21",
              "ProductCodes": []
            }
          ]
        },
        "output": [
          "i-123548874",
          "10.245.247.21"
        ]
      }
    ],
    "identifier": 69914737,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  },
  {
    "context": [
      "I want to set value = 500 (int value) to \"total\" and \"MY_PROJECT\".",
      "But I need to update BOTH KEYS.",
      "The result must be:\n\n    {\n      \"total\": 500,\n      \"project\": \"MY_PROJECT\",\n      \"Severity\": [],\n      \"issues\": []\n    }",
      "No answer (so far) accounts for the requirement that `total` be of type `int`. This can be accomplished by using `--argjson` instead of `--arg`. Here's my two cents:\n\n```sh\njq --argjson total 500 --arg project \"MY_PROJECT\" '. + {$total, $project}' report.json\n```"
    ],
    "utterance": "Set the value 500 (as an integer) for the key total and the value MY_PROJECT for the key project in the file, updating both keys at once.",
    "expressions": [
      "jq --argjson total 500 --arg project \"MY_PROJECT\" '.total = $total | .project = $project' report.empty.json",
      "jq --argjson total 500 --arg project \"MY_PROJECT\" '. + {total: $total, project: $project}' report.empty.json"
    ],
    "data": [
      {
        "input": {
          "total": 0,
          "project": "",
          "Severity": [],
          "issues": []
        },
        "output": {
          "total": 500,
          "project": "MY_PROJECT",
          "Severity": [],
          "issues": []
        }
      }
    ],
    "identifier": 69915675,
    "annotated": {
      "invalid": {
        "variables": false,
        "stream": false,
        "multiple": false,
        "missing": false
      },
      "other_task": null,
      "other_environment": null
    }
  }
]