- name: "sample_frame"
  description: |
    Function that returns a detailed description of sampled frames in the video between a specified range.
    Output consists of one description, based on the sampled frames.
    The default sample rate is 1 fps, and the maximum number of frames is 30.
    If the specified range contains more than 30 frames, i.e., the range exceeds 30 seconds, the fps rate gets reduced so that the number of frames is less than or equal to 30.
  args:
    start_time:
      type: 'string'
      description: |
        The start time of the range to sample frames in the format of mm:ss.
    end_time:
      type: 'string'
      description: |
        The end time of the range to sample frames in the format of mm:ss.
    angle:
      type: "string"
      description: |
        Camera angle of the video.
      enum:
        - "center"
        - "top"
        - "right-bottom"
        - "right-center"
        - "right-top"
        - "left-bottom"
        - "left-center"
        - "left-top"
  required:
    - "start_time"
    - "end_time"
    - "angle"
- name: "zoom_in"
  description: |
    Function that crops one frame and returns the detailed descriptoin of the cropped frame.
    You can specify where to zoom-in by a normalized bounding box in the format of [x1,y1,x2,y2], where 0 < x1 < x2 < 1 and 0 < y1 < y2 < 1.
    (x1, y1) corresponds to the top left corner, and (x2,y2) corresponds to the bottom right coner.
  args:
    frame_id:
      type: "integer"
      description: "the id of the frame to zoom-in"
    angle:
      type: 'string'
      description: |
        Camera angle of the video.
      enum:
        - "center"
        - "top"
        - "right-bottom"
        - "right-center"
        - "right-top"
        - "left-bottom"
        - "left-center"
        - "left-top"
    bounding_box:
      type: "array"
      description: "Normalized bounding box in the format of [x1,y1,x2,y2]"
      items: {
          type: "number"
        }
  required:
    - "frame_id"
    - "angle"
    - "bounding_box"
- name: "check_instruction"
  description: |
    Function that returns the assembly instruction in text.
    An instruction is represented as a directed, acycle partial graph, where a node is a step and a relation is a order of steps.
    For instance, if there is a directed edge between node A and node B (A -> B), A needs to be done before B is performed.
    Instructions are represented as text in the DOT format.
  args: null
  required: null
- name: "check_final_picture"
  description: |
    Function that returns the detailed description of the final picture, parts image, and possibly with an exploded view as well.
  args: null
  required: null
