{
  "video_path": "./ref_datasets/part2/8530287002.mp4",
  "start_time": 0.0,
  "end_time": 3.04,
  "image_path": "./030890.jpg",
  "represents_multiple_types": false,
  "cross_video_duplicate": false,
  "original_detections": [
    {
      "video_path": "./ref_datasets/part2/8530287002.mp4",
      "start_time": 0.0,
      "end_time": 3.04,
      "image_path": "./ref_datasets/extracted_frames/person_-8788694671615632983_1_8530287002.mp4_0.00_3.04.jpg",
      "type": "person"
    }
  ],
  "types": [
    "person"
  ],
  "persons": [
    {
      "body_box": 0,
      "skeleton": 0,
      "face_box": 0,
      "qwen_detailing": {
        "background": false,
        "age": "adult",
        "gender": "female",
        "emotion": "neutral",
        "clothing_description": "The person appears to be wearing a dark top, but the details are not clear due to the focus on the face.",
        "clothing": {
          "vague": false,
          "clothing": [
            {
              "possible_names": [
                "hair",
                "eyebrows",
                "eyes",
                "nose",
                "mouth",
                "ears",
                "skin"
              ],
              "name": "hair",
              "type": "accessory",
              "color": [
                "blonde"
              ],
              "belonging_confident": true,
              "existence_confident": true
            }
          ]
        },
        "objects": [],
        "description": "The person is an adult female with long blonde hair. She is in the foreground of the image, and her expression appears neutral. She seems to be wearing a dark top, though the details are not clearly visible. There are no other objects or items associated with her in the image.",
        "blurry": false,
        "face_seen": true,
        "emotion_description": "The person appears to have a calm and composed expression, with no strong emotions displayed.",
        "meaningful": false,
        "story": "unknown",
        "race": "white",
        "text": "no_text",
        "text_relationship": "no_text",
        "behaviour": "The person in the image appears to be outdoors, possibly in a park or garden setting, as indicated by the greenery in the background. Their long blonde hair is slightly tousled, suggesting they might have been moving or experiencing a gentle breeze. The individual's expression seems calm and contemplative, with their mouth slightly open as if they are about to speak or are in the middle of a conversation. This could indicate that they are engaged in a dialogue with someone off-camera or perhaps reflecting on something thoughtfully. The natural lighting and serene environment contribute to an overall sense of tranquility and introspection.",
        "intention": "They are seeking peace and clarity through quiet reflection in nature",
        "intention_ok": true
      },
      "facex_detailing": {
        "landmarks": [
          [
            0.31723118687403346,
            0.3341424805777414
          ],
          [
            0.314602409791024,
            0.4179976327078683
          ],
          [
            0.31209671681170303,
            0.5010825906481061
          ],
          [
            0.3116001075311076,
            0.5822129930768695
          ],
          [
            0.31730606855735893,
            0.669560432434082
          ],
          [
            0.3364513819283318,
            0.73957245690482
          ],
          [
            0.3668311249065612,
            0.7950398581368583
          ],
          [
            0.40107213626837446,
            0.8403365952627999
          ],
          [
            0.44173035681070316,
            0.8619377953665597
          ],
          [
            0.4824450427134122,
            0.8647997719900948
          ],
          [
            0.5252231470530941,
            0.8479181017194476
          ],
          [
            0.5703754152570452,
            0.8217575890677316
          ],
          [
            0.6066833857624303,
            0.7672191347394671
          ],
          [
            0.6301365304206098,
            0.6926003183637347
          ],
          [
            0.6432730012706348,
            0.6011477879115513
          ],
          [
            0.6552893382097994,
            0.5182662010192871
          ],
          [
            0.6659792970156386,
            0.4184377534048898
          ],
          [
            0.34617879538397706,
            0.2016078063419887
          ],
          [
            0.37340629453815166,
            0.1735325370516096
          ],
          [
            0.3985526252360571,
            0.1790858166558402
          ],
          [
            0.42755967422965024,
            0.19834303855895996
          ],
          [
            0.4547906985949902,
            0.23855643612997873
          ],
          [
            0.5156832283451444,
            0.25444250447409494
          ],
          [
            0.5457192671086107,
            0.2316405943461827
          ],
          [
            0.579651384747454,
            0.22281604153769358
          ],
          [
            0.6114902552394639,
            0.24164642606462755
          ],
          [
            0.6323080604984647,
            0.280555180140904
          ],
          [
            0.47862400822341444,
            0.29441046714782715
          ],
          [
            0.47370684224934806,
            0.3545518602643694
          ],
          [
            0.47052851186266964,
            0.4088101387023926
          ],
          [
            0.4659929674296152,
            0.46640154293605257
          ],
          [
            0.435667520451049,
            0.5159607614789691
          ],
          [
            0.45088044139778327,
            0.5314558574131557
          ],
          [
            0.4648526894549529,
            0.5480314322880336
          ],
          [
            0.48149041552983574,
            0.5427212033952985
          ],
          [
            0.49814309469823326,
            0.5350308418273926
          ],
          [
            0.37383718381059317,
            0.28172876153673443
          ],
          [
            0.39469072928740867,
            0.26585449491228375
          ],
          [
            0.4170985209977343,
            0.2804158755711147
          ],
          [
            0.4353915777678291,
            0.3125577654157366
          ],
          [
            0.4124458930676892,
            0.3064767633165632
          ],
          [
            0.39198668313523133,
            0.2925727367401123
          ],
          [
            0.5282148194454965,
            0.33509254455566406
          ],
          [
            0.5552128234789485,
            0.31659017290387836
          ],
          [
            0.5718556010652155,
            0.3188855988638742
          ],
          [
            0.5922191421900477,
            0.347379275730678
          ],
          [
            0.5728311935705798,
            0.35057095118931364
          ],
          [
            0.5528712775026049,
            0.33558481080191477
          ],
          [
            0.38462918779502314,
            0.6109684535435268
          ],
          [
            0.41446209691819696,
            0.582439695085798
          ],
          [
            0.4433768241178422,
            0.5784309250967843
          ],
          [
            0.4629779279232025,
            0.5899317605154855
          ],
          [
            0.48026715950774296,
            0.581059387751988
          ],
          [
            0.5097217321218479,
            0.6083974157060895
          ],
          [
            0.5372882636175269,
            0.6491069112505231
          ],
          [
            0.506278925672883,
            0.674142565046038
          ],
          [
            0.4759627449636658,
            0.6831134387425014
          ],
          [
            0.45526036519025054,
            0.6793700626918248
          ],
          [
            0.4352481365381252,
            0.6720971379961286
          ],
          [
            0.4081467469710679,
            0.6514081273760114
          ],
          [
            0.39570196129026863,
            0.6125149045671735
          ],
          [
            0.4383377865577737,
            0.6089469364711216
          ],
          [
            0.4586574291189512,
            0.619492667061942
          ],
          [
            0.47612611332109994,
            0.6262196132114956
          ],
          [
            0.5255093787042867,
            0.6474851199558803
          ],
          [
            0.4767614415979811,
            0.6441489628383091
          ],
          [
            0.4590399803859847,
            0.6380703789847237
          ],
          [
            0.43876804826515065,
            0.627793720790318
          ]
        ],
        "visibility": [
          1.0,
          7.432254642480984e-05,
          1.1397418120395741e-06,
          1.112413663584993e-17,
          0.54585200548172,
          0.47640544176101685,
          1.2884342030783369e-11,
          8.040417697246339e-21,
          1.0,
          3.4026534478925896e-08,
          1.567361800243472e-12,
          1.1740851624253423e-09,
          3.601253774832003e-05,
          0.8836851119995117,
          3.026122971996892e-12,
          3.373641088710855e-10,
          0.02205316722393036,
          5.606495268040135e-09,
          5.446044074150676e-14,
          3.3364408458247397e-11,
          3.1755084557805247e-23,
          7.929374031448637e-18,
          1.6046135087322e-08,
          6.444478640332818e-05,
          1.592967809612868e-23,
          7.060192515499663e-21,
          1.2430255087768956e-15,
          1.0464835942247674e-10,
          2.3550370434701796e-14
        ],
        "headpose": {
          "pitch": 4.108984601845372,
          "yaw": 3.3607006660036864,
          "roll": 4.290101581840246
        },
        "attributes": {
          "5 oClock Shadow": 8.142357046381221e-07,
          "Arched Eyebrows": 0.0900905504822731,
          "Attractive": 0.5322673320770264,
          "Bags Under Eyes": 0.017760539427399635,
          "Bald": 1.929310400328177e-07,
          "Bangs": 0.0001114709593821317,
          "Big Lips": 0.2672913372516632,
          "Big Nose": 0.012194103561341763,
          "Black Hair": 6.630022107856348e-05,
          "Blond Hair": 0.560789942741394,
          "Blurry": 3.584526348276995e-05,
          "Brown Hair": 0.07931080460548401,
          "Bushy Eyebrows": 0.028956077992916107,
          "Chubby": 0.0037024549674242735,
          "Double Chin": 0.0021373273339122534,
          "Eyeglasses": 6.99512311257422e-05,
          "Goatee": 1.4604064517698134e-06,
          "Gray Hair": 0.00031111377757042646,
          "Heavy Makeup": 0.7308536171913147,
          "High Cheekbones": 0.30375370383262634,
          "Male": 5.815434633404948e-05,
          "Mouth Slightly Open": 0.999761164188385,
          "Mustache": 2.0265679268050008e-07,
          "Narrow Eyes": 0.5565432906150818,
          "No Beard": 0.9999871253967285,
          "Oval Face": 0.31016701459884644,
          "Pale Skin": 0.0025144144892692566,
          "Pointy Nose": 0.1315765231847763,
          "Receding Hairline": 0.004415017087012529,
          "Rosy Cheeks": 0.023183947429060936,
          "Sideburns": 3.0187246125024103e-07,
          "Smiling": 0.5786678194999695,
          "Straight Hair": 0.5764093995094299,
          "Wavy Hair": 0.28108081221580505,
          "Wearing Earrings": 0.0044852131977677345,
          "Wearing Hat": 2.8254678909434006e-05,
          "Wearing Lipstick": 0.9635164141654968,
          "Wearing Necklace": 0.09448147565126419,
          "Wearing Necktie": 1.3062969628663268e-05,
          "Young": 0.9865512847900391
        },
        "age": [
          0.029450153931975365,
          0.9490134716033936,
          0.9520125389099121,
          0.6895532011985779,
          0.12760888040065765,
          0.0016670044278725982,
          0.0002450842293910682,
          1.433628654012864e-06
        ],
        "race": [
          0.9993997812271118,
          0.0009215929312631488,
          0.26453495025634766,
          0.022735388949513435,
          0.11405433714389801
        ],
        "gender": [
          0.00040228519355878234,
          0.9997407793998718
        ]
      },
      "deepface_detailing": {
        "emotion": {
          "angry": 19.343219697475433,
          "disgust": 5.26596330985285e-06,
          "fear": 26.57769024372101,
          "happy": 0.009386490273755044,
          "sad": 36.90418303012848,
          "surprise": 0.0397024501580745,
          "neutral": 17.12581217288971
        },
        "dominant_emotion": "sad",
        "region": {
          "x": 0,
          "y": 0,
          "w": 1746,
          "h": 2159,
          "left_eye": null,
          "right_eye": null
        },
        "face_confidence": 0.0,
        "age": 33,
        "gender": {
          "Woman": 99.9671220779419,
          "Man": 0.032874062890186906
        },
        "dominant_gender": "Woman",
        "race": {
          "asian": 14.856501809150089,
          "indian": 4.826370169304023,
          "black": 2.0989050477090507,
          "white": 36.387186431121016,
          "middle eastern": 22.429339434465135,
          "latino hispanic": 19.40170046101215
        },
        "dominant_race": "white"
      }
    }
  ],
  "detect_results": {
    "body_boxes": [
      [
        0.08499181270599365,
        0.0024508750066161156,
        0.815173864364624,
        0.9871764183044434
      ]
    ],
    "face_boxes": [
      [
        0.3321478068828583,
        0.016790607944130898,
        0.6355856657028198,
        0.8469833135604858
      ]
    ],
    "skeletons": [
      {
        "dw_body": [
          [
            -1.0,
            -1.0
          ],
          [
            0.4398470535063081,
            0.9553904876690363
          ],
          [
            0.13986951973040904,
            0.9393919961414301
          ],
          [
            0.12352197020583686,
            0.9801154291207913
          ],
          [
            0.19054692325658265,
            0.9626625292724938
          ],
          [
            0.7398245872822071,
            0.9713889791966426
          ],
          [
            0.9131086122426723,
            0.951027262706962
          ],
          [
            0.7120337530904346,
            0.9626625292724938
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            0.3965260472661919,
            0.32854050145101144
          ],
          [
            0.5567320326069989,
            0.3430845846579263
          ],
          [
            0.29190173030893013,
            0.40998736740973396
          ],
          [
            0.6564520847068893,
            0.4536196170304781
          ]
        ],
        "dw_hand_1": [
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ]
        ],
        "dw_hand_2": [
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ]
        ],
        "dw_face": [
          [
            0.31642305459578823,
            0.3256316848096287
          ],
          [
            0.3066145248810451,
            0.40707855076835103
          ],
          [
            0.3066145248810451,
            0.48852541672707356
          ],
          [
            0.3115187897384167,
            0.572881099327179
          ],
          [
            0.3245968293580744,
            0.6456015153617528
          ],
          [
            0.3491181536449326,
            0.7241395646790925
          ],
          [
            0.37690898783670534,
            0.782315897506751
          ],
          [
            0.4161431066956785,
            0.8288569637688785
          ],
          [
            0.4602814904120235,
            0.8463098636171759
          ],
          [
            0.5044198741283682,
            0.8375834136930272
          ],
          [
            0.5436539929873413,
            0.802677613996432
          ],
          [
            0.5779838469889428,
            0.750318914451539
          ],
          [
            0.6074094361331727,
            0.680507315058348
          ],
          [
            0.6253917406102023,
            0.6077868990237745
          ],
          [
            0.6352002703249454,
            0.5263400330650517
          ],
          [
            0.6417392901347744,
            0.44780198374771235
          ],
          [
            0.6417392901347744,
            0.3634463011476069
          ],
          [
            0.34421388878756104,
            0.26163771869920377
          ],
          [
            0.3687352130744192,
            0.24127600220952317
          ],
          [
            0.3932565373612774,
            0.2383671855681402
          ],
          [
            0.41777786164813574,
            0.2470936354922891
          ],
          [
            0.4406644309825367,
            0.26163771869920377
          ],
          [
            0.5142284038431114,
            0.2674553519819697
          ],
          [
            0.542019238034884,
            0.2558200854164378
          ],
          [
            0.5698100722266568,
            0.2529112687750549
          ],
          [
            0.5976009064184296,
            0.26163771869920377
          ],
          [
            0.6204874757528305,
            0.2878170684716503
          ],
          [
            0.4733595300316811,
            0.33726695137516033
          ],
          [
            0.4717247750792237,
            0.38089920099590446
          ],
          [
            0.46845526517430935,
            0.4274402672580316
          ],
          [
            0.46682051022185217,
            0.4739813335201589
          ],
          [
            0.42922114631533625,
            0.520522399782286
          ],
          [
            0.4472034507923656,
            0.5263400330650517
          ],
          [
            0.46518575526939504,
            0.5321576663478179
          ],
          [
            0.48480281469888153,
            0.5292488497064349
          ],
          [
            0.502785119175911,
            0.5263400330650517
          ],
          [
            0.36383094821704753,
            0.3314493180923944
          ],
          [
            0.38835227250390586,
            0.3081787849613309
          ],
          [
            0.41450835174322126,
            0.3139964182440968
          ],
          [
            0.4341254111727078,
            0.3459934012993093
          ],
          [
            0.4096040868858497,
            0.35471985122345795
          ],
          [
            0.3850827625989914,
            0.3518110345820752
          ],
          [
            0.5174979137480259,
            0.35471985122345795
          ],
          [
            0.540384483082427,
            0.32854050145101144
          ],
          [
            0.5681753172741997,
            0.32854050145101144
          ],
          [
            0.5910618866086006,
            0.35471985122345795
          ],
          [
            0.5681753172741997,
            0.3721727510717558
          ],
          [
            0.542019238034884,
            0.3692639344303728
          ],
          [
            0.3932565373612774,
            0.6194221655893063
          ],
          [
            0.4210473715530502,
            0.5961516324582425
          ],
          [
            0.4504729606972801,
            0.5903339991754767
          ],
          [
            0.4635510003169378,
            0.5932428158168598
          ],
          [
            0.47826379488905274,
            0.5932428158168598
          ],
          [
            0.5093241389857398,
            0.6048780823823913
          ],
          [
            0.5371149731775126,
            0.633966248796221
          ],
          [
            0.5158631587955687,
            0.6688720484928162
          ],
          [
            0.49134183450871044,
            0.6921425816238798
          ],
          [
            0.4602814904120235,
            0.697960214906646
          ],
          [
            0.4324906562202507,
            0.686324948341114
          ],
          [
            0.4096040868858497,
            0.6572367819272843
          ],
          [
            0.3997955571711063,
            0.6223309822306891
          ],
          [
            0.43085590126779344,
            0.6136045323065402
          ],
          [
            0.4635510003169378,
            0.6165133489479232
          ],
          [
            0.49624609936608216,
            0.6223309822306891
          ],
          [
            0.5289411984152264,
            0.636875065437604
          ],
          [
            0.4978808543185393,
            0.6543279652859015
          ],
          [
            0.46191624536448056,
            0.6572367819272843
          ],
          [
            0.427586391362879,
            0.6456015153617528
          ]
        ],
        "dw_foot_1": [
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ]
        ],
        "dw_foot_2": [
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ],
          [
            -1.0,
            -1.0
          ]
        ]
      }
    ]
  },
  "new_filename": "030890",
  "objects": [],
  "mask_file": "./person_labeling/./030890.jpg_masks.pkl",
  "hoi_processed": true,
  "scene": "A serene outdoor setting with lush greenery and soft natural light creates a tranquil atmosphere with a focus on the vibrant colors of nature and the gentle play of light through the trees",
  "overall_past": "Before the current scene, the person likely walked slowly through the garden or park, perhaps following a winding path lined with trees and flowers, pausing to breathe in the fresh air and take in the peaceful surroundings. The slight tousling of their long blonde hair suggests they had been moving gently, possibly after a short stroll or a brief conversation with someone who has now stepped out of frame. The calm, contemplative expression and slightly open mouth indicate they were in the middle of sharing a quiet thought or listening to a meaningful remark—perhaps a heartfelt or reflective exchange—before the moment captured in the image, where the soft light filtering through the leaves and the vibrant greenery frame a quiet, introspective pause in their day.",
  "overall_past_clean": "They had been walking at a deliberate pace along a meandering path, their steps soft and unhurried, as if each footfall was a quiet acknowledgment of the stillness around them, pausing only to let the hush of the moment settle into their breath, their thoughts lingering on a fleeting exchange that had left them suspended in reflection.",
  "past_scene_ok": true,
  "overall_future": "Given the serene outdoor setting, the person’s calm and contemplative expression, and the gentle, natural lighting, it is plausible that after the current scene, the individual will take a slow, mindful step forward—perhaps toward a small path winding through the trees or toward a nearby bench—continuing their quiet reflection or preparing to continue a gentle conversation with someone just out of view. The moment feels suspended in stillness, suggesting a pause in thought or dialogue, and the natural environment seems to invite a gentle progression, not of action, but of inner journey—implying a deepening of awareness or emotional clarity.",
  "overall_future_clean": "The figure exhales slowly, then moves forward with deliberate grace, each step grounding them deeper into the present, as if the earth itself is guiding their quiet ascent into clarity.",
  "future_scene_ok": true
}