$BENCH_HOME/bin/storm-pomdp --prism $BENCH_HOME/models/resrc/resrc.prism --prop $BENCH_HOME/models/resrc/resrc.props rbrmax3 -const B1=5,B2=5,B3=60 --timemem --statistics --revised --reward-aware --belief-exploration unfold --size-threshold 16777216
Storm-pomdp. Sequential approach, cost aware, with cutoffs and size threshold 2^24
Storm-POMDP 1.9.1 (dev)
Date: Mon Feb 10 14:16:44 2025
Command line arguments: --prism $BENCH_HOME/models/resrc/resrc.prism --prop $BENCH_HOME/models/resrc/resrc.props rbrmax3 -const 'B1=5,B2=5,B3=60' --timemem --statistics --revised --reward-aware --belief-exploration unfold --size-threshold 16777216
Current working directory: $BENCH_HOME/experiments64gb
Time for model input parsing: 0.007s.
Time for model construction: 0.023s.
--------------------------------------------------------------
Model type: POMDP (sparse)
States: 721
Transitions: 5041
Choices: 2881
Observations: 155
Reward Models: steps, gold, gem
State Labels: 3 labels
* deadlock -> 0 item(s)
* init -> 1 item(s)
* ((x = 3) & (y = 1)) -> 16 item(s)
Choice Labels: 5 labels
* place -> 1 item(s)
* right -> 720 item(s)
* left -> 720 item(s)
* up -> 720 item(s)
* down -> 720 item(s)
--------------------------------------------------------------
Analyzing property 'Pmax=? [true U^{rew{"gold"}>=5 , rew{"gem"}>=5 , rew{"steps"}<=60 }((x = 3) & (y = 1))]'
Extend observation function to become reward aware.
bounded reachability processing done. POMDP Information:
--------------------------------------------------------------
Model type: POMDP (sparse)
States: 746
Transitions: 5217
Choices: 2981
Observations: 283
Reward Models: steps, gold, gem
State Labels: 3 labels
* deadlock -> 0 item(s)
* ((x = 3) & (y = 1)) -> 41 item(s)
* init -> 1 item(s)
Choice Labels: 5 labels
* up -> 745 item(s)
* right -> 745 item(s)
* place -> 1 item(s)
* left -> 745 item(s)
* down -> 745 item(s)
--------------------------------------------------------------
Transformed formula: Pmax=? [true U^{rew{"gold"}>=5 , rew{"gem"}>=5 , rew{"steps"}<=60 }((x = 3) & (y = 1))]
Time for pre-processing: 0.001s.
Exploring the belief MDP...
Exploring the belief space...
Exploration stopped before all beliefs were explored. 16777218 beliefs discovered. 11117477 beliefs explored.
Constructing the belief MDP...
--------------------------------------------------------------
Model type: MDP (sparse)
States: 16777219
Transitions: 65826376
Choices: 50129647
Reward Models: steps, gem, gold
State Labels: 3 labels
* target -> 1036917 item(s)
* init -> 1 item(s)
* bottom -> 1 item(s)
Choice Labels: none
--------------------------------------------------------------
Analyzing property 'Pmax=? [true U^{rew{"gold"}>=5 , rew{"gem"}>=5 , rew{"steps"}<=60 }"target"]' on the belief MDP...
Transformation of transition rewards resulted in a model with 27894694 states. 1.66265303 times more states than the original belief MDP.
Merging of sink states resulted in a model with 20774861 states.
############################## Notes ##############################
Storm-pomdp. Sequential approach, cost aware, with cutoffs and size threshold 2^24