$BENCH_HOME/bin/storm-pomdp --prism $BENCH_HOME/models/resrc/resrc.prism --prop $BENCH_HOME/models/resrc/resrc.props rbrmax3 -const B1=15,B2=15,B3=180 --timemem --statistics --revised --reward-aware --belief-exploration unfold --size-threshold 8388608
Storm-pomdp. Sequential approach, cost aware, with cutoffs and size threshold 2^23
Storm-POMDP 1.9.1 (dev)
Date: Mon Feb 10 14:43:14 2025
Command line arguments: --prism $BENCH_HOME/models/resrc/resrc.prism --prop $BENCH_HOME/models/resrc/resrc.props rbrmax3 -const 'B1=15,B2=15,B3=180' --timemem --statistics --revised --reward-aware --belief-exploration unfold --size-threshold 8388608
Current working directory: $BENCH_HOME/experiments64gb
Time for model input parsing: 0.007s.
Time for model construction: 0.053s.
--------------------------------------------------------------
Model type: POMDP (sparse)
States: 721
Transitions: 5041
Choices: 2881
Observations: 155
Reward Models: steps, gold, gem
State Labels: 3 labels
* deadlock -> 0 item(s)
* init -> 1 item(s)
* ((x = 3) & (y = 1)) -> 16 item(s)
Choice Labels: 5 labels
* place -> 1 item(s)
* right -> 720 item(s)
* left -> 720 item(s)
* up -> 720 item(s)
* down -> 720 item(s)
--------------------------------------------------------------
Analyzing property 'Pmax=? [true U^{rew{"gold"}>=15 , rew{"gem"}>=15 , rew{"steps"}<=180 }((x = 3) & (y = 1))]'
Extend observation function to become reward aware.
bounded reachability processing done. POMDP Information:
--------------------------------------------------------------
Model type: POMDP (sparse)
States: 746
Transitions: 5217
Choices: 2981
Observations: 283
Reward Models: steps, gold, gem
State Labels: 3 labels
* deadlock -> 0 item(s)
* ((x = 3) & (y = 1)) -> 41 item(s)
* init -> 1 item(s)
Choice Labels: 5 labels
* up -> 745 item(s)
* right -> 745 item(s)
* place -> 1 item(s)
* left -> 745 item(s)
* down -> 745 item(s)
--------------------------------------------------------------
Transformed formula: Pmax=? [true U^{rew{"gold"}>=15 , rew{"gem"}>=15 , rew{"steps"}<=180 }((x = 3) & (y = 1))]
Time for pre-processing: 0.001s.
Exploring the belief MDP...
Exploring the belief space...
Exploration stopped before all beliefs were explored. 8388609 beliefs discovered. 5318465 beliefs explored.
Constructing the belief MDP...
--------------------------------------------------------------
Model type: MDP (sparse)
States: 8388610
Transitions: 32142825
Choices: 24344002
Reward Models: steps, gem, gold
State Labels: 3 labels
* target -> 482591 item(s)
* init -> 1 item(s)
* bottom -> 1 item(s)
Choice Labels: none
--------------------------------------------------------------
Analyzing property 'Pmax=? [true U^{rew{"gold"}>=15 , rew{"gem"}>=15 , rew{"steps"}<=180 }"target"]' on the belief MDP...
Transformation of transition rewards resulted in a model with 13707073 states. 1.634010045 times more states than the original belief MDP.
Merging of sink states resulted in a model with 9939462 states.
############################## Notes ##############################
Storm-pomdp. Sequential approach, cost aware, with cutoffs and size threshold 2^23