2026-01-22 23:20:37,026	INFO worker.py:1821 -- Connecting to existing Ray cluster at address: 10.0.52.234:6379...
2026-01-22 23:20:37,038	INFO worker.py:1998 -- Connected to Ray cluster. View the dashboard at [1m[32mhttps://session-cu2pvvgxez5g383cgicaufnj9j.i.anyscaleuserdata.com [39m[22m
2026-01-22 23:20:37,051	INFO packaging.py:463 -- Pushing file package 'gcs://_ray_pkg_366339b200f160de6d178eca4f12e52dc6eb793b.zip' (2.99MiB) to Ray cluster...
2026-01-22 23:20:37,063	INFO packaging.py:476 -- Successfully pushed file package 'gcs://_ray_pkg_366339b200f160de6d178eca4f12e52dc6eb793b.zip'.
/home/ray/anaconda3/lib/python3.12/site-packages/ray/_private/worker.py:2046: FutureWarning: Tip: In future versions of Ray, Ray will no longer override accelerator visible devices env var if num_gpus=0 or num_gpus=None (default). To enable this behavior and turn off this error message, set RAY_ACCEL_ENV_VAR_OVERRIDE_ON_ZERO=0
  warnings.warn(
2026-01-22 23:20:37,082 - __main__ - INFO - Ray initialized
2026-01-22 23:20:37,084 - __main__ - INFO - Available resources: {'anyscale/provider:aws': 5.0, 'anyscale/accelerator_shape:1xA10G': 4.0, 'accelerator_type:A10G': 4.0, 'CPU': 16.0, 'anyscale/node-group:1xA10G:4CPU-16GB': 4.0, 'node:10.0.34.104': 1.0, 'anyscale/region:us-west-2': 5.0, 'GPU': 4.0, 'memory': 103079215104.0, 'object_store_memory': 27707239217.0, 'node:10.0.40.174': 1.0, 'node:10.0.57.179': 1.0, 'node:10.0.15.86': 1.0, 'anyscale/node-group:head': 1.0, 'node:__internal_head__': 1.0, 'anyscale/cpu_only:true': 1.0, 'node:10.0.52.234': 1.0}
2026-01-22 23:20:37,084 - __main__ - INFO - ============================================================
2026-01-22 23:20:37,085 - __main__ - INFO - EXPERIMENT 1: Hessian Eigenvalue Analysis
2026-01-22 23:20:37,085 - __main__ - INFO - ============================================================
2026-01-22 23:20:37,831 - __main__ - INFO - Loading HumanEval dataset...
2026-01-22 23:20:39,708 - __main__ - INFO - Loaded 164 HumanEval tasks
[36m(GPUWorker pid=18645, ip=10.0.57.179)[0m `torch_dtype` is deprecated! Use `dtype` instead!
2026-01-22 23:20:51,601 - __main__ - INFO - Worker device: {'device': 'NVIDIA A10G', 'memory_total': 23.59590912, 'memory_allocated': 0.511148032}
2026-01-22 23:20:51,601 - __main__ - INFO - 
Processing difficulty 1...
[36m(GPUWorker pid=18645, ip=10.0.57.179)[0m `loss_type=None` was set in the config but it is unrecognized. Using the default loss: `ForCausalLMLoss`.
2026-01-22 23:20:54,520 - __main__ - INFO -   λ_max = 37.870594 ± 0.984157
2026-01-22 23:20:54,520 - __main__ - INFO - 
Processing difficulty 2...
2026-01-22 23:20:56,752 - __main__ - INFO -   λ_max = 36.275457 ± 1.508964
2026-01-22 23:20:56,752 - __main__ - INFO - 
Processing difficulty 3...
2026-01-22 23:20:58,976 - __main__ - INFO -   λ_max = 37.963929 ± 2.205922
2026-01-22 23:20:58,976 - __main__ - INFO - 
Processing difficulty 4...
2026-01-22 23:21:01,203 - __main__ - INFO -   λ_max = 36.890302 ± 1.238365
2026-01-22 23:21:01,203 - __main__ - INFO - 
Processing difficulty 5...
2026-01-22 23:21:03,401 - __main__ - INFO -   λ_max = 38.195241 ± 1.325121
2026-01-22 23:21:03,402 - __main__ - INFO - 
Results saved to experiments/validation_results/hessian_eigenvalue_results.json
2026-01-22 23:21:03,402 - __main__ - INFO - Exponential fit: α = 0.003 ± 0.008
2026-01-22 23:21:03,402 - __main__ - INFO - R² = 0.0592
2026-01-22 23:21:03,402 - __main__ - INFO - Theoretical λ = α/2 = 0.002
2026-01-22 23:21:03,402 - __main__ - INFO - ============================================================
2026-01-22 23:21:03,402 - __main__ - INFO - EXPERIMENT 2: Gradient Coherence Grid
2026-01-22 23:21:03,403 - __main__ - INFO - ============================================================
2026-01-22 23:21:03,403 - __main__ - INFO - Loading HumanEval dataset...
2026-01-22 23:21:04,580 - __main__ - INFO - Loaded 164 HumanEval tasks
2026-01-22 23:21:04,581 - __main__ - INFO - 
Processing difficulty 1...
[36m(GPUWorker pid=18747, ip=10.0.57.179)[0m `torch_dtype` is deprecated! Use `dtype` instead!
[36m(GPUWorker pid=18747, ip=10.0.57.179)[0m `loss_type=None` was set in the config but it is unrecognized. Using the default loss: `ForCausalLMLoss`.
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=0: coherence=1.000
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=2: coherence=0.982
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=4: coherence=0.993
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=6: coherence=0.993
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=8: coherence=0.991
2026-01-22 23:21:13,495 - __main__ - INFO -   staleness=10: coherence=0.993
2026-01-22 23:21:13,496 - __main__ - INFO - 
Processing difficulty 2...
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=0: coherence=1.000
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=2: coherence=0.994
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=4: coherence=0.995
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=6: coherence=0.996
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=8: coherence=0.994
2026-01-22 23:21:15,544 - __main__ - INFO -   staleness=10: coherence=0.994
2026-01-22 23:21:15,544 - __main__ - INFO - 
Processing difficulty 3...
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=0: coherence=1.000
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=2: coherence=0.995
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=4: coherence=0.994
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=6: coherence=0.996
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=8: coherence=0.995
2026-01-22 23:21:17,601 - __main__ - INFO -   staleness=10: coherence=0.992
2026-01-22 23:21:17,601 - __main__ - INFO - 
Processing difficulty 4...
2026-01-22 23:21:19,649 - __main__ - INFO -   staleness=0: coherence=1.000
2026-01-22 23:21:19,649 - __main__ - INFO -   staleness=2: coherence=0.995
2026-01-22 23:21:19,650 - __main__ - INFO -   staleness=4: coherence=0.995
2026-01-22 23:21:19,650 - __main__ - INFO -   staleness=6: coherence=0.995
2026-01-22 23:21:19,650 - __main__ - INFO -   staleness=8: coherence=0.995
2026-01-22 23:21:19,650 - __main__ - INFO -   staleness=10: coherence=0.994
2026-01-22 23:21:19,650 - __main__ - INFO - 
Processing difficulty 5...
2026-01-22 23:21:21,686 - __main__ - INFO -   staleness=0: coherence=1.000
2026-01-22 23:21:21,686 - __main__ - INFO -   staleness=2: coherence=0.996
2026-01-22 23:21:21,686 - __main__ - INFO -   staleness=4: coherence=0.995
2026-01-22 23:21:21,686 - __main__ - INFO -   staleness=6: coherence=0.990
2026-01-22 23:21:21,686 - __main__ - INFO -   staleness=8: coherence=0.993
2026-01-22 23:21:21,687 - __main__ - INFO -   staleness=10: coherence=0.993
2026-01-22 23:21:21,687 - __main__ - INFO - 
Results saved to experiments/validation_results/gradient_coherence_results.json
2026-01-22 23:21:21,688 - __main__ - INFO - Safe zone fraction (coherence > 0.8): 100.00%
2026-01-22 23:21:21,688 - __main__ - INFO - ============================================================
2026-01-22 23:21:21,688 - __main__ - INFO - EXPERIMENT 3: Lambda Sensitivity Sweep
2026-01-22 23:21:21,688 - __main__ - INFO - ============================================================
2026-01-22 23:21:21,688 - __main__ - INFO - Loading HumanEval dataset...
2026-01-22 23:21:22,853 - __main__ - INFO - Loaded 164 HumanEval tasks
2026-01-22 23:21:22,853 - __main__ - INFO - Lambda values: [0.25, 0.5, 0.75, 1.0]
2026-01-22 23:21:22,853 - __main__ - INFO - Seeds: [42, 123, 456]
2026-01-22 23:21:22,853 - __main__ - INFO - Note: Running abbreviated training for validation
[36m(GPUWorker pid=18832, ip=10.0.57.179)[0m `torch_dtype` is deprecated! Use `dtype` instead!
[36m(GPUWorker pid=18832, ip=10.0.57.179)[0m `loss_type=None` was set in the config but it is unrecognized. Using the default loss: `ForCausalLMLoss`.
2026-01-22 23:21:38,344 - __main__ - INFO - 
Evaluating lambda = 0.25
2026-01-22 23:21:38,345 - __main__ - INFO -   Avg coherence: 0.995
2026-01-22 23:21:38,345 - __main__ - INFO -   Avg discard rate: 0.480
2026-01-22 23:21:38,345 - __main__ - INFO -   Est. Pass@1: 0.597 ± 0.013
2026-01-22 23:21:38,345 - __main__ - INFO -   Est. Throughput: 20.3 ± 0.6
2026-01-22 23:21:38,345 - __main__ - INFO - 
Evaluating lambda = 0.5
2026-01-22 23:21:38,346 - __main__ - INFO -   Avg coherence: 0.998
2026-01-22 23:21:38,346 - __main__ - INFO -   Avg discard rate: 0.680
2026-01-22 23:21:38,346 - __main__ - INFO -   Est. Pass@1: 0.580 ± 0.013
2026-01-22 23:21:38,346 - __main__ - INFO -   Est. Throughput: 18.3 ± 0.6
2026-01-22 23:21:38,346 - __main__ - INFO - 
Evaluating lambda = 0.75
2026-01-22 23:21:38,346 - __main__ - INFO -   Avg coherence: 0.999
2026-01-22 23:21:38,346 - __main__ - INFO -   Avg discard rate: 0.760
2026-01-22 23:21:38,347 - __main__ - INFO -   Est. Pass@1: 0.573 ± 0.013
2026-01-22 23:21:38,347 - __main__ - INFO -   Est. Throughput: 17.5 ± 0.6
2026-01-22 23:21:38,347 - __main__ - INFO - 
Evaluating lambda = 1.0
2026-01-22 23:21:38,347 - __main__ - INFO -   Avg coherence: 0.999
2026-01-22 23:21:38,347 - __main__ - INFO -   Avg discard rate: 0.760
2026-01-22 23:21:38,347 - __main__ - INFO -   Est. Pass@1: 0.573 ± 0.013
2026-01-22 23:21:38,347 - __main__ - INFO -   Est. Throughput: 17.5 ± 0.6
2026-01-22 23:21:38,348 - __main__ - INFO - 
Results saved to experiments/validation_results/lambda_sweep_results.json
2026-01-22 23:21:38,348 - __main__ - INFO - Pareto frontier lambdas: [0.25]
2026-01-22 23:21:38,348 - __main__ - INFO - Theoretical optimal (0.5) on Pareto: False
2026-01-22 23:21:38,348 - __main__ - INFO - Best lambda for Pass@1: 0.25
2026-01-22 23:21:38,349 - __main__ - INFO - 
============================================================
2026-01-22 23:21:38,349 - __main__ - INFO - ALL EXPERIMENTS COMPLETE
2026-01-22 23:21:38,349 - __main__ - INFO - ============================================================
2026-01-22 23:21:38,349 - __main__ - INFO - Results saved to experiments/validation_results
2026-01-22 23:21:38,349 - __main__ - INFO - 
Hessian Analysis: α = 0.003, R² = 0.059
2026-01-22 23:21:38,349 - __main__ - INFO - Lambda Sweep: Best λ = 0.25, Theory validated = True
