2026-01-22 23:22:33,727 - __main__ - INFO - Output directories created under experiments/gpu_full_results
2026-01-22 23:22:34,349	INFO worker.py:1821 -- Connecting to existing Ray cluster at address: 10.0.52.234:6379...
2026-01-22 23:22:34,362	INFO worker.py:1998 -- Connected to Ray cluster. View the dashboard at [1m[32mhttps://session-cu2pvvgxez5g383cgicaufnj9j.i.anyscaleuserdata.com [39m[22m
2026-01-22 23:22:34,371	INFO packaging.py:392 -- Ignoring upload to cluster for these files: [PosixPath('/home/ray/default/.gitignore')]
2026-01-22 23:22:34,399	INFO packaging.py:691 -- Creating a file package for local module '/home/ray/default'.
2026-01-22 23:22:34,400	INFO packaging.py:392 -- Ignoring upload to cluster for these files: [PosixPath('/home/ray/default/.gitignore')]
2026-01-22 23:22:34,432	INFO packaging.py:463 -- Pushing file package 'gcs://_ray_pkg_729aaa9c1cbed043.zip' (0.49MiB) to Ray cluster...
2026-01-22 23:22:34,435	INFO packaging.py:476 -- Successfully pushed file package 'gcs://_ray_pkg_729aaa9c1cbed043.zip'.
/home/ray/anaconda3/lib/python3.12/site-packages/ray/_private/worker.py:2046: FutureWarning: Tip: In future versions of Ray, Ray will no longer override accelerator visible devices env var if num_gpus=0 or num_gpus=None (default). To enable this behavior and turn off this error message, set RAY_ACCEL_ENV_VAR_OVERRIDE_ON_ZERO=0
  warnings.warn(
2026-01-22 23:22:34,460 - __main__ - INFO - Ray cluster resources: {'anyscale/provider:aws': 5.0, 'anyscale/accelerator_shape:1xA10G': 4.0, 'accelerator_type:A10G': 4.0, 'CPU': 16.0, 'anyscale/node-group:1xA10G:4CPU-16GB': 4.0, 'node:10.0.34.104': 1.0, 'anyscale/region:us-west-2': 5.0, 'GPU': 4.0, 'object_store_memory': 27707239217.0, 'memory': 103079215104.0, 'node:10.0.40.174': 1.0, 'node:10.0.57.179': 1.0, 'node:10.0.15.86': 1.0, 'anyscale/node-group:head': 1.0, 'node:__internal_head__': 1.0, 'anyscale/cpu_only:true': 1.0, 'node:10.0.52.234': 1.0}
2026-01-22 23:22:37,020 - __main__ - INFO - Loading HumanEval dataset...
2026-01-22 23:22:38,980 - __main__ - INFO - Loaded 164 HumanEval tasks
2026-01-22 23:22:38,980 - __main__ - INFO - 
================================================================================
2026-01-22 23:22:38,980 - __main__ - INFO - PHASE 1: TRAINING EXPERIMENTS
2026-01-22 23:22:38,981 - __main__ - INFO - ================================================================================
2026-01-22 23:22:38,981 - __main__ - INFO - ================================================================================
2026-01-22 23:22:38,981 - __main__ - INFO - STARTING ALL TRAINING EXPERIMENTS
2026-01-22 23:22:38,981 - __main__ - INFO - ================================================================================
2026-01-22 23:22:38,982 - __main__ - INFO - 
============================================================
2026-01-22 23:22:38,982 - __main__ - INFO - Training: sync (seed=42)
2026-01-22 23:22:38,982 - __main__ - INFO - ============================================================
/home/ray/anaconda3/lib/python3.12/site-packages/transformers/utils/hub.py:110: FutureWarning: Using `TRANSFORMERS_CACHE` is deprecated and will be removed in v5 of Transformers. Use `HF_HOME` instead.
  warnings.warn(
2026-01-22 23:22:42,685 - __main__ - INFO - Running sync with seed 42, steps 500, model Salesforce/codegen-350M-mono
2026-01-22 23:22:42,686 - src.training.aceas_trainer - INFO - CUDA not available, skipping Megatron bridge initialization
2026-01-22 23:22:42,686 - src.training.aceas_trainer - INFO - Loading model: Salesforce/codegen-350M-mono
`torch_dtype` is deprecated! Use `dtype` instead!
Some weights of the model checkpoint at Salesforce/codegen-350M-mono were not used when initializing CodeGenForCausalLM: ['transformer.h.0.attn.causal_mask', 'transformer.h.1.attn.causal_mask', 'transformer.h.10.attn.causal_mask', 'transformer.h.11.attn.causal_mask', 'transformer.h.12.attn.causal_mask', 'transformer.h.13.attn.causal_mask', 'transformer.h.14.attn.causal_mask', 'transformer.h.15.attn.causal_mask', 'transformer.h.16.attn.causal_mask', 'transformer.h.17.attn.causal_mask', 'transformer.h.18.attn.causal_mask', 'transformer.h.19.attn.causal_mask', 'transformer.h.2.attn.causal_mask', 'transformer.h.3.attn.causal_mask', 'transformer.h.4.attn.causal_mask', 'transformer.h.5.attn.causal_mask', 'transformer.h.6.attn.causal_mask', 'transformer.h.7.attn.causal_mask', 'transformer.h.8.attn.causal_mask', 'transformer.h.9.attn.causal_mask']
- This IS expected if you are initializing CodeGenForCausalLM from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing CodeGenForCausalLM from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
2026-01-22 23:22:44,328 - src.training.aceas_trainer - INFO - Model explicitly converted to torch.float32
2026-01-22 23:22:44,329 - src.training.aceas_trainer - INFO - Gradient checkpointing enabled
2026-01-22 23:22:44,329 - src.training.aceas_trainer - INFO - Auto-detected LoRA target modules: ['qkv_proj', 'out_proj', 'fc_in', 'fc_out']
2026-01-22 23:22:46,359 - src.training.aceas_trainer - INFO - LoRA applied: 5,242,880 trainable / 361,955,328 total params (1.45%)
2026-01-22 23:22:46,361 - src.training.aceas_trainer - INFO - Using local mode (no Ray workers)
2026-01-22 23:22:46,361 - src.training.aceas_trainer - INFO - Starting ACEAS training
/home/ray/default/src/training/grpo.py:214: UserWarning: CUDA is not available or torch_xla is imported. Disabling autocast.
  with torch.amp.autocast(device_type='cuda', dtype=torch.float16):
`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`...
Caching is incompatible with gradient checkpointing in CodeGenBlock. Setting `layer_past=None`.
/home/ray/anaconda3/lib/python3.12/site-packages/torch/utils/checkpoint.py:232: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  check_backward_validity(args)
/home/ray/anaconda3/lib/python3.12/site-packages/torch/utils/checkpoint.py:232: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  check_backward_validity(args)
/home/ray/anaconda3/lib/python3.12/site-packages/torch/utils/checkpoint.py:232: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  check_backward_validity(args)
*** SIGTERM received at time=1769125043 on cpu 6 ***
PC: @     0x72a1df8fc749  (unknown)  (unknown)
    @     0x72a1df89d520  (unknown)  (unknown)
{"asctime":"2026-01-22 23:37:23,381","levelname":"E","message":"*** SIGTERM received at time=1769125043 on cpu 6 ***","filename":"logging.cc","lineno":474}
{"asctime":"2026-01-22 23:37:23,381","levelname":"E","message":"PC: @     0x72a1df8fc749  (unknown)  (unknown)","filename":"logging.cc","lineno":474}
{"asctime":"2026-01-22 23:37:23,381","levelname":"E","message":"    @     0x72a1df89d520  (unknown)  (unknown)","filename":"logging.cc","lineno":474}
[36m(autoscaler +4m52s)[0m Tip: use `ray status` to view detailed cluster status. To disable these messages, set RAY_SCHEDULER_EVENTS=0.
