Arch: resnet50_pt
Bs trn: 128
Bs val: 128
Hidden dim: 256
Dataset: celebA
Resample class: 
Slice with: rep
Rep cluster method: gmm
Num anchor: 32
Num positive: 32
Num negative: 32
Num negative easy: 0
Weight anc by loss: False
Weight pos by loss: False
Weight neg by loss: False
Anc loss temp: 0.5
Pos loss temp: 0.5
Neg loss temp: 0.5
Data wide pos: False
Target sample ratio: 1
Balance targets: False
Additional negatives: False
Hard negative factor: 0
Full contrastive: False
Train encoder: False
No projection head: False
Projection dim: 128
Batch factor: None
Temperature: 0.05
Single pos: False
Supervised linear scale up: False
Supervised update delay: 0
Contrastive weight: 0.5
Classifier update interval: 8
Optim: sgd
Max epoch: 5
Lr: 0.0001
Momentum: 0.9
Weight decay: 0.1
Weight decay c: 0.1
Stopping window: 30
Load encoder: 
Freeze encoder: False
Finetune epochs: 0
Clip grad norm: False
Lr scheduler classifier: 
Lr scheduler: 
Grad clip grad norm: False
Erm: False
Erm only: False
Pretrained spurious path: 
Max epoch s: 1
Bs trn s: 32
Lr s: 0.001
Momentum s: 0.9
Weight decay s: 0.0005
Slice temp: 10
Log loss interval: 10
Checkpoint interval: 50
Grad checkpoint interval: 50
Log visual interval: 100
Log grad visual interval: 50
Verbose: True
Seed: 30
Replicate: 0
No cuda: False
Resume: False
New slice: False
Num workers: 12
Evaluate: False
Data cmap: hsv
Test cmap: 
P correlation: 0.9
P corr by class: None
Train classes: ['blond', 'nonblond']
Train class ratios: None
Test shift: random
Flipped: False
Q: 0.7
Pretrained bmodel: False
Cosine: False
Exp: stage_one_erm
Supervised contrast: True
Prioritize spurious pos: False
Contrastive type: cnc
Compute auroc: False
Model type: resnet50_pt_cnc
Criterion: cross_entropy
Pretrained: False
Max grad norm: 1.0
Adam epsilon: 1e-08
Warmup steps: 0
Max grad norm s: 1.0
Adam epsilon s: 1e-08
Warmup steps s: 0
Grad max grad norm: 1.0
Grad adam epsilon: 1e-08
Grad warmup steps: 0
Device: cuda
Img file type: .png
Display image: False
Image path: ./images/celebA/celebA/config/contrastive_umaps
Log interval: 1
Log path: ./logs/celebA/config
Results path: ./results/celebA/config
Model path: ./model/celebA/config
Loss factor: 1
Supersample labels: False
Subsample labels: False
Weigh slice samples by loss: True
Val split: 0.2
Spurious train split: 0.2
Subsample groups: False
Train method: sc
Max robust acc: -1
Max robust epoch: -1
Max robust group acc: (None, None)
Root dir: ./datasets/data/CelebA/
Target name: Blond_Hair
Confounder names: ['Male']
Image mean: 0.449
Image std: 0.226
Augment data: False
Task: celebA
Num classes: 2
Experiment configs: config
Experiment name: cnc-celebA-sw=re-na=32-np=32-nn=32-nne=0-tsr=1-t=0.05-bf=None-cw=0.5-sud=0-me=5-bst=128-o=sgd-lr=0.0001-mo=0.9-wd=0.1-wdc=0.1-spur-me=1-bst=32-lr=0.001-mo=0.9-wd=0.0005-sts=0.2-s=30-r=0
Mi resampled: None

Loading checkpoints for train split:
[-1 -1 -1 ... -1 -1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [71629 66874 22880  1387]
Loading checkpoints for val split:
[-1 -1 -1 ... -1  1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [8535 8276 2874  182]
Loading checkpoints for test split:
[-1 -1 -1 ... -1 -1  1]
<class 'numpy.ndarray'>
[0 1 2 3] [9767 7535 2480  180]
Train dataset:
    Blond_Hair = 0, Male = 0 : n = 71629
    Blond_Hair = 0, Male = 1 : n = 66874
    Blond_Hair = 1, Male = 0 : n = 22880
    Blond_Hair = 1, Male = 1 : n = 1387
Val dataset:
    Blond_Hair = 0, Male = 0 : n = 8535
    Blond_Hair = 0, Male = 1 : n = 8276
    Blond_Hair = 1, Male = 0 : n = 2874
    Blond_Hair = 1, Male = 1 : n = 182
Test dataset:
    Blond_Hair = 0, Male = 0 : n = 9767
    Blond_Hair = 0, Male = 1 : n = 7535
    Blond_Hair = 1, Male = 0 : n = 2480
    Blond_Hair = 1, Male = 1 : n = 180
Pretrained model loaded from 
Epoch:   1 | Train Loss: 0.000 | Train Acc: 84.680 | Val Loss: 0.003 | Val Acc: 84.618
Training:
Accuracies by groups:
0, 0  acc: 71214 / 71629 =  99.421
0, 1  acc: 66489 / 66874 =  99.424
1, 0  acc:   125 / 22880 =   0.546
1, 1  acc:     6 /  1387 =   0.433
--------------------------------------
Average acc: 137834 / 162770 =  84.680
Robust  acc:     6 /  1387 =   0.433
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8535 /  8535 = 100.000
0, 1  acc:  8276 /  8276 = 100.000
1, 0  acc:     0 /  2874 =   0.000
1, 1  acc:     0 /   182 =   0.000
------------------------------------
Average acc: 16811 / 19867 =  84.618
Robust  acc:     0 /  2874 =   0.000
------------------------------------
Save biased model at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch0_seed30.pt
New max average-worst acc gap: 84.61770775658127
bias model - Saving best checkpoint at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch0_seed30.pt
-------------------------------------------
Avg Test Loss: 0.003 | Avg Test Acc: 86.675
Robust Acc: 0.000 | Best Acc: 100.000
-------------------------------------
Training, Epoch 0:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Epoch:   2 | Train Loss: 0.001 | Train Acc: 86.966 | Val Loss: 0.002 | Val Acc: 91.634
Training:
Accuracies by groups:
0, 0  acc: 71357 / 71629 =  99.620
0, 1  acc: 66871 / 66874 =  99.996
1, 0  acc:  3311 / 22880 =  14.471
1, 1  acc:    15 /  1387 =   1.081
--------------------------------------
Average acc: 141554 / 162770 =  86.966
Robust  acc:    15 /  1387 =   1.081
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8310 /  8535 =  97.364
0, 1  acc:  8275 /  8276 =  99.988
1, 0  acc:  1614 /  2874 =  56.159
1, 1  acc:     6 /   182 =   3.297
------------------------------------
Average acc: 18205 / 19867 =  91.634
Robust  acc:     6 /   182 =   3.297
------------------------------------
Save biased model at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch1_seed30.pt
New max average-worst acc gap: 88.33766525415994
bias model - Saving best checkpoint at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch1_seed30.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 92.215
Robust Acc: 7.778 | Best Acc: 100.000
-------------------------------------
Training, Epoch 1:
Accuracies by groups:
0, 0  acc:  9602 /  9767 =  98.311
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:  1257 /  2480 =  50.685
1, 1  acc:    14 /   180 =   7.778
------------------------------------
Average acc: 18408 / 19962 =  92.215
Robust  acc:    14 /   180 =   7.778
------------------------------------
Accuracies by groups:
0, 0  acc:  9602 /  9767 =  98.311
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:  1257 /  2480 =  50.685
1, 1  acc:    14 /   180 =   7.778
------------------------------------
Average acc: 18408 / 19962 =  92.215
Robust  acc:    14 /   180 =   7.778
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9602 /  9767 =  98.311
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:  1257 /  2480 =  50.685
1, 1  acc:    14 /   180 =   7.778
------------------------------------
Average acc: 18408 / 19962 =  92.215
Robust  acc:    14 /   180 =   7.778
------------------------------------
Epoch:   3 | Train Loss: 0.000 | Train Acc: 93.068 | Val Loss: 0.001 | Val Acc: 93.844
Training:
Accuracies by groups:
0, 0  acc: 69416 / 71629 =  96.910
0, 1  acc: 66754 / 66874 =  99.821
1, 0  acc: 15131 / 22880 =  66.132
1, 1  acc:   185 /  1387 =  13.338
--------------------------------------
Average acc: 151486 / 162770 =  93.068
Robust  acc:   185 /  1387 =  13.338
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8177 /  8535 =  95.806
0, 1  acc:  8257 /  8276 =  99.770
1, 0  acc:  2183 /  2874 =  75.957
1, 1  acc:    27 /   182 =  14.835
------------------------------------
Average acc: 18644 / 19867 =  93.844
Robust  acc:    27 /   182 =  14.835
------------------------------------
Save biased model at epoch 2
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch2_seed30.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.134
Robust Acc: 19.444 | Best Acc: 99.881
-------------------------------------
Training, Epoch 2:
Accuracies by groups:
0, 0  acc:  9478 /  9767 =  97.041
0, 1  acc:  7526 /  7535 =  99.881
1, 0  acc:  1752 /  2480 =  70.645
1, 1  acc:    35 /   180 =  19.444
------------------------------------
Average acc: 18791 / 19962 =  94.134
Robust  acc:    35 /   180 =  19.444
------------------------------------
Accuracies by groups:
0, 0  acc:  9478 /  9767 =  97.041
0, 1  acc:  7526 /  7535 =  99.881
1, 0  acc:  1752 /  2480 =  70.645
1, 1  acc:    35 /   180 =  19.444
------------------------------------
Average acc: 18791 / 19962 =  94.134
Robust  acc:    35 /   180 =  19.444
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9478 /  9767 =  97.041
0, 1  acc:  7526 /  7535 =  99.881
1, 0  acc:  1752 /  2480 =  70.645
1, 1  acc:    35 /   180 =  19.444
------------------------------------
Average acc: 18791 / 19962 =  94.134
Robust  acc:    35 /   180 =  19.444
------------------------------------
Epoch:   4 | Train Loss: 0.000 | Train Acc: 94.213 | Val Loss: 0.001 | Val Acc: 94.448
Training:
Accuracies by groups:
0, 0  acc: 68930 / 71629 =  96.232
0, 1  acc: 66626 / 66874 =  99.629
1, 0  acc: 17475 / 22880 =  76.377
1, 1  acc:   320 /  1387 =  23.071
--------------------------------------
Average acc: 153351 / 162770 =  94.213
Robust  acc:   320 /  1387 =  23.071
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8174 /  8535 =  95.770
0, 1  acc:  8256 /  8276 =  99.758
1, 0  acc:  2301 /  2874 =  80.063
1, 1  acc:    33 /   182 =  18.132
------------------------------------
Average acc: 18764 / 19867 =  94.448
Robust  acc:    33 /   182 =  18.132
------------------------------------
Save biased model at epoch 3
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch3_seed30.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.815
Robust Acc: 24.444 | Best Acc: 99.801
-------------------------------------
Training, Epoch 3:
Accuracies by groups:
0, 0  acc:  9469 /  9767 =  96.949
0, 1  acc:  7520 /  7535 =  99.801
1, 0  acc:  1894 /  2480 =  76.371
1, 1  acc:    44 /   180 =  24.444
------------------------------------
Average acc: 18927 / 19962 =  94.815
Robust  acc:    44 /   180 =  24.444
------------------------------------
Accuracies by groups:
0, 0  acc:  9469 /  9767 =  96.949
0, 1  acc:  7520 /  7535 =  99.801
1, 0  acc:  1894 /  2480 =  76.371
1, 1  acc:    44 /   180 =  24.444
------------------------------------
Average acc: 18927 / 19962 =  94.815
Robust  acc:    44 /   180 =  24.444
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9469 /  9767 =  96.949
0, 1  acc:  7520 /  7535 =  99.801
1, 0  acc:  1894 /  2480 =  76.371
1, 1  acc:    44 /   180 =  24.444
------------------------------------
Average acc: 18927 / 19962 =  94.815
Robust  acc:    44 /   180 =  24.444
------------------------------------
Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.622 | Val Loss: 0.001 | Val Acc: 94.775
Training:
Accuracies by groups:
0, 0  acc: 68763 / 71629 =  95.999
0, 1  acc: 66579 / 66874 =  99.559
1, 0  acc: 18300 / 22880 =  79.983
1, 1  acc:   375 /  1387 =  27.037
--------------------------------------
Average acc: 154017 / 162770 =  94.622
Robust  acc:   375 /  1387 =  27.037
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8167 /  8535 =  95.688
0, 1  acc:  8252 /  8276 =  99.710
1, 0  acc:  2369 /  2874 =  82.429
1, 1  acc:    41 /   182 =  22.527
------------------------------------
Average acc: 18829 / 19867 =  94.775
Robust  acc:    41 /   182 =  22.527
------------------------------------
Save biased model at epoch 4
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch4_seed30.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 95.246
Robust Acc: 30.000 | Best Acc: 99.761
-------------------------------------
Training, Epoch 4:
Accuracies by groups:
0, 0  acc:  9464 /  9767 =  96.898
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1978 /  2480 =  79.758
1, 1  acc:    54 /   180 =  30.000
------------------------------------
Average acc: 19013 / 19962 =  95.246
Robust  acc:    54 /   180 =  30.000
------------------------------------
Accuracies by groups:
0, 0  acc:  9464 /  9767 =  96.898
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1978 /  2480 =  79.758
1, 1  acc:    54 /   180 =  30.000
------------------------------------
Average acc: 19013 / 19962 =  95.246
Robust  acc:    54 /   180 =  30.000
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9464 /  9767 =  96.898
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1978 /  2480 =  79.758
1, 1  acc:    54 /   180 =  30.000
------------------------------------
Average acc: 19013 / 19962 =  95.246
Robust  acc:    54 /   180 =  30.000
------------------------------------
replace: True
Checkpoint saved at ./model/celebA/config/bias-end_seed30.pt
training biased model is done
