Arch: resnet50_pt
Bs trn: 128
Bs val: 128
Hidden dim: 256
Dataset: celebA
Resample class: 
Slice with: rep
Rep cluster method: gmm
Num anchor: 32
Num positive: 32
Num negative: 32
Num negative easy: 0
Weight anc by loss: False
Weight pos by loss: False
Weight neg by loss: False
Anc loss temp: 0.5
Pos loss temp: 0.5
Neg loss temp: 0.5
Data wide pos: False
Target sample ratio: 1
Balance targets: False
Additional negatives: False
Hard negative factor: 0
Full contrastive: False
Train encoder: False
No projection head: False
Projection dim: 128
Batch factor: None
Temperature: 0.05
Single pos: False
Supervised linear scale up: False
Supervised update delay: 0
Contrastive weight: 0.5
Classifier update interval: 8
Optim: sgd
Max epoch: 5
Lr: 0.0001
Momentum: 0.9
Weight decay: 0.1
Weight decay c: 0.1
Stopping window: 30
Load encoder: 
Freeze encoder: False
Finetune epochs: 0
Clip grad norm: False
Lr scheduler classifier: 
Lr scheduler: 
Grad clip grad norm: False
Erm: False
Erm only: False
Pretrained spurious path: 
Max epoch s: 1
Bs trn s: 32
Lr s: 0.001
Momentum s: 0.9
Weight decay s: 0.0005
Slice temp: 10
Log loss interval: 10
Checkpoint interval: 50
Grad checkpoint interval: 50
Log visual interval: 100
Log grad visual interval: 50
Verbose: True
Seed: 42
Replicate: 0
No cuda: False
Resume: False
New slice: False
Num workers: 12
Evaluate: False
Data cmap: hsv
Test cmap: 
P correlation: 0.9
P corr by class: None
Train classes: ['blond', 'nonblond']
Train class ratios: None
Test shift: random
Flipped: False
Q: 0.7
Pretrained bmodel: False
Cosine: False
Exp: stage_one_erm
Supervised contrast: True
Prioritize spurious pos: False
Contrastive type: cnc
Compute auroc: False
Model type: resnet50_pt_cnc
Criterion: cross_entropy
Pretrained: False
Max grad norm: 1.0
Adam epsilon: 1e-08
Warmup steps: 0
Max grad norm s: 1.0
Adam epsilon s: 1e-08
Warmup steps s: 0
Grad max grad norm: 1.0
Grad adam epsilon: 1e-08
Grad warmup steps: 0
Device: cuda
Img file type: .png
Display image: False
Image path: ./images/celebA/celebA/config/contrastive_umaps
Log interval: 1
Log path: ./logs/celebA/config
Results path: ./results/celebA/config
Model path: ./model/celebA/config
Loss factor: 1
Supersample labels: False
Subsample labels: False
Weigh slice samples by loss: True
Val split: 0.2
Spurious train split: 0.2
Subsample groups: False
Train method: sc
Max robust acc: -1
Max robust epoch: -1
Max robust group acc: (None, None)
Root dir: ./datasets/data/CelebA/
Target name: Blond_Hair
Confounder names: ['Male']
Image mean: 0.449
Image std: 0.226
Augment data: False
Task: celebA
Num classes: 2
Experiment configs: config
Experiment name: cnc-celebA-sw=re-na=32-np=32-nn=32-nne=0-tsr=1-t=0.05-bf=None-cw=0.5-sud=0-me=5-bst=128-o=sgd-lr=0.0001-mo=0.9-wd=0.1-wdc=0.1-spur-me=1-bst=32-lr=0.001-mo=0.9-wd=0.0005-sts=0.2-s=42-r=0
Mi resampled: None

Loading checkpoints for train split:
[-1 -1 -1 ... -1 -1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [71629 66874 22880  1387]
Loading checkpoints for val split:
[-1 -1 -1 ... -1  1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [8535 8276 2874  182]
Loading checkpoints for test split:
[-1 -1 -1 ... -1 -1  1]
<class 'numpy.ndarray'>
[0 1 2 3] [9767 7535 2480  180]
Train dataset:
    Blond_Hair = 0, Male = 0 : n = 71629
    Blond_Hair = 0, Male = 1 : n = 66874
    Blond_Hair = 1, Male = 0 : n = 22880
    Blond_Hair = 1, Male = 1 : n = 1387
Val dataset:
    Blond_Hair = 0, Male = 0 : n = 8535
    Blond_Hair = 0, Male = 1 : n = 8276
    Blond_Hair = 1, Male = 0 : n = 2874
    Blond_Hair = 1, Male = 1 : n = 182
Test dataset:
    Blond_Hair = 0, Male = 0 : n = 9767
    Blond_Hair = 0, Male = 1 : n = 7535
    Blond_Hair = 1, Male = 0 : n = 2480
    Blond_Hair = 1, Male = 1 : n = 180
Pretrained model loaded from 
Epoch:   1 | Train Loss: 0.000 | Train Acc: 84.850 | Val Loss: 0.003 | Val Acc: 84.618
Training:
Accuracies by groups:
0, 0  acc: 71372 / 71629 =  99.641
0, 1  acc: 66656 / 66874 =  99.674
1, 0  acc:    77 / 22880 =   0.337
1, 1  acc:     5 /  1387 =   0.360
--------------------------------------
Average acc: 138110 / 162770 =  84.850
Robust  acc:    77 / 22880 =   0.337
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8535 /  8535 = 100.000
0, 1  acc:  8276 /  8276 = 100.000
1, 0  acc:     0 /  2874 =   0.000
1, 1  acc:     0 /   182 =   0.000
------------------------------------
Average acc: 16811 / 19867 =  84.618
Robust  acc:     0 /  2874 =   0.000
------------------------------------
Save biased model at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch0_seed42.pt
New max average-worst acc gap: 84.61770775658127
bias model - Saving best checkpoint at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch0_seed42.pt
-------------------------------------------
Avg Test Loss: 0.003 | Avg Test Acc: 86.675
Robust Acc: 0.000 | Best Acc: 100.000
-------------------------------------
Training, Epoch 0:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Epoch:   2 | Train Loss: 0.001 | Train Acc: 87.888 | Val Loss: 0.001 | Val Acc: 92.525
Training:
Accuracies by groups:
0, 0  acc: 71117 / 71629 =  99.285
0, 1  acc: 66862 / 66874 =  99.982
1, 0  acc:  5057 / 22880 =  22.102
1, 1  acc:    20 /  1387 =   1.442
--------------------------------------
Average acc: 143056 / 162770 =  87.888
Robust  acc:    20 /  1387 =   1.442
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8227 /  8535 =  96.391
0, 1  acc:  8272 /  8276 =  99.952
1, 0  acc:  1874 /  2874 =  65.205
1, 1  acc:     9 /   182 =   4.945
------------------------------------
Average acc: 18382 / 19867 =  92.525
Robust  acc:     9 /   182 =   4.945
------------------------------------
Save biased model at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch1_seed42.pt
New max average-worst acc gap: 87.58023825472358
bias model - Saving best checkpoint at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch1_seed42.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 92.811
Robust Acc: 11.111 | Best Acc: 99.973
-------------------------------------
Training, Epoch 1:
Accuracies by groups:
0, 0  acc:  9510 /  9767 =  97.369
0, 1  acc:  7533 /  7535 =  99.973
1, 0  acc:  1464 /  2480 =  59.032
1, 1  acc:    20 /   180 =  11.111
------------------------------------
Average acc: 18527 / 19962 =  92.811
Robust  acc:    20 /   180 =  11.111
------------------------------------
Accuracies by groups:
0, 0  acc:  9510 /  9767 =  97.369
0, 1  acc:  7533 /  7535 =  99.973
1, 0  acc:  1464 /  2480 =  59.032
1, 1  acc:    20 /   180 =  11.111
------------------------------------
Average acc: 18527 / 19962 =  92.811
Robust  acc:    20 /   180 =  11.111
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9510 /  9767 =  97.369
0, 1  acc:  7533 /  7535 =  99.973
1, 0  acc:  1464 /  2480 =  59.032
1, 1  acc:    20 /   180 =  11.111
------------------------------------
Average acc: 18527 / 19962 =  92.811
Robust  acc:    20 /   180 =  11.111
------------------------------------
Epoch:   3 | Train Loss: 0.000 | Train Acc: 93.349 | Val Loss: 0.001 | Val Acc: 94.030
Training:
Accuracies by groups:
0, 0  acc: 69199 / 71629 =  96.608
0, 1  acc: 66724 / 66874 =  99.776
1, 0  acc: 15812 / 22880 =  69.108
1, 1  acc:   209 /  1387 =  15.068
--------------------------------------
Average acc: 151944 / 162770 =  93.349
Robust  acc:   209 /  1387 =  15.068
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8167 /  8535 =  95.688
0, 1  acc:  8258 /  8276 =  99.783
1, 0  acc:  2226 /  2874 =  77.453
1, 1  acc:    30 /   182 =  16.484
------------------------------------
Average acc: 18681 / 19867 =  94.030
Robust  acc:    30 /   182 =  16.484
------------------------------------
Save biased model at epoch 2
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch2_seed42.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.264
Robust Acc: 21.667 | Best Acc: 99.854
-------------------------------------
Training, Epoch 2:
Accuracies by groups:
0, 0  acc:  9458 /  9767 =  96.836
0, 1  acc:  7524 /  7535 =  99.854
1, 0  acc:  1796 /  2480 =  72.419
1, 1  acc:    39 /   180 =  21.667
------------------------------------
Average acc: 18817 / 19962 =  94.264
Robust  acc:    39 /   180 =  21.667
------------------------------------
Accuracies by groups:
0, 0  acc:  9458 /  9767 =  96.836
0, 1  acc:  7524 /  7535 =  99.854
1, 0  acc:  1796 /  2480 =  72.419
1, 1  acc:    39 /   180 =  21.667
------------------------------------
Average acc: 18817 / 19962 =  94.264
Robust  acc:    39 /   180 =  21.667
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9458 /  9767 =  96.836
0, 1  acc:  7524 /  7535 =  99.854
1, 0  acc:  1796 /  2480 =  72.419
1, 1  acc:    39 /   180 =  21.667
------------------------------------
Average acc: 18817 / 19962 =  94.264
Robust  acc:    39 /   180 =  21.667
------------------------------------
Epoch:   4 | Train Loss: 0.000 | Train Acc: 94.184 | Val Loss: 0.001 | Val Acc: 94.463
Training:
Accuracies by groups:
0, 0  acc: 68841 / 71629 =  96.108
0, 1  acc: 66618 / 66874 =  99.617
1, 0  acc: 17531 / 22880 =  76.622
1, 1  acc:   313 /  1387 =  22.567
--------------------------------------
Average acc: 153303 / 162770 =  94.184
Robust  acc:   313 /  1387 =  22.567
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8110 /  8535 =  95.021
0, 1  acc:  8250 /  8276 =  99.686
1, 0  acc:  2370 /  2874 =  82.463
1, 1  acc:    37 /   182 =  20.330
------------------------------------
Average acc: 18767 / 19867 =  94.463
Robust  acc:    37 /   182 =  20.330
------------------------------------
Save biased model at epoch 3
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch3_seed42.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.990
Robust Acc: 28.333 | Best Acc: 99.748
-------------------------------------
Training, Epoch 3:
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7516 /  7535 =  99.748
1, 0  acc:  1972 /  2480 =  79.516
1, 1  acc:    51 /   180 =  28.333
------------------------------------
Average acc: 18962 / 19962 =  94.990
Robust  acc:    51 /   180 =  28.333
------------------------------------
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7516 /  7535 =  99.748
1, 0  acc:  1972 /  2480 =  79.516
1, 1  acc:    51 /   180 =  28.333
------------------------------------
Average acc: 18962 / 19962 =  94.990
Robust  acc:    51 /   180 =  28.333
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7516 /  7535 =  99.748
1, 0  acc:  1972 /  2480 =  79.516
1, 1  acc:    51 /   180 =  28.333
------------------------------------
Average acc: 18962 / 19962 =  94.990
Robust  acc:    51 /   180 =  28.333
------------------------------------
Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.657 | Val Loss: 0.001 | Val Acc: 94.821
Training:
Accuracies by groups:
0, 0  acc: 68860 / 71629 =  96.134
0, 1  acc: 66583 / 66874 =  99.565
1, 0  acc: 18248 / 22880 =  79.755
1, 1  acc:   383 /  1387 =  27.614
--------------------------------------
Average acc: 154074 / 162770 =  94.657
Robust  acc:   383 /  1387 =  27.614
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8194 /  8535 =  96.005
0, 1  acc:  8257 /  8276 =  99.770
1, 0  acc:  2350 /  2874 =  81.768
1, 1  acc:    37 /   182 =  20.330
------------------------------------
Average acc: 18838 / 19867 =  94.821
Robust  acc:    37 /   182 =  20.330
------------------------------------
Save biased model at epoch 4
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch4_seed42.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 95.216
Robust Acc: 28.889 | Best Acc: 99.761
-------------------------------------
Training, Epoch 4:
Accuracies by groups:
0, 0  acc:  9476 /  9767 =  97.021
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1962 /  2480 =  79.113
1, 1  acc:    52 /   180 =  28.889
------------------------------------
Average acc: 19007 / 19962 =  95.216
Robust  acc:    52 /   180 =  28.889
------------------------------------
Accuracies by groups:
0, 0  acc:  9476 /  9767 =  97.021
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1962 /  2480 =  79.113
1, 1  acc:    52 /   180 =  28.889
------------------------------------
Average acc: 19007 / 19962 =  95.216
Robust  acc:    52 /   180 =  28.889
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9476 /  9767 =  97.021
0, 1  acc:  7517 /  7535 =  99.761
1, 0  acc:  1962 /  2480 =  79.113
1, 1  acc:    52 /   180 =  28.889
------------------------------------
Average acc: 19007 / 19962 =  95.216
Robust  acc:    52 /   180 =  28.889
------------------------------------
replace: True
Checkpoint saved at ./model/celebA/config/bias-end_seed42.pt
training biased model is done
