Arch: resnet50_pt
Bs trn: 128
Bs val: 128
Hidden dim: 256
Dataset: celebA
Resample class: 
Slice with: rep
Rep cluster method: gmm
Num anchor: 32
Num positive: 32
Num negative: 32
Num negative easy: 0
Weight anc by loss: False
Weight pos by loss: False
Weight neg by loss: False
Anc loss temp: 0.5
Pos loss temp: 0.5
Neg loss temp: 0.5
Data wide pos: False
Target sample ratio: 1
Balance targets: False
Additional negatives: False
Hard negative factor: 0
Full contrastive: False
Train encoder: False
No projection head: False
Projection dim: 128
Batch factor: None
Temperature: 0.05
Single pos: False
Supervised linear scale up: False
Supervised update delay: 0
Contrastive weight: 0.5
Classifier update interval: 8
Optim: sgd
Max epoch: 5
Lr: 0.0001
Momentum: 0.9
Weight decay: 0.1
Weight decay c: 0.1
Stopping window: 30
Load encoder: 
Freeze encoder: False
Finetune epochs: 0
Clip grad norm: False
Lr scheduler classifier: 
Lr scheduler: 
Grad clip grad norm: False
Erm: False
Erm only: False
Pretrained spurious path: 
Max epoch s: 1
Bs trn s: 32
Lr s: 0.001
Momentum s: 0.9
Weight decay s: 0.0005
Slice temp: 10
Log loss interval: 10
Checkpoint interval: 50
Grad checkpoint interval: 50
Log visual interval: 100
Log grad visual interval: 50
Verbose: True
Seed: 18
Replicate: 0
No cuda: False
Resume: False
New slice: False
Num workers: 12
Evaluate: False
Data cmap: hsv
Test cmap: 
P correlation: 0.9
P corr by class: None
Train classes: ['blond', 'nonblond']
Train class ratios: None
Test shift: random
Flipped: False
Q: 0.7
Pretrained bmodel: False
Cosine: False
Exp: stage_one_erm
Supervised contrast: True
Prioritize spurious pos: False
Contrastive type: cnc
Compute auroc: False
Model type: resnet50_pt_cnc
Criterion: cross_entropy
Pretrained: False
Max grad norm: 1.0
Adam epsilon: 1e-08
Warmup steps: 0
Max grad norm s: 1.0
Adam epsilon s: 1e-08
Warmup steps s: 0
Grad max grad norm: 1.0
Grad adam epsilon: 1e-08
Grad warmup steps: 0
Device: cuda
Img file type: .png
Display image: False
Image path: ./images/celebA/celebA/config/contrastive_umaps
Log interval: 1
Log path: ./logs/celebA/config
Results path: ./results/celebA/config
Model path: ./model/celebA/config
Loss factor: 1
Supersample labels: False
Subsample labels: False
Weigh slice samples by loss: True
Val split: 0.2
Spurious train split: 0.2
Subsample groups: False
Train method: sc
Max robust acc: -1
Max robust epoch: -1
Max robust group acc: (None, None)
Root dir: ./datasets/data/CelebA/
Target name: Blond_Hair
Confounder names: ['Male']
Image mean: 0.449
Image std: 0.226
Augment data: False
Task: celebA
Num classes: 2
Experiment configs: config
Experiment name: cnc-celebA-sw=re-na=32-np=32-nn=32-nne=0-tsr=1-t=0.05-bf=None-cw=0.5-sud=0-me=5-bst=128-o=sgd-lr=0.0001-mo=0.9-wd=0.1-wdc=0.1-spur-me=1-bst=32-lr=0.001-mo=0.9-wd=0.0005-sts=0.2-s=18-r=0
Mi resampled: None
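
The settings above are printed one per line as `Key: value`, and some values are blank (for example `Resample class:`). A minimal sketch of reading such a dump back into a dictionary follows. The helper name `parse_config` is hypothetical and is not part of the code that produced this log. Values are kept as strings because the log does not record their original types.

```python
def parse_config(lines):
    """Parse 'Key: value' lines into a dict; a blank value becomes ''.

    Hypothetical helper for illustration, not part of the logged program.
    """
    config = {}
    for line in lines:
        # Split on the first colon only, so values such as paths stay intact.
        key, sep, value = line.partition(":")
        if not sep:
            continue  # skip lines that are not 'Key: value'
        config[key.strip()] = value.strip()
    return config


# A few lines copied from the dump above.
sample = [
    "Arch: resnet50_pt",
    "Bs trn: 128",
    "Resample class: ",
    "Temperature: 0.05",
    "Image path: ./images/celebA/celebA/config/contrastive_umaps",
]
config = parse_config(sample)
print(config["Bs trn"], repr(config["Resample class"]))
```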

Loading checkpoints for train split:
[-1 -1 -1 ... -1 -1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [71629 66874 22880  1387]
Loading checkpoints for val split:
[-1 -1 -1 ... -1  1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [8535 8276 2874  182]
Loading checkpoints for test split:
[-1 -1 -1 ... -1 -1  1]
<class 'numpy.ndarray'>
[0 1 2 3] [9767 7535 2480  180]
Train dataset:
    Blond_Hair = 0, Male = 0 : n = 71629
    Blond_Hair = 0, Male = 1 : n = 66874
    Blond_Hair = 1, Male = 0 : n = 22880
    Blond_Hair = 1, Male = 1 : n = 1387
Val dataset:
    Blond_Hair = 0, Male = 0 : n = 8535
    Blond_Hair = 0, Male = 1 : n = 8276
    Blond_Hair = 1, Male = 0 : n = 2874
    Blond_Hair = 1, Male = 1 : n = 182
Test dataset:
    Blond_Hair = 0, Male = 0 : n = 9767
    Blond_Hair = 0, Male = 1 : n = 7535
    Blond_Hair = 1, Male = 0 : n = 2480
    Blond_Hair = 1, Male = 1 : n = 180
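
The group ids `[0 1 2 3]` printed for each split appear in the same order as the (Blond_Hair, Male) pairs listed above, with matching counts. One mapping consistent with that order is `group = 2 * target + confounder`. The log does not state the formula, so the sketch below treats it as an assumption, and the function name `group_index` and the toy labels are made up for illustration.

```python
from collections import Counter


def group_index(target, confounder, n_confounder_values=2):
    """Assumed mapping: (0,0)->0, (0,1)->1, (1,0)->2, (1,1)->3."""
    return target * n_confounder_values + confounder


# Toy (Blond_Hair, Male) labels, not taken from CelebA.
samples = [(0, 0), (0, 1), (1, 0), (1, 1), (0, 1), (1, 0)]
counts = Counter(group_index(t, c) for t, c in samples)

for g in sorted(counts):
    t, c = divmod(g, 2)  # invert the mapping to recover the label pair
    print(f"    Blond_Hair = {t}, Male = {c} : n = {counts[g]}")
```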
Pretrained model loaded from 
Epoch:   1 | Train Loss: 0.000 | Train Acc: 84.868 | Val Loss: 0.003 | Val Acc: 84.618
Training:
Accuracies by groups:
0, 0  acc: 71360 / 71629 =  99.624
0, 1  acc: 66688 / 66874 =  99.722
1, 0  acc:    88 / 22880 =   0.385
1, 1  acc:     4 /  1387 =   0.288
--------------------------------------
Average acc: 138140 / 162770 =  84.868
Robust  acc:     4 /  1387 =   0.288
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8535 /  8535 = 100.000
0, 1  acc:  8276 /  8276 = 100.000
1, 0  acc:     0 /  2874 =   0.000
1, 1  acc:     0 /   182 =   0.000
------------------------------------
Average acc: 16811 / 19867 =  84.618
Robust  acc:     0 /  2874 =   0.000
------------------------------------
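
The `Average acc` and `Robust acc` lines can be reproduced from the per-group counts: the average pools all groups, and the robust value is the lowest per-group accuracy. The sketch below uses the validation counts printed above. Taking the first group on a tie is an inference from this block, where groups `1, 0` and `1, 1` both score 0.000 and the log reports `0 / 2874`. The function name `summarize` is hypothetical.

```python
def summarize(correct, total):
    """Return per-group accuracies (%), pooled accuracy (%), and worst group index."""
    accs = [100.0 * c / t for c, t in zip(correct, total)]
    avg = 100.0 * sum(correct) / sum(total)
    # min() returns the first minimal element, so ties go to the lower group id.
    worst = min(range(len(accs)), key=accs.__getitem__)
    return accs, avg, worst


# Validation counts from the block above, in group order (0,0), (0,1), (1,0), (1,1).
correct = [8535, 8276, 0, 0]
total = [8535, 8276, 2874, 182]

accs, avg, worst = summarize(correct, total)
print(f"Average acc: {sum(correct)} / {sum(total)} = {avg:7.3f}")
print(f"Robust  acc: {correct[worst]:5d} / {total[worst]:5d} = {accs[worst]:7.3f}")
```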
Save biased model at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch0_seed18.pt
New max average-worst acc gap: 84.61770775658127
bias model - Saving best checkpoint at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch0_seed18.pt
-------------------------------------------
Avg Test Loss: 0.003 | Avg Test Acc: 86.675
Robust Acc: 0.000 | Best Acc: 100.000
-------------------------------------
Training, Epoch 0:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Epoch:   2 | Train Loss: 0.001 | Train Acc: 87.449 | Val Loss: 0.002 | Val Acc: 91.997
Training:
Accuracies by groups:
0, 0  acc: 71264 / 71629 =  99.490
0, 1  acc: 66867 / 66874 =  99.990
1, 0  acc:  4201 / 22880 =  18.361
1, 1  acc:     9 /  1387 =   0.649
--------------------------------------
Average acc: 142341 / 162770 =  87.449
Robust  acc:     9 /  1387 =   0.649
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8257 /  8535 =  96.743
0, 1  acc:  8273 /  8276 =  99.964
1, 0  acc:  1741 /  2874 =  60.578
1, 1  acc:     6 /   182 =   3.297
------------------------------------
Average acc: 18277 / 19867 =  91.997
Robust  acc:     6 /   182 =   3.297
------------------------------------
Save biased model at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch1_seed18.pt
New max average-worst acc gap: 88.70007528083735
bias model - Saving best checkpoint at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch1_seed18.pt
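
The two `New max average-worst acc gap` values logged so far match the validation average accuracy minus the validation robust (worst-group) accuracy for the same epoch. The sketch below checks that arithmetic with the counts from the log. Reading the gap as a plain difference is inferred from the numbers, not stated by the log, and the function name `avg_worst_gap` is hypothetical.

```python
def avg_worst_gap(avg_correct, avg_total, worst_correct, worst_total):
    """Average accuracy minus worst-group accuracy, both in percent."""
    avg_acc = 100.0 * avg_correct / avg_total
    worst_acc = 100.0 * worst_correct / worst_total
    return avg_acc - worst_acc


# Validation counts at "Save biased model at epoch 0" and "epoch 1".
gap_epoch0 = avg_worst_gap(16811, 19867, 0, 2874)
gap_epoch1 = avg_worst_gap(18277, 19867, 6, 182)

print(f"epoch 0 gap: {gap_epoch0:.4f}")
print(f"epoch 1 gap: {gap_epoch1:.4f}")
```

A larger gap means the model is doing well on average while failing on its worst group, which is the behaviour a deliberately biased stage-one model is selected for.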
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 92.526
Robust Acc: 6.667 | Best Acc: 99.987
------------------------------------
Training, Epoch 1:
Accuracies by groups:
0, 0  acc:  9562 /  9767 =  97.901
0, 1  acc:  7534 /  7535 =  99.987
1, 0  acc:  1362 /  2480 =  54.919
1, 1  acc:    12 /   180 =   6.667
------------------------------------
Average acc: 18470 / 19962 =  92.526
Robust  acc:    12 /   180 =   6.667
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9562 /  9767 =  97.901
0, 1  acc:  7534 /  7535 =  99.987
1, 0  acc:  1362 /  2480 =  54.919
1, 1  acc:    12 /   180 =   6.667
------------------------------------
Average acc: 18470 / 19962 =  92.526
Robust  acc:    12 /   180 =   6.667
------------------------------------
Epoch:   3 | Train Loss: 0.000 | Train Acc: 93.150 | Val Loss: 0.001 | Val Acc: 94.030
Training:
Accuracies by groups:
0, 0  acc: 69346 / 71629 =  96.813
0, 1  acc: 66726 / 66874 =  99.779
1, 0  acc: 15346 / 22880 =  67.072
1, 1  acc:   202 /  1387 =  14.564
--------------------------------------
Average acc: 151620 / 162770 =  93.150
Robust  acc:   202 /  1387 =  14.564
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8141 /  8535 =  95.384
0, 1  acc:  8254 /  8276 =  99.734
1, 0  acc:  2257 /  2874 =  78.532
1, 1  acc:    29 /   182 =  15.934
------------------------------------
Average acc: 18681 / 19867 =  94.030
Robust  acc:    29 /   182 =  15.934
------------------------------------
Save biased model at epoch 2
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch2_seed18.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.299
Robust Acc: 21.667 | Best Acc: 99.854
-------------------------------------
Training, Epoch 2:
Accuracies by groups:
0, 0  acc:  9433 /  9767 =  96.580
0, 1  acc:  7524 /  7535 =  99.854
1, 0  acc:  1828 /  2480 =  73.710
1, 1  acc:    39 /   180 =  21.667
------------------------------------
Average acc: 18824 / 19962 =  94.299
Robust  acc:    39 /   180 =  21.667
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9433 /  9767 =  96.580
0, 1  acc:  7524 /  7535 =  99.854
1, 0  acc:  1828 /  2480 =  73.710
1, 1  acc:    39 /   180 =  21.667
------------------------------------
Average acc: 18824 / 19962 =  94.299
Robust  acc:    39 /   180 =  21.667
------------------------------------
Epoch:   4 | Train Loss: 0.000 | Train Acc: 94.191 | Val Loss: 0.001 | Val Acc: 94.453
Training:
Accuracies by groups:
0, 0  acc: 68903 / 71629 =  96.194
0, 1  acc: 66620 / 66874 =  99.620
1, 0  acc: 17466 / 22880 =  76.337
1, 1  acc:   326 /  1387 =  23.504
--------------------------------------
Average acc: 153315 / 162770 =  94.191
Robust  acc:   326 /  1387 =  23.504
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8112 /  8535 =  95.044
0, 1  acc:  8250 /  8276 =  99.686
1, 0  acc:  2364 /  2874 =  82.255
1, 1  acc:    39 /   182 =  21.429
------------------------------------
Average acc: 18765 / 19867 =  94.453
Robust  acc:    39 /   182 =  21.429
------------------------------------
Save biased model at epoch 3
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch3_seed18.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.920
Robust Acc: 28.333 | Best Acc: 99.721
-------------------------------------
Training, Epoch 3:
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7514 /  7535 =  99.721
1, 0  acc:  1960 /  2480 =  79.032
1, 1  acc:    51 /   180 =  28.333
------------------------------------
Average acc: 18948 / 19962 =  94.920
Robust  acc:    51 /   180 =  28.333
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7514 /  7535 =  99.721
1, 0  acc:  1960 /  2480 =  79.032
1, 1  acc:    51 /   180 =  28.333
------------------------------------
Average acc: 18948 / 19962 =  94.920
Robust  acc:    51 /   180 =  28.333
------------------------------------
Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.603 | Val Loss: 0.001 | Val Acc: 94.805
Training:
Accuracies by groups:
0, 0  acc: 68775 / 71629 =  96.016
0, 1  acc: 66558 / 66874 =  99.527
1, 0  acc: 18251 / 22880 =  79.768
1, 1  acc:   401 /  1387 =  28.911
--------------------------------------
Average acc: 153985 / 162770 =  94.603
Robust  acc:   401 /  1387 =  28.911
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8194 /  8535 =  96.005
0, 1  acc:  8256 /  8276 =  99.758
1, 0  acc:  2348 /  2874 =  81.698
1, 1  acc:    37 /   182 =  20.330
------------------------------------
Average acc: 18835 / 19867 =  94.805
Robust  acc:    37 /   182 =  20.330
------------------------------------
Save biased model at epoch 4
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch4_seed18.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 95.176
Robust Acc: 28.889 | Best Acc: 99.748
-------------------------------------
Training, Epoch 4:
Accuracies by groups:
0, 0  acc:  9476 /  9767 =  97.021
0, 1  acc:  7516 /  7535 =  99.748
1, 0  acc:  1955 /  2480 =  78.831
1, 1  acc:    52 /   180 =  28.889
------------------------------------
Average acc: 18999 / 19962 =  95.176
Robust  acc:    52 /   180 =  28.889
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9476 /  9767 =  97.021
0, 1  acc:  7516 /  7535 =  99.748
1, 0  acc:  1955 /  2480 =  78.831
1, 1  acc:    52 /   180 =  28.889
------------------------------------
Average acc: 18999 / 19962 =  95.176
Robust  acc:    52 /   180 =  28.889
------------------------------------
replace: True
Checkpoint saved at ./model/celebA/config/bias-end_seed18.pt
training biased model is done
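
To follow the run at a glance, the per-epoch summary lines (`Epoch: N | Train Loss: ... | Val Acc: ...`) can be pulled out of the log with a regular expression. The sketch below assumes the line format stays exactly as printed above. The pattern and the function name `parse_epoch_line` are illustrative and are not part of the logged program.

```python
import re

# Matches lines such as:
# "Epoch:   1 | Train Loss: 0.000 | Train Acc: 84.868 | Val Loss: 0.003 | Val Acc: 84.618"
EPOCH_RE = re.compile(
    r"Epoch:\s*(\d+) \| Train Loss: ([\d.]+) \| Train Acc: ([\d.]+) "
    r"\| Val Loss: ([\d.]+) \| Val Acc: ([\d.]+)"
)


def parse_epoch_line(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = EPOCH_RE.search(line)
    if m is None:
        return None
    train_loss, train_acc, val_loss, val_acc = (float(x) for x in m.groups()[1:])
    return {
        "epoch": int(m.group(1)),
        "train_loss": train_loss,
        "train_acc": train_acc,
        "val_loss": val_loss,
        "val_acc": val_acc,
    }


# Two summary lines copied from the log above.
lines = [
    "Epoch:   1 | Train Loss: 0.000 | Train Acc: 84.868 | Val Loss: 0.003 | Val Acc: 84.618",
    "Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.603 | Val Loss: 0.001 | Val Acc: 94.805",
]
records = [parse_epoch_line(line) for line in lines]
for r in records:
    print(r["epoch"], r["train_acc"], r["val_acc"])
```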
