Arch: resnet50_pt
Bs trn: 128
Bs val: 128
Hidden dim: 256
Dataset: celebA
Resample class: 
Slice with: rep
Rep cluster method: gmm
Num anchor: 32
Num positive: 32
Num negative: 32
Num negative easy: 0
Weight anc by loss: False
Weight pos by loss: False
Weight neg by loss: False
Anc loss temp: 0.5
Pos loss temp: 0.5
Neg loss temp: 0.5
Data wide pos: False
Target sample ratio: 1
Balance targets: False
Additional negatives: False
Hard negative factor: 0
Full contrastive: False
Train encoder: False
No projection head: False
Projection dim: 128
Batch factor: None
Temperature: 0.05
Single pos: False
Supervised linear scale up: False
Supervised update delay: 0
Contrastive weight: 0.5
Classifier update interval: 8
Optim: sgd
Max epoch: 5
Lr: 0.0001
Momentum: 0.9
Weight decay: 0.1
Weight decay c: 0.1
Stopping window: 30
Load encoder: 
Freeze encoder: False
Finetune epochs: 0
Clip grad norm: False
Lr scheduler classifier: 
Lr scheduler: 
Grad clip grad norm: False
Erm: False
Erm only: False
Pretrained spurious path: 
Max epoch s: 1
Bs trn s: 32
Lr s: 0.001
Momentum s: 0.9
Weight decay s: 0.0005
Slice temp: 10
Log loss interval: 10
Checkpoint interval: 50
Grad checkpoint interval: 50
Log visual interval: 100
Log grad visual interval: 50
Verbose: True
Seed: 27
Replicate: 0
No cuda: False
Resume: False
New slice: False
Num workers: 12
Evaluate: False
Data cmap: hsv
Test cmap: 
P correlation: 0.9
P corr by class: None
Train classes: ['blond', 'nonblond']
Train class ratios: None
Test shift: random
Flipped: False
Q: 0.7
Pretrained bmodel: False
Cosine: False
Exp: stage_one_erm
Supervised contrast: True
Prioritize spurious pos: False
Contrastive type: cnc
Compute auroc: False
Model type: resnet50_pt_cnc
Criterion: cross_entropy
Pretrained: False
Max grad norm: 1.0
Adam epsilon: 1e-08
Warmup steps: 0
Max grad norm s: 1.0
Adam epsilon s: 1e-08
Warmup steps s: 0
Grad max grad norm: 1.0
Grad adam epsilon: 1e-08
Grad warmup steps: 0
Device: cuda
Img file type: .png
Display image: False
Image path: ./images/celebA/celebA/config/contrastive_umaps
Log interval: 1
Log path: ./logs/celebA/config
Results path: ./results/celebA/config
Model path: ./model/celebA/config
Loss factor: 1
Supersample labels: False
Subsample labels: False
Weigh slice samples by loss: True
Val split: 0.2
Spurious train split: 0.2
Subsample groups: False
Train method: sc
Max robust acc: -1
Max robust epoch: -1
Max robust group acc: (None, None)
Root dir: ./datasets/data/CelebA/
Target name: Blond_Hair
Confounder names: ['Male']
Image mean: 0.449
Image std: 0.226
Augment data: False
Task: celebA
Num classes: 2
Experiment configs: config
Experiment name: cnc-celebA-sw=re-na=32-np=32-nn=32-nne=0-tsr=1-t=0.05-bf=None-cw=0.5-sud=0-me=5-bst=128-o=sgd-lr=0.0001-mo=0.9-wd=0.1-wdc=0.1-spur-me=1-bst=32-lr=0.001-mo=0.9-wd=0.0005-sts=0.2-s=27-r=0
Mi resampled: None

Loading checkpoints for train split:
[-1 -1 -1 ... -1 -1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [71629 66874 22880  1387]
Loading checkpoints for val split:
[-1 -1 -1 ... -1  1 -1]
<class 'numpy.ndarray'>
[0 1 2 3] [8535 8276 2874  182]
Loading checkpoints for test split:
[-1 -1 -1 ... -1 -1  1]
<class 'numpy.ndarray'>
[0 1 2 3] [9767 7535 2480  180]
Train dataset:
    Blond_Hair = 0, Male = 0 : n = 71629
    Blond_Hair = 0, Male = 1 : n = 66874
    Blond_Hair = 1, Male = 0 : n = 22880
    Blond_Hair = 1, Male = 1 : n = 1387
Val dataset:
    Blond_Hair = 0, Male = 0 : n = 8535
    Blond_Hair = 0, Male = 1 : n = 8276
    Blond_Hair = 1, Male = 0 : n = 2874
    Blond_Hair = 1, Male = 1 : n = 182
Test dataset:
    Blond_Hair = 0, Male = 0 : n = 9767
    Blond_Hair = 0, Male = 1 : n = 7535
    Blond_Hair = 1, Male = 0 : n = 2480
    Blond_Hair = 1, Male = 1 : n = 180
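
The group indices `[0 1 2 3]` printed while loading each split line up with the (Blond_Hair, Male) pairs in the dataset summaries above. The sketch below reproduces that bookkeeping on toy labels. It assumes the group index is `2 * target + confounder`, which is inferred from the matching counts and not confirmed by the log; `group_counts` is a hypothetical helper name.

```python
from collections import Counter

def group_counts(targets, confounders, n_confounder_values=2):
    """Count samples per (target, confounder) group.

    Assumption: group index = target * n_confounder_values + confounder,
    so (0, 0) -> 0, (0, 1) -> 1, (1, 0) -> 2, (1, 1) -> 3.
    """
    groups = [t * n_confounder_values + c for t, c in zip(targets, confounders)]
    return dict(sorted(Counter(groups).items()))

# Toy labels for illustration only (not CelebA data).
toy_targets = [0, 0, 0, 1, 1, 0]
toy_confounders = [0, 1, 1, 0, 1, 0]
toy_counts = group_counts(toy_targets, toy_confounders)
print(toy_counts)
```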
Pretrained model loaded from 
Epoch:   1 | Train Loss: 0.000 | Train Acc: 85.025 | Val Loss: 0.003 | Val Acc: 84.618
Training:
Accuracies by groups:
0, 0  acc: 71587 / 71629 =  99.941
0, 1  acc: 66797 / 66874 =  99.885
1, 0  acc:     9 / 22880 =   0.039
1, 1  acc:     2 /  1387 =   0.144
--------------------------------------
Average acc: 138395 / 162770 =  85.025
Robust  acc:     9 / 22880 =   0.039
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8535 /  8535 = 100.000
0, 1  acc:  8276 /  8276 = 100.000
1, 0  acc:     0 /  2874 =   0.000
1, 1  acc:     0 /   182 =   0.000
------------------------------------
Average acc: 16811 / 19867 =  84.618
Robust  acc:     0 /  2874 =   0.000
------------------------------------
Save biased model at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch0_seed27.pt
New max average-worst acc gap: 84.61770775658127
bias model - Saving best checkpoint at epoch 0
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch0_seed27.pt
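
The "average-worst acc gap" reported above matches average validation accuracy minus the lowest per-group validation accuracy. The following is a minimal sketch that recomputes it from the logged validation counts; the function names are hypothetical and this is not taken from the training code itself.

```python
def group_accuracies(correct, total):
    """Per-group accuracy in percent."""
    return [100.0 * c / t for c, t in zip(correct, total)]

def average_worst_gap(correct, total):
    """Return (average acc, worst-group acc, gap), all in percent."""
    accs = group_accuracies(correct, total)
    average = 100.0 * sum(correct) / sum(total)
    worst = min(accs)
    return average, worst, average - worst

# Validation counts copied from the first validation block in this log,
# in group order (0,0), (0,1), (1,0), (1,1).
val_correct = [8535, 8276, 0, 0]
val_total = [8535, 8276, 2874, 182]
average, worst, gap = average_worst_gap(val_correct, val_total)
print(f"average={average:.3f} worst={worst:.3f} gap={gap:.3f}")
```

A large gap here means the model is accurate on average while failing on at least one group, which is the behaviour the saved `worst_avg_gap_best` checkpoint is selected for.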
-------------------------------------------
Avg Test Loss: 0.003 | Avg Test Acc: 86.675
Robust Acc: 0.000 | Best Acc: 100.000
-------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9767 /  9767 = 100.000
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:     0 /  2480 =   0.000
1, 1  acc:     0 /   180 =   0.000
------------------------------------
Average acc: 17302 / 19962 =  86.675
Robust  acc:     0 /  2480 =   0.000
------------------------------------
Epoch:   2 | Train Loss: 0.001 | Train Acc: 86.698 | Val Loss: 0.002 | Val Acc: 91.227
Training:
Accuracies by groups:
0, 0  acc: 71405 / 71629 =  99.687
0, 1  acc: 66870 / 66874 =  99.994
1, 0  acc:  2834 / 22880 =  12.386
1, 1  acc:    10 /  1387 =   0.721
--------------------------------------
Average acc: 141119 / 162770 =  86.698
Robust  acc:    10 /  1387 =   0.721
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8338 /  8535 =  97.692
0, 1  acc:  8276 /  8276 = 100.000
1, 0  acc:  1504 /  2874 =  52.331
1, 1  acc:     6 /   182 =   3.297
------------------------------------
Average acc: 18124 / 19867 =  91.227
Robust  acc:     6 /   182 =   3.297
------------------------------------
Save biased model at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch1_seed27.pt
New max average-worst acc gap: 87.92995397414786
bias model - Saving best checkpoint at epoch 1
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_worst_avg_gap_best_epoch1_seed27.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 91.940
Robust Acc: 3.889 | Best Acc: 100.000
-------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9633 /  9767 =  98.628
0, 1  acc:  7535 /  7535 = 100.000
1, 0  acc:  1178 /  2480 =  47.500
1, 1  acc:     7 /   180 =   3.889
------------------------------------
Average acc: 18353 / 19962 =  91.940
Robust  acc:     7 /   180 =   3.889
------------------------------------
Epoch:   3 | Train Loss: 0.000 | Train Acc: 92.929 | Val Loss: 0.001 | Val Acc: 93.945
Training:
Accuracies by groups:
0, 0  acc: 69445 / 71629 =  96.951
0, 1  acc: 66744 / 66874 =  99.806
1, 0  acc: 14896 / 22880 =  65.105
1, 1  acc:   176 /  1387 =  12.689
--------------------------------------
Average acc: 151261 / 162770 =  92.929
Robust  acc:   176 /  1387 =  12.689
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8176 /  8535 =  95.794
0, 1  acc:  8257 /  8276 =  99.770
1, 0  acc:  2206 /  2874 =  76.757
1, 1  acc:    25 /   182 =  13.736
------------------------------------
Average acc: 18664 / 19867 =  93.945
Robust  acc:    25 /   182 =  13.736
------------------------------------
Save biased model at epoch 2
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch2_seed27.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.154
Robust Acc: 18.889 | Best Acc: 99.920
-------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9475 /  9767 =  97.010
0, 1  acc:  7529 /  7535 =  99.920
1, 0  acc:  1757 /  2480 =  70.847
1, 1  acc:    34 /   180 =  18.889
------------------------------------
Average acc: 18795 / 19962 =  94.154
Robust  acc:    34 /   180 =  18.889
------------------------------------
Epoch:   4 | Train Loss: 0.000 | Train Acc: 94.162 | Val Loss: 0.001 | Val Acc: 94.443
Training:
Accuracies by groups:
0, 0  acc: 68878 / 71629 =  96.159
0, 1  acc: 66620 / 66874 =  99.620
1, 0  acc: 17455 / 22880 =  76.289
1, 1  acc:   314 /  1387 =  22.639
--------------------------------------
Average acc: 153267 / 162770 =  94.162
Robust  acc:   314 /  1387 =  22.639
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8103 /  8535 =  94.938
0, 1  acc:  8250 /  8276 =  99.686
1, 0  acc:  2373 /  2874 =  82.568
1, 1  acc:    37 /   182 =  20.330
------------------------------------
Average acc: 18763 / 19867 =  94.443
Robust  acc:    37 /   182 =  20.330
------------------------------------
Save biased model at epoch 3
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch3_seed27.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 94.960
Robust Acc: 27.778 | Best Acc: 99.774
-------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9423 /  9767 =  96.478
0, 1  acc:  7518 /  7535 =  99.774
1, 0  acc:  1965 /  2480 =  79.234
1, 1  acc:    50 /   180 =  27.778
------------------------------------
Average acc: 18956 / 19962 =  94.960
Robust  acc:    50 /   180 =  27.778
------------------------------------
Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.619 | Val Loss: 0.001 | Val Acc: 94.760
Training:
Accuracies by groups:
0, 0  acc: 68861 / 71629 =  96.136
0, 1  acc: 66556 / 66874 =  99.524
1, 0  acc: 18216 / 22880 =  79.615
1, 1  acc:   378 /  1387 =  27.253
--------------------------------------
Average acc: 154011 / 162770 =  94.619
Robust  acc:   378 /  1387 =  27.253
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc:  8145 /  8535 =  95.431
0, 1  acc:  8252 /  8276 =  99.710
1, 0  acc:  2390 /  2874 =  83.159
1, 1  acc:    39 /   182 =  21.429
------------------------------------
Average acc: 18826 / 19867 =  94.760
Robust  acc:    39 /   182 =  21.429
------------------------------------
Save biased model at epoch 4
replace: True
Checkpoint saved at ./model/celebA/config/stage_one_erm_model_b_epoch4_seed27.pt
-------------------------------------------
Avg Test Loss: 0.001 | Avg Test Acc: 95.281
Robust Acc: 30.000 | Best Acc: 99.735
-------------------------------------
Testing:
Accuracies by groups:
0, 0  acc:  9455 /  9767 =  96.806
0, 1  acc:  7515 /  7535 =  99.735
1, 0  acc:  1996 /  2480 =  80.484
1, 1  acc:    54 /   180 =  30.000
------------------------------------
Average acc: 19020 / 19962 =  95.281
Robust  acc:    54 /   180 =  30.000
------------------------------------
replace: True
Checkpoint saved at ./model/celebA/config/bias-end_seed27.pt
training biased model is done
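
To compare runs, the per-epoch summary lines in this log can be pulled out with a regular expression. The sketch below assumes the `Epoch: ... | Train Loss: ... | Val Acc: ...` layout seen above stays fixed; `EPOCH_RE` and `parse_epoch_lines` are hypothetical names.

```python
import re

# Matches lines such as:
# "Epoch:   1 | Train Loss: 0.000 | Train Acc: 85.025 | Val Loss: 0.003 | Val Acc: 84.618"
EPOCH_RE = re.compile(
    r"Epoch:\s*(\d+) \| Train Loss: ([\d.]+) \| Train Acc: ([\d.]+) "
    r"\| Val Loss: ([\d.]+) \| Val Acc: ([\d.]+)"
)

def parse_epoch_lines(lines):
    """Extract one dict per epoch summary line; other lines are skipped."""
    rows = []
    for line in lines:
        m = EPOCH_RE.search(line)
        if m:
            rows.append({
                "epoch": int(m.group(1)),
                "train_loss": float(m.group(2)),
                "train_acc": float(m.group(3)),
                "val_loss": float(m.group(4)),
                "val_acc": float(m.group(5)),
            })
    return rows

# Two summary lines copied from this log, plus one non-matching line.
sample = [
    "Epoch:   1 | Train Loss: 0.000 | Train Acc: 85.025 | Val Loss: 0.003 | Val Acc: 84.618",
    "Training:",
    "Epoch:   5 | Train Loss: 0.000 | Train Acc: 94.619 | Val Loss: 0.001 | Val Acc: 94.760",
]
rows = parse_epoch_lines(sample)
for row in rows:
    print(row["epoch"], row["train_acc"], row["val_acc"])
```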
