Arch: bert-base-uncased_pt
Bs trn: 16
Bs val: 16
Hidden dim: 256
Dataset: civilcomments
Resample class: 
Slice with: rep
Rep cluster method: gmm
Num anchor: 32
Num positive: 32
Num negative: 32
Num negative easy: 0
Weight anc by loss: False
Weight pos by loss: False
Weight neg by loss: False
Anc loss temp: 0.5
Pos loss temp: 0.5
Neg loss temp: 0.5
Data wide pos: False
Target sample ratio: 1
Balance targets: False
Additional negatives: False
Hard negative factor: 0
Full contrastive: False
Train encoder: False
No projection head: False
Projection dim: 128
Batch factor: None
Temperature: 0.05
Single pos: False
Supervised linear scale up: False
Supervised update delay: 0
Contrastive weight: 0.5
Classifier update interval: 8
Optim: sgd
Max epoch: 1
Lr: 1e-05
Momentum: 0.9
Weight decay: 0.01
Weight decay c: 0.01
Stopping window: 30
Load encoder: 
Freeze encoder: False
Finetune epochs: 0
Clip grad norm: False
Lr scheduler classifier: 
Lr scheduler: 
Grad clip grad norm: False
Erm: False
Erm only: False
Pretrained spurious path: 
Max epoch s: 1
Bs trn s: 32
Lr s: 0.001
Momentum s: 0.9
Weight decay s: 0.0005
Slice temp: 10
Log loss interval: 10
Checkpoint interval: 50
Grad checkpoint interval: 50
Log visual interval: 100
Log grad visual interval: 50
Verbose: True
Seed: 12
Replicate: 0
No cuda: False
Resume: False
New slice: False
Num workers: 32
Evaluate: False
Data cmap: hsv
Test cmap: 
P correlation: 0.9
P corr by class: None
Train classes: ['non_toxic', 'toxic']
Train class ratios: None
Test shift: random
Flipped: False
Q: 0.7
Pretrained bmodel: False
Cosine: False
Exp: JTT_sgd_no_gce
Supervised contrast: True
Prioritize spurious pos: False
Contrastive type: cnc
Compute auroc: False
Model type: bert-base-uncased_pt_cnc
Criterion: cross_entropy
Pretrained: False
Max grad norm: 1.0
Adam epsilon: 1e-08
Warmup steps: 0
Max grad norm s: 1.0
Adam epsilon s: 1e-08
Warmup steps s: 0
Grad max grad norm: 1.0
Grad adam epsilon: 1e-08
Grad warmup steps: 0
Device: cuda
Img file type: .png
Display image: False
Image path: ./images/civilcomments/civilcomments/config/contrastive_umaps
Log interval: 1
Log path: ./logs/civilcomments/config
Results path: ./results/civilcomments/config
Model path: ./model/civilcomments/config
Loss factor: 1
Supersample labels: False
Subsample labels: False
Weigh slice samples by loss: True
Val split: 0.1
Spurious train split: 0.2
Subsample groups: False
Train method: sc
Max robust acc: -1
Max robust epoch: -1
Max robust group acc: (None, None)
Root dir: ./datasets/data/CivilComments/
Target name: toxic
Confounder names: ['identities']
Image mean: 0
Image std: 0
Augment data: False
Max token length: 300
Task: civilcomments
Num classes: 2
Experiment configs: config
Experiment name: cnc-civilcomments-sw=re-na=32-np=32-nn=32-nne=0-tsr=1-t=0.05-bf=None-cw=0.5-sud=0-me=1-bst=16-o=sgd-lr=1e-05-mo=0.9-wd=0.01-wdc=0.01-spur-me=1-bst=32-lr=0.001-mo=0.9-wd=0.0005-sts=0.2-s=12-r=0
Mi resampled: None

Pretrained model loaded from 
male 0
[213150  25373  26078   4437]
1 1 25373
3 3 4437
female 1
[207241  31282  25553   4962]
1 1 31282
3 3 4962
LGBTQ 2
[232368   6155  28250   2265]
1 1 6155
3 3 2265
christian 3
[214231  24292  28069   2446]
1 1 24292
3 3 2446
muslim 4
[227694  10829  27390   3125]
1 1 10829
3 3 3125
other_religions 5
[232982   5541  29512   1003]
1 1 5541
3 3 1003
black 6
[231738   6785  27404   3111]
1 1 6785
3 3 3111
white 7
[226507  12016  25833   4682]
1 1 12016
3 3 4682
Epoch:   1 | Train Loss: 0.021 | Train Acc: 88.566 | Val Loss: 0.000 | Val Acc: 89.198
Training:
Accuracies by groups:
0, 0  acc: 147372 / 148186 =  99.451
0, 1  acc:  9860 /  9955 =  99.046
0, 2  acc: 16407 / 16544 =  99.172
0, 3  acc:  8482 /  8553 =  99.170
0, 4  acc:  3265 /  3291 =  99.210
0, 5  acc:   360 /   368 =  97.826
0, 6  acc:   385 /   390 =  98.718
0, 7  acc:   430 /   431 =  99.768
0, 8  acc: 16455 / 16537 =  99.504
0, 9  acc:  1057 /  1062 =  99.529
0, 10  acc:  1136 /  1138 =  99.824
0, 11  acc:   793 /   795 =  99.748
0, 12  acc:   648 /   651 =  99.539
0, 13  acc:    76 /    77 =  98.701
0, 14  acc:   122 /   122 = 100.000
0, 15  acc:    67 /    67 = 100.000
0, 16  acc:  6654 /  6714 =  99.106
0, 17  acc:   203 /   205 =  99.024
0, 18  acc:   652 /   658 =  99.088
0, 19  acc:   200 /   202 =  99.010
0, 20  acc:   109 /   111 =  98.198
0, 21  acc:    13 /    13 = 100.000
0, 22  acc:    49 /    50 =  98.000
0, 23  acc:     8 /     8 = 100.000
0, 24  acc:   935 /   948 =  98.629
0, 25  acc:    42 /    43 =  97.674
0, 26  acc:    79 /    79 = 100.000
0, 27  acc:    35 /    35 = 100.000
0, 28  acc:    35 /    36 =  97.222
0, 29  acc:     2 /     2 = 100.000
0, 30  acc:     5 /     5 = 100.000
0, 31  acc:     5 /     6 =  83.333
0, 32  acc:  2564 /  2588 =  99.073
0, 33  acc:   114 /   115 =  99.130
0, 34  acc:   103 /   104 =  99.038
0, 35  acc:    40 /    40 = 100.000
0, 36  acc:    39 /    40 =  97.500
0, 37  acc:     5 /     5 = 100.000
0, 38  acc:    14 /    15 =  93.333
0, 39  acc:     3 /     3 = 100.000
0, 40  acc:   901 /   905 =  99.558
0, 41  acc:    40 /    40 = 100.000
0, 42  acc:    37 /    38 =  97.368
0, 43  acc:    49 /    49 = 100.000
0, 44  acc:    27 /    27 = 100.000
0, 45  acc:     2 /     2 = 100.000
0, 46  acc:     7 /     7 = 100.000
0, 47  acc:     4 /     4 = 100.000
0, 48  acc:   441 /   444 =  99.324
0, 49  acc:     6 /     6 = 100.000
0, 50  acc:    25 /    25 = 100.000
0, 51  acc:    21 /    21 = 100.000
0, 52  acc:     9 /     9 = 100.000
0, 53  acc:     1 /     1 = 100.000
0, 54  acc:     5 /     5 = 100.000
0, 55  acc:     1 /     1 = 100.000
0, 56  acc:   479 /   483 =  99.172
0, 57  acc:    13 /    13 = 100.000
0, 58  acc:    23 /    23 = 100.000
0, 59  acc:    23 /    23 = 100.000
0, 60  acc:    17 /    17 = 100.000
0, 61  acc:     3 /     3 = 100.000
0, 62  acc:     7 /     7 = 100.000
0, 63  acc:     3 /     3 = 100.000
0, 64  acc:  2846 /  2874 =  99.026
0, 65  acc:   366 /   371 =  98.652
0, 66  acc:   268 /   273 =  98.168
0, 67  acc:   123 /   124 =  99.194
0, 68  acc:    64 /    64 = 100.000
0, 69  acc:     3 /     3 = 100.000
0, 70  acc:    25 /    25 = 100.000
0, 71  acc:     9 /     9 = 100.000
0, 72  acc:    82 /    85 =  96.471
0, 73  acc:     6 /     6 = 100.000
0, 74  acc:    15 /    15 = 100.000
0, 75  acc:    15 /    15 = 100.000
0, 76  acc:     5 /     5 = 100.000
0, 77  acc:     0 /     0 =     nan
0, 78  acc:     4 /     4 = 100.000
0, 79  acc:     1 /     1 = 100.000
0, 80  acc:    74 /    77 =  96.104
0, 81  acc:     7 /     7 = 100.000
0, 82  acc:    13 /    13 = 100.000
0, 83  acc:     7 /     7 = 100.000
0, 84  acc:     7 /     7 = 100.000
0, 85  acc:     1 /     1 = 100.000
0, 86  acc:     7 /     7 = 100.000
0, 87  acc:     2 /     2 = 100.000
0, 88  acc:     7 /     7 = 100.000
0, 89  acc:     2 /     2 = 100.000
0, 90  acc:     1 /     1 = 100.000
0, 91  acc:     1 /     1 = 100.000
0, 92  acc:     0 /     0 =     nan
0, 93  acc:     1 /     1 = 100.000
0, 94  acc:    71 /    72 =  98.611
0, 95  acc:     4 /     4 = 100.000
0, 96  acc:     5 /     5 = 100.000
0, 97  acc:     1 /     1 = 100.000
0, 98  acc:    11 /    11 = 100.000
0, 99  acc:     3 /     3 = 100.000
0, 100  acc:     2 /     2 = 100.000
0, 101  acc:    10 /    10 = 100.000
0, 102  acc:     2 /     2 = 100.000
0, 103  acc:     3 /     3 = 100.000
0, 104  acc:     0 /     0 =     nan
0, 105  acc:     1 /     1 = 100.000
0, 106  acc:    18 /    19 =  94.737
0, 107  acc:     1 /     1 = 100.000
0, 108  acc:     0 /     0 =     nan
0, 109  acc:     2 /     2 = 100.000
0, 110  acc:     4 /     4 = 100.000
0, 111  acc:     1 /     1 = 100.000
0, 112  acc:     5 /     7 =  71.429
0, 113  acc:     1 /     1 = 100.000
0, 114  acc:     1 /     1 = 100.000
0, 115  acc:     1 /     1 = 100.000
0, 116  acc:     1 /     1 = 100.000
0, 117  acc:  5860 /  5989 =  97.846
0, 118  acc:  1378 /  1412 =  97.592
0, 119  acc:   379 /   387 =  97.933
0, 120  acc:   392 /   395 =  99.241
0, 121  acc:    65 /    65 = 100.000
0, 122  acc:    17 /    19 =  89.474
0, 123  acc:     6 /     8 =  75.000
0, 124  acc:    23 /    23 = 100.000
0, 125  acc:   391 /   400 =  97.750
0, 126  acc:    94 /    95 =  98.947
0, 127  acc:    31 /    31 = 100.000
0, 128  acc:    34 /    34 = 100.000
0, 129  acc:     6 /     6 = 100.000
0, 130  acc:     4 /     4 = 100.000
0, 131  acc:     3 /     3 = 100.000
0, 132  acc:     0 /     0 =     nan
0, 133  acc:   154 /   162 =  95.062
0, 134  acc:    34 /    34 = 100.000
0, 135  acc:    18 /    20 =  90.000
0, 136  acc:    15 /    17 =  88.235
0, 137  acc:     6 /     6 = 100.000
0, 138  acc:     0 /     1 =   0.000
0, 139  acc:     0 /     0 =     nan
0, 140  acc:     0 /     0 =     nan
0, 141  acc:    37 /    43 =  86.047
0, 142  acc:     8 /     8 = 100.000
0, 143  acc:     3 /     3 = 100.000
0, 144  acc:     5 /     5 = 100.000
0, 145  acc:     1 /     1 = 100.000
0, 146  acc:     1 /     1 = 100.000
0, 147  acc:     0 /     0 =     nan
0, 148  acc:   102 /   104 =  98.077
0, 149  acc:    15 /    15 = 100.000
0, 150  acc:     6 /     6 = 100.000
0, 151  acc:     5 /     7 =  71.429
0, 152  acc:     2 /     2 = 100.000
0, 153  acc:     1 /     1 = 100.000
0, 154  acc:    29 /    29 = 100.000
0, 155  acc:     5 /     5 = 100.000
0, 156  acc:     1 /     1 = 100.000
0, 157  acc:     1 /     1 = 100.000
0, 158  acc:     1 /     2 =  50.000
0, 159  acc:    21 /    22 =  95.455
0, 160  acc:     1 /     1 = 100.000
0, 161  acc:     1 /     1 = 100.000
0, 162  acc:     2 /     2 = 100.000
0, 163  acc:     0 /     0 =     nan
0, 164  acc:    13 /    13 = 100.000
0, 165  acc:     1 /     1 = 100.000
0, 166  acc:     3 /     3 = 100.000
0, 167  acc:     1 /     1 = 100.000
0, 168  acc:     1 /     1 = 100.000
0, 169  acc:  1633 /  1657 =  98.552
0, 170  acc:   318 /   322 =  98.758
0, 171  acc:   119 /   119 = 100.000
0, 172  acc:   163 /   167 =  97.605
0, 173  acc:    22 /    22 = 100.000
0, 174  acc:     5 /     5 = 100.000
0, 175  acc:     3 /     3 = 100.000
0, 176  acc:    17 /    17 = 100.000
0, 177  acc:    85 /    85 = 100.000
0, 178  acc:    12 /    12 = 100.000
0, 179  acc:     7 /     8 =  87.500
0, 180  acc:     5 /     5 = 100.000
0, 181  acc:     3 /     3 = 100.000
0, 182  acc:     2 /     2 = 100.000
0, 183  acc:     4 /     4 = 100.000
0, 184  acc:    35 /    38 =  92.105
0, 185  acc:     5 /     5 = 100.000
0, 186  acc:     6 /     6 = 100.000
0, 187  acc:     8 /     8 = 100.000
0, 188  acc:     2 /     2 = 100.000
0, 189  acc:     2 /     2 = 100.000
0, 190  acc:     0 /     0 =     nan
0, 191  acc:    23 /    23 = 100.000
0, 192  acc:     1 /     1 = 100.000
0, 193  acc:     1 /     1 = 100.000
0, 194  acc:     2 /     2 = 100.000
0, 195  acc:     2 /     2 = 100.000
0, 196  acc:    36 /    37 =  97.297
0, 197  acc:     7 /     8 =  87.500
0, 198  acc:     2 /     3 =  66.667
0, 199  acc:     5 /     5 = 100.000
0, 200  acc:     3 /     3 = 100.000
0, 201  acc:     1 /     1 = 100.000
0, 202  acc:     0 /     0 =     nan
0, 203  acc:     1 /     1 = 100.000
0, 204  acc:    16 /    16 = 100.000
0, 205  acc:     2 /     2 = 100.000
0, 206  acc:     2 /     2 = 100.000
0, 207  acc:     0 /     1 =   0.000
0, 208  acc:     3 /     3 = 100.000
0, 209  acc:     6 /     6 = 100.000
0, 210  acc:     1 /     1 = 100.000
0, 211  acc:     1 /     1 = 100.000
0, 212  acc:     1 /     1 = 100.000
0, 213  acc:     1 /     1 = 100.000
0, 214  acc:     1 /     1 = 100.000
0, 215  acc:     0 /     0 =     nan
0, 216  acc:     5 /     5 = 100.000
0, 217  acc:     0 /     0 =     nan
0, 218  acc:     4 /     4 = 100.000
0, 219  acc:     0 /     0 =     nan
0, 220  acc:     1 /     1 = 100.000
0, 221  acc:     2 /     2 = 100.000
1, 0  acc:   496 / 12731 =   3.896
1, 1  acc:    88 /  1440 =   6.111
1, 2  acc:   114 /  2439 =   4.674
1, 3  acc:    57 /  1007 =   5.660
1, 4  acc:    36 /  1087 =   3.312
1, 5  acc:    11 /   196 =   5.612
1, 6  acc:     6 /   118 =   5.085
1, 7  acc:     1 /   134 =   0.746
1, 8  acc:    46 /  1112 =   4.137
1, 9  acc:     2 /    79 =   2.532
1, 10  acc:     3 /   104 =   2.885
1, 11  acc:     2 /    50 =   4.000
1, 12  acc:     6 /   167 =   3.593
1, 13  acc:     0 /    22 =   0.000
1, 14  acc:     2 /    28 =   7.143
1, 15  acc:     0 /    27 =   0.000
1, 16  acc:    76 /  1783 =   4.262
1, 17  acc:     4 /    70 =   5.714
1, 18  acc:     7 /   142 =   4.930
1, 19  acc:     1 /    54 =   1.852
1, 20  acc:     2 /    70 =   2.857
1, 21  acc:     1 /    11 =   9.091
1, 22  acc:     1 /    34 =   2.941
1, 23  acc:     1 /    11 =   9.091
1, 24  acc:     8 /   223 =   3.587
1, 25  acc:     0 /     5 =   0.000
1, 26  acc:     0 /    22 =   0.000
1, 27  acc:     0 /     3 =   0.000
1, 28  acc:     1 /    35 =   2.857
1, 29  acc:     0 /     2 =   0.000
1, 30  acc:     0 /     6 =   0.000
1, 31  acc:     0 /     2 =   0.000
1, 32  acc:    14 /   361 =   3.878
1, 33  acc:     1 /    19 =   5.263
1, 34  acc:     1 /    16 =   6.250
1, 35  acc:     0 /     6 =   0.000
1, 36  acc:     0 /    12 =   0.000
1, 37  acc:     0 /     2 =   0.000
1, 38  acc:     0 /     7 =   0.000
1, 39  acc:     0 /     1 =   0.000
1, 40  acc:     2 /    84 =   2.381
1, 41  acc:     1 /     4 =  25.000
1, 42  acc:     0 /     3 =   0.000
1, 43  acc:     0 /     4 =   0.000
1, 44  acc:     0 /     6 =   0.000
1, 45  acc:     0 /     1 =   0.000
1, 46  acc:     0 /     3 =   0.000
1, 47  acc:     0 /     0 =     nan
1, 48  acc:     5 /   117 =   4.274
1, 49  acc:     0 /     3 =   0.000
1, 50  acc:     1 /     9 =  11.111
1, 51  acc:     0 /     5 =   0.000
1, 52  acc:     2 /     4 =  50.000
1, 53  acc:     0 /     0 =     nan
1, 54  acc:     1 /     8 =  12.500
1, 55  acc:     0 /     0 =     nan
1, 56  acc:     2 /    88 =   2.273
1, 57  acc:     0 /     4 =   0.000
1, 58  acc:     0 /     4 =   0.000
1, 59  acc:     0 /     3 =   0.000
1, 60  acc:     0 /     7 =   0.000
1, 61  acc:     0 /     0 =     nan
1, 62  acc:     0 /     5 =   0.000
1, 63  acc:     0 /     0 =     nan
1, 64  acc:    60 /  1225 =   4.898
1, 65  acc:    12 /   175 =   6.857
1, 66  acc:     7 /   108 =   6.481
1, 67  acc:     2 /    46 =   4.348
1, 68  acc:     3 /    57 =   5.263
1, 69  acc:     0 /     9 =   0.000
1, 70  acc:     1 /    18 =   5.556
1, 71  acc:     0 /     5 =   0.000
1, 72  acc:     1 /    26 =   3.846
1, 73  acc:     0 /     1 =   0.000
1, 74  acc:     0 /     1 =   0.000
1, 75  acc:     0 /     1 =   0.000
1, 76  acc:     0 /     5 =   0.000
1, 77  acc:     0 /     1 =   0.000
1, 78  acc:     0 /     0 =     nan
1, 79  acc:     0 /     1 =   0.000
1, 80  acc:     2 /    46 =   4.348
1, 81  acc:     1 /     6 =  16.667
1, 82  acc:     2 /    13 =  15.385
1, 83  acc:     0 /     2 =   0.000
1, 84  acc:     0 /     2 =   0.000
1, 85  acc:     0 /     0 =     nan
1, 86  acc:     0 /     5 =   0.000
1, 87  acc:     0 /     0 =     nan
1, 88  acc:     0 /     5 =   0.000
1, 89  acc:     0 /     0 =     nan
1, 90  acc:     0 /     0 =     nan
1, 91  acc:     0 /     0 =     nan
1, 92  acc:     0 /     1 =   0.000
1, 93  acc:     0 /     1 =   0.000
1, 94  acc:     5 /    36 =  13.889
1, 95  acc:     1 /     5 =  20.000
1, 96  acc:     0 /     3 =   0.000
1, 97  acc:     0 /     0 =     nan
1, 98  acc:     0 /     5 =   0.000
1, 99  acc:     0 /     0 =     nan
1, 100  acc:     0 /     5 =   0.000
1, 101  acc:     0 /     1 =   0.000
1, 102  acc:     0 /     0 =     nan
1, 103  acc:     0 /     0 =     nan
1, 104  acc:     0 /     1 =   0.000
1, 105  acc:     0 /     0 =     nan
1, 106  acc:     0 /     6 =   0.000
1, 107  acc:     0 /     0 =     nan
1, 108  acc:     0 /     5 =   0.000
1, 109  acc:     0 /     0 =     nan
1, 110  acc:     0 /     2 =   0.000
1, 111  acc:     0 /     1 =   0.000
1, 112  acc:     0 /     3 =   0.000
1, 113  acc:     0 /     0 =     nan
1, 114  acc:     0 /     0 =     nan
1, 115  acc:     0 /     0 =     nan
1, 116  acc:     0 /     0 =     nan
1, 117  acc:   176 /  2098 =   8.389
1, 118  acc:    35 /   495 =   7.071
1, 119  acc:    11 /   127 =   8.661
1, 120  acc:     9 /   134 =   6.716
1, 121  acc:     3 /    36 =   8.333
1, 122  acc:     1 /    18 =   5.556
1, 123  acc:     0 /     7 =   0.000
1, 124  acc:     1 /     9 =  11.111
1, 125  acc:    10 /   130 =   7.692
1, 126  acc:     1 /    31 =   3.226
1, 127  acc:     0 /     7 =   0.000
1, 128  acc:     1 /    11 =   9.091
1, 129  acc:     0 /     4 =   0.000
1, 130  acc:     0 /     2 =   0.000
1, 131  acc:     0 /     0 =     nan
1, 132  acc:     0 /     2 =   0.000
1, 133  acc:    13 /   116 =  11.207
1, 134  acc:     1 /    22 =   4.545
1, 135  acc:     0 /    10 =   0.000
1, 136  acc:     1 /    11 =   9.091
1, 137  acc:     0 /     2 =   0.000
1, 138  acc:     0 /     1 =   0.000
1, 139  acc:     0 /     1 =   0.000
1, 140  acc:     1 /     1 = 100.000
1, 141  acc:     2 /    28 =   7.143
1, 142  acc:     0 /     5 =   0.000
1, 143  acc:     0 /     1 =   0.000
1, 144  acc:     0 /     1 =   0.000
1, 145  acc:     1 /     1 = 100.000
1, 146  acc:     0 /     0 =     nan
1, 147  acc:     0 /     1 =   0.000
1, 148  acc:     5 /    49 =  10.204
1, 149  acc:     1 /     5 =  20.000
1, 150  acc:     0 /     2 =   0.000
1, 151  acc:     1 /     5 =  20.000
1, 152  acc:     0 /     1 =   0.000
1, 153  acc:     0 /     1 =   0.000
1, 154  acc:     0 /    10 =   0.000
1, 155  acc:     0 /     0 =     nan
1, 156  acc:     0 /     0 =     nan
1, 157  acc:     0 /     2 =   0.000
1, 158  acc:     0 /     0 =     nan
1, 159  acc:     1 /     8 =  12.500
1, 160  acc:     0 /     1 =   0.000
1, 161  acc:     0 /     0 =     nan
1, 162  acc:     0 /     0 =     nan
1, 163  acc:     1 /     1 = 100.000
1, 164  acc:     0 /     6 =   0.000
1, 165  acc:     0 /     1 =   0.000
1, 166  acc:     0 /     0 =     nan
1, 167  acc:     0 /     0 =     nan
1, 168  acc:     0 /     0 =     nan
1, 169  acc:    33 /   828 =   3.986
1, 170  acc:    11 /   159 =   6.918
1, 171  acc:     0 /    62 =   0.000
1, 172  acc:     4 /    53 =   7.547
1, 173  acc:     1 /    15 =   6.667
1, 174  acc:     0 /     3 =   0.000
1, 175  acc:     1 /     2 =  50.000
1, 176  acc:     0 /    11 =   0.000
1, 177  acc:     0 /    18 =   0.000
1, 178  acc:     1 /     3 =  33.333
1, 179  acc:     0 /     2 =   0.000
1, 180  acc:     0 /     2 =   0.000
1, 181  acc:     0 /     2 =   0.000
1, 182  acc:     0 /     1 =   0.000
1, 183  acc:     0 /     4 =   0.000
1, 184  acc:     4 /    33 =  12.121
1, 185  acc:     0 /     8 =   0.000
1, 186  acc:     1 /     5 =  20.000
1, 187  acc:     0 /     2 =   0.000
1, 188  acc:     0 /     1 =   0.000
1, 189  acc:     0 /     1 =   0.000
1, 190  acc:     0 /     3 =   0.000
1, 191  acc:     0 /     2 =   0.000
1, 192  acc:     0 /     3 =   0.000
1, 193  acc:     0 /     1 =   0.000
1, 194  acc:     0 /     1 =   0.000
1, 195  acc:     0 /     1 =   0.000
1, 196  acc:     2 /    25 =   8.000
1, 197  acc:     0 /     3 =   0.000
1, 198  acc:     0 /     0 =     nan
1, 199  acc:     0 /     0 =     nan
1, 200  acc:     0 /     0 =     nan
1, 201  acc:     0 /     0 =     nan
1, 202  acc:     1 /     1 = 100.000
1, 203  acc:     0 /     0 =     nan
1, 204  acc:     0 /     4 =   0.000
1, 205  acc:     0 /     1 =   0.000
1, 206  acc:     0 /     0 =     nan
1, 207  acc:     0 /     0 =     nan
1, 208  acc:     0 /     0 =     nan
1, 209  acc:     1 /     5 =  20.000
1, 210  acc:     0 /     0 =     nan
1, 211  acc:     1 /     1 = 100.000
1, 212  acc:     0 /     1 =   0.000
1, 213  acc:     0 /     3 =   0.000
1, 214  acc:     0 /     0 =     nan
1, 215  acc:     0 /     1 =   0.000
1, 216  acc:     0 /     0 =     nan
1, 217  acc:     0 /     3 =   0.000
1, 218  acc:     0 /     2 =   0.000
1, 219  acc:     0 /     2 =   0.000
1, 220  acc:     0 /     0 =     nan
1, 221  acc:     0 /     0 =     nan
--------------------------------------
Average acc: 238276 / 269038 =  88.566
Robust  acc:     0 /     1 =   0.000
--------------------------------------
Validating:
Accuracies by groups:
0, 0  acc: 24179 / 25373 =  95.294
0, 1  acc: 30045 / 31282 =  96.046
0, 2  acc:  5854 /  6155 =  95.110
0, 3  acc: 23831 / 24292 =  98.102
0, 4  acc: 10209 / 10829 =  94.275
0, 5  acc:  5267 /  5541 =  95.055
0, 6  acc:  6214 /  6785 =  91.584
0, 7  acc: 10468 / 12016 =  87.117
1, 0  acc:  1371 /  4437 =  30.899
1, 1  acc:  1428 /  4962 =  28.779
1, 2  acc:   567 /  2265 =  25.033
1, 3  acc:   573 /  2446 =  23.426
1, 4  acc:   875 /  3125 =  28.000
1, 5  acc:   291 /  1003 =  29.013
1, 6  acc:  1007 /  3111 =  32.369
1, 7  acc:  1936 /  4682 =  41.350
--------------------------------------
Average acc: 124115 / 148304 =  83.690
Robust  acc:   573 /  2446 =  23.426
--------------------------------------
Save biased model at epoch 0
replace: True
Checkpoint saved at ./model/civilcomments/config/JTT_sgd_no_gce_model_b_epoch0_seed12.pt
New max average-worst acc gap: 60.26358192277391
bias model - Saving best checkpoint at epoch 0
replace: True
Checkpoint saved at ./model/civilcomments/config/JTT_sgd_no_gce_model_b_worst_avg_gap_best_epoch0_seed12.pt
male 0
[106466  12092  13021   2203]
1 1 12092
3 3 2203
female 1
[104379  14179  12954   2270]
1 1 14179
3 3 2270
LGBTQ 2
[115348   3210  14008   1216]
1 1 3210
3 3 1216
christian 3
[106457  12101  13964   1260]
1 1 12101
3 3 1260
muslim 4
[113203   5355  13597   1627]
1 1 5355
3 3 1627
other_religions 5
[115578   2980  14704    520]
1 1 2980
3 3 520
black 6
[115223   3335  13687   1537]
1 1 3335
3 3 1537
white 7
[112835   5723  12978   2246]
1 1 5723
3 3 2246
Robust Acc: 23.571 | Best Acc: 97.810
-------------------------------------
Training, Epoch 0:
Accuracies by groups:
0, 0  acc: 11515 / 12092 =  95.228
0, 1  acc: 13616 / 14179 =  96.029
0, 2  acc:  3040 /  3210 =  94.704
0, 3  acc: 11836 / 12101 =  97.810
0, 4  acc:  5032 /  5355 =  93.968
0, 5  acc:  2847 /  2980 =  95.537
0, 6  acc:  3050 /  3335 =  91.454
0, 7  acc:  4975 /  5723 =  86.930
1, 0  acc:   719 /  2203 =  32.637
1, 1  acc:   679 /  2270 =  29.912
1, 2  acc:   295 /  1216 =  24.260
1, 3  acc:   297 /  1260 =  23.571
1, 4  acc:   430 /  1627 =  26.429
1, 5  acc:   158 /   520 =  30.385
1, 6  acc:   537 /  1537 =  34.938
1, 7  acc:   958 /  2246 =  42.654
------------------------------------
Average acc: 59984 / 71854 =  83.480
Robust  acc:   297 /  1260 =  23.571
------------------------------------
Testing:
Accuracies by groups:
0, 0  acc: 11515 / 12092 =  95.228
0, 1  acc: 13616 / 14179 =  96.029
0, 2  acc:  3040 /  3210 =  94.704
0, 3  acc: 11836 / 12101 =  97.810
0, 4  acc:  5032 /  5355 =  93.968
0, 5  acc:  2847 /  2980 =  95.537
0, 6  acc:  3050 /  3335 =  91.454
0, 7  acc:  4975 /  5723 =  86.930
1, 0  acc:   719 /  2203 =  32.637
1, 1  acc:   679 /  2270 =  29.912
1, 2  acc:   295 /  1216 =  24.260
1, 3  acc:   297 /  1260 =  23.571
1, 4  acc:   430 /  1627 =  26.429
1, 5  acc:   158 /   520 =  30.385
1, 6  acc:   537 /  1537 =  34.938
1, 7  acc:   958 /  2246 =  42.654
------------------------------------
Average acc: 59984 / 71854 =  83.480
Robust  acc:   297 /  1260 =  23.571
------------------------------------
replace: True
Checkpoint saved at ./model/civilcomments/config/bias-end_seed12.pt
training biased model is done
