
=== Start adding workers ===
=> Add worker SGDMWorker(index=0, momentum=0.9)
=> Add worker SGDMWorker(index=1, momentum=0.9)
=> Add worker SGDMWorker(index=2, momentum=0.9)
=> Add worker SGDMWorker(index=3, momentum=0.9)
=> Add worker SGDMWorker(index=4, momentum=0.9)
=> Add worker SGDMWorker(index=5, momentum=0.9)
=> Add worker SGDMWorker(index=6, momentum=0.9)
=> Add worker SGDMWorker(index=7, momentum=0.9)
=> Add worker SGDMWorker(index=8, momentum=0.9)
=> Add worker SGDMWorker(index=9, momentum=0.9)

=== Start adding graph ===
<codes.graph_utils.Dumbbell object at 0x7fbef87ff6d0>

Train epoch 1
[E 1B0  |    320/60000 (  1%) ] Loss: 2.2959 top1= 10.0000

=== Peeking data label distribution E1B0 ===
Worker 0 has targets: tensor([4, 8, 8, 6, 9], device='cuda:0')
Worker 1 has targets: tensor([5, 3, 6, 0, 9], device='cuda:0')
Worker 2 has targets: tensor([2, 9, 9, 3, 1], device='cuda:0')
Worker 3 has targets: tensor([6, 9, 8, 1, 2], device='cuda:0')
Worker 4 has targets: tensor([5, 8, 9, 1, 8], device='cuda:0')
Worker 5 has targets: tensor([6, 7, 5, 2, 3], device='cuda:0')
Worker 6 has targets: tensor([3, 2, 8, 7, 9], device='cuda:0')
Worker 7 has targets: tensor([3, 8, 7, 8, 7], device='cuda:0')
Worker 8 has targets: tensor([8, 0, 2, 4, 8], device='cuda:0')
Worker 9 has targets: tensor([5, 3, 4, 6, 3], device='cuda:0')


[E 1B10 |   3520/60000 (  6%) ] Loss: 1.9205 top1= 47.1875
[E 1B20 |   6720/60000 ( 11%) ] Loss: 0.8712 top1= 72.5000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.4969 top1= 84.4050


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.5069 top1= 83.3834


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.5272 top1= 84.3149

Train epoch 2
[E 2B0  |    320/60000 (  1%) ] Loss: 0.6914 top1= 80.9375
[E 2B10 |   3520/60000 (  6%) ] Loss: 0.4375 top1= 84.6875
[E 2B20 |   6720/60000 ( 11%) ] Loss: 0.3834 top1= 88.7500

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2935 top1= 91.3662


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2993 top1= 91.0457


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.3079 top1= 90.9956

Train epoch 3
[E 3B0  |    320/60000 (  1%) ] Loss: 0.3601 top1= 87.5000
[E 3B10 |   3520/60000 (  6%) ] Loss: 0.2383 top1= 90.9375
[E 3B20 |   6720/60000 ( 11%) ] Loss: 0.2512 top1= 92.5000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2380 top1= 93.0389


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2384 top1= 92.9387


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2482 top1= 92.6983

Train epoch 4
[E 4B0  |    320/60000 (  1%) ] Loss: 0.2034 top1= 94.3750
[E 4B10 |   3520/60000 (  6%) ] Loss: 0.1502 top1= 95.6250
[E 4B20 |   6720/60000 ( 11%) ] Loss: 0.1739 top1= 95.3125

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2076 top1= 93.7500


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2108 top1= 93.6398


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2130 top1= 93.5897

Train epoch 5
[E 5B0  |    320/60000 (  1%) ] Loss: 0.1355 top1= 95.6250
[E 5B10 |   3520/60000 (  6%) ] Loss: 0.0997 top1= 96.5625
[E 5B20 |   6720/60000 ( 11%) ] Loss: 0.1154 top1= 96.2500

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2018 top1= 94.0505


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2002 top1= 93.9403


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2168 top1= 93.7400

Train epoch 6
[E 6B0  |    320/60000 (  1%) ] Loss: 0.0979 top1= 97.1875
[E 6B10 |   3520/60000 (  6%) ] Loss: 0.0600 top1= 98.7500
[E 6B20 |   6720/60000 ( 11%) ] Loss: 0.0739 top1= 97.5000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2001 top1= 94.3610


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2124 top1= 93.9403


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2061 top1= 94.1306

Train epoch 7
[E 7B0  |    320/60000 (  1%) ] Loss: 0.0802 top1= 98.1250
[E 7B10 |   3520/60000 (  6%) ] Loss: 0.0394 top1= 99.0625
[E 7B20 |   6720/60000 ( 11%) ] Loss: 0.0313 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2106 top1= 94.2208


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2177 top1= 93.9002


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2214 top1= 93.9103

Train epoch 8
[E 8B0  |    320/60000 (  1%) ] Loss: 0.0696 top1= 97.5000
[E 8B10 |   3520/60000 (  6%) ] Loss: 0.0359 top1= 99.3750
[E 8B20 |   6720/60000 ( 11%) ] Loss: 0.0378 top1= 98.4375

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1994 top1= 94.5312


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1934 top1= 94.6815


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2223 top1= 93.7600

Train epoch 9
[E 9B0  |    320/60000 (  1%) ] Loss: 0.0384 top1= 98.7500
[E 9B10 |   3520/60000 (  6%) ] Loss: 0.0294 top1= 99.3750
[E 9B20 |   6720/60000 ( 11%) ] Loss: 0.0247 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1903 top1= 95.0721


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2005 top1= 94.8417


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1928 top1= 95.0120

Train epoch 10
[E10B0  |    320/60000 (  1%) ] Loss: 0.0418 top1= 98.4375
[E10B10 |   3520/60000 (  6%) ] Loss: 0.0348 top1= 98.4375
[E10B20 |   6720/60000 ( 11%) ] Loss: 0.0204 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2080 top1= 94.6514


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2290 top1= 94.2308


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1998 top1= 94.7616

Train epoch 11
[E11B0  |    320/60000 (  1%) ] Loss: 0.0393 top1= 98.4375
[E11B10 |   3520/60000 (  6%) ] Loss: 0.0179 top1= 99.0625
[E11B20 |   6720/60000 ( 11%) ] Loss: 0.0170 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1942 top1= 94.9519


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2122 top1= 94.6314


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2008 top1= 94.9920

Train epoch 12
[E12B0  |    320/60000 (  1%) ] Loss: 0.0192 top1= 99.3750
[E12B10 |   3520/60000 (  6%) ] Loss: 0.0115 top1= 99.6875
[E12B20 |   6720/60000 ( 11%) ] Loss: 0.0088 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1858 top1= 95.3926


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1965 top1= 95.1022


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1910 top1= 95.2724

Train epoch 13
[E13B0  |    320/60000 (  1%) ] Loss: 0.0068 top1=100.0000
[E13B10 |   3520/60000 (  6%) ] Loss: 0.0171 top1= 99.3750
[E13B20 |   6720/60000 ( 11%) ] Loss: 0.0062 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1903 top1= 95.5429


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1910 top1= 95.5429


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1987 top1= 95.2925

Train epoch 14
[E14B0  |    320/60000 (  1%) ] Loss: 0.0043 top1=100.0000
[E14B10 |   3520/60000 (  6%) ] Loss: 0.0043 top1=100.0000
[E14B20 |   6720/60000 ( 11%) ] Loss: 0.0044 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2021 top1= 95.3225


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2071 top1= 95.2724


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2032 top1= 95.2123

Train epoch 15
[E15B0  |    320/60000 (  1%) ] Loss: 0.0056 top1=100.0000
[E15B10 |   3520/60000 (  6%) ] Loss: 0.0053 top1=100.0000
[E15B20 |   6720/60000 ( 11%) ] Loss: 0.0053 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2089 top1= 95.4627


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2094 top1= 95.3526


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2106 top1= 95.4227

Train epoch 16
[E16B0  |    320/60000 (  1%) ] Loss: 0.0062 top1=100.0000
[E16B10 |   3520/60000 (  6%) ] Loss: 0.0047 top1=100.0000
[E16B20 |   6720/60000 ( 11%) ] Loss: 0.0038 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2043 top1= 95.4627


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2065 top1= 95.4928


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2034 top1= 95.4427

Train epoch 17
[E17B0  |    320/60000 (  1%) ] Loss: 0.0031 top1=100.0000
[E17B10 |   3520/60000 (  6%) ] Loss: 0.0023 top1=100.0000
[E17B20 |   6720/60000 ( 11%) ] Loss: 0.0038 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2059 top1= 95.6330


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2073 top1= 95.5529


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2067 top1= 95.6530

Train epoch 18
[E18B0  |    320/60000 (  1%) ] Loss: 0.0024 top1=100.0000
[E18B10 |   3520/60000 (  6%) ] Loss: 0.0017 top1=100.0000
[E18B20 |   6720/60000 ( 11%) ] Loss: 0.0016 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2070 top1= 95.7031


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2042 top1= 95.7632


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2130 top1= 95.5429

Train epoch 19
[E19B0  |    320/60000 (  1%) ] Loss: 0.0016 top1=100.0000
[E19B10 |   3520/60000 (  6%) ] Loss: 0.0014 top1=100.0000
[E19B20 |   6720/60000 ( 11%) ] Loss: 0.0021 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2061 top1= 95.7332


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2043 top1= 95.7632


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2084 top1= 95.7031

Train epoch 20
[E20B0  |    320/60000 (  1%) ] Loss: 0.0010 top1=100.0000
[E20B10 |   3520/60000 (  6%) ] Loss: 0.0012 top1=100.0000
[E20B20 |   6720/60000 ( 11%) ] Loss: 0.0010 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2082 top1= 95.6931


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2073 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2093 top1= 95.6731

Train epoch 21
[E21B0  |    320/60000 (  1%) ] Loss: 0.0009 top1=100.0000
[E21B10 |   3520/60000 (  6%) ] Loss: 0.0009 top1=100.0000
[E21B20 |   6720/60000 ( 11%) ] Loss: 0.0009 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2100 top1= 95.7031


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2092 top1= 95.7833


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2110 top1= 95.6931

Train epoch 22
[E22B0  |    320/60000 (  1%) ] Loss: 0.0008 top1=100.0000
[E22B10 |   3520/60000 (  6%) ] Loss: 0.0008 top1=100.0000
[E22B20 |   6720/60000 ( 11%) ] Loss: 0.0008 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2121 top1= 95.6931


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2114 top1= 95.7632


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2131 top1= 95.6931

Train epoch 23
[E23B0  |    320/60000 (  1%) ] Loss: 0.0007 top1=100.0000
[E23B10 |   3520/60000 (  6%) ] Loss: 0.0007 top1=100.0000
[E23B20 |   6720/60000 ( 11%) ] Loss: 0.0008 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2140 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2133 top1= 95.7432


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2149 top1= 95.6831

Train epoch 24
[E24B0  |    320/60000 (  1%) ] Loss: 0.0007 top1=100.0000
[E24B10 |   3520/60000 (  6%) ] Loss: 0.0007 top1=100.0000
[E24B20 |   6720/60000 ( 11%) ] Loss: 0.0007 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2157 top1= 95.7232


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2150 top1= 95.7432


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2165 top1= 95.6731

Train epoch 25
[E25B0  |    320/60000 (  1%) ] Loss: 0.0006 top1=100.0000
[E25B10 |   3520/60000 (  6%) ] Loss: 0.0006 top1=100.0000
[E25B20 |   6720/60000 ( 11%) ] Loss: 0.0007 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2173 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2167 top1= 95.7432


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2181 top1= 95.6831

Train epoch 26
[E26B0  |    320/60000 (  1%) ] Loss: 0.0006 top1=100.0000
[E26B10 |   3520/60000 (  6%) ] Loss: 0.0006 top1=100.0000
[E26B20 |   6720/60000 ( 11%) ] Loss: 0.0006 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2189 top1= 95.6931


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2182 top1= 95.7432


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2196 top1= 95.6731

Train epoch 27
[E27B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E27B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E27B20 |   6720/60000 ( 11%) ] Loss: 0.0006 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2203 top1= 95.7031


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2197 top1= 95.7432


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2211 top1= 95.6631

Train epoch 28
[E28B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E28B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E28B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2217 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2211 top1= 95.7332


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2224 top1= 95.6631

Train epoch 29
[E29B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E29B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E29B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2230 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2224 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2237 top1= 95.6631

Train epoch 30
[E30B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E30B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E30B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2243 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2237 top1= 95.7131


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2250 top1= 95.6831

