
=== Start adding workers ===
=> Add worker SGDMWorker(index=0, momentum=0.9)
=> Add worker SGDMWorker(index=1, momentum=0.9)
=> Add worker SGDMWorker(index=2, momentum=0.9)
=> Add worker SGDMWorker(index=3, momentum=0.9)
=> Add worker SGDMWorker(index=4, momentum=0.9)
=> Add worker SGDMWorker(index=5, momentum=0.9)
=> Add worker SGDMWorker(index=6, momentum=0.9)
=> Add worker SGDMWorker(index=7, momentum=0.9)
=> Add worker SGDMWorker(index=8, momentum=0.9)
=> Add worker SGDMWorker(index=9, momentum=0.9)

=== Start adding graph ===
<codes.graph_utils.Dumbbell object at 0x7f2e50a7b6d0>

Train epoch 1
[E 1B0  |    320/60000 (  1%) ] Loss: 2.2959 top1= 10.0000

=== Peeking data label distribution E1B0 ===
Worker 0 has targets: tensor([4, 8, 8, 6, 9], device='cuda:0')
Worker 1 has targets: tensor([5, 3, 6, 0, 9], device='cuda:0')
Worker 2 has targets: tensor([2, 9, 9, 3, 1], device='cuda:0')
Worker 3 has targets: tensor([6, 9, 8, 1, 2], device='cuda:0')
Worker 4 has targets: tensor([5, 8, 9, 1, 8], device='cuda:0')
Worker 5 has targets: tensor([6, 7, 5, 2, 3], device='cuda:0')
Worker 6 has targets: tensor([3, 2, 8, 7, 9], device='cuda:0')
Worker 7 has targets: tensor([3, 8, 7, 8, 7], device='cuda:0')
Worker 8 has targets: tensor([8, 0, 2, 4, 8], device='cuda:0')
Worker 9 has targets: tensor([5, 3, 4, 6, 3], device='cuda:0')


[E 1B10 |   3520/60000 (  6%) ] Loss: 1.9224 top1= 46.5625
[E 1B20 |   6720/60000 ( 11%) ] Loss: 0.8763 top1= 72.8125

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.5013 top1= 84.1146


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.5035 top1= 83.2833


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.5506 top1= 83.3133

Train epoch 2
[E 2B0  |    320/60000 (  1%) ] Loss: 0.6920 top1= 80.6250
[E 2B10 |   3520/60000 (  6%) ] Loss: 0.4575 top1= 84.6875
[E 2B20 |   6720/60000 ( 11%) ] Loss: 0.3939 top1= 88.4375

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.3005 top1= 90.9756


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.3071 top1= 90.6550


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.3262 top1= 90.3245

Train epoch 3
[E 3B0  |    320/60000 (  1%) ] Loss: 0.3463 top1= 88.1250
[E 3B10 |   3520/60000 (  6%) ] Loss: 0.2230 top1= 93.1250
[E 3B20 |   6720/60000 ( 11%) ] Loss: 0.2607 top1= 92.5000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2430 top1= 92.8986


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2532 top1= 92.4179


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2582 top1= 92.3377

Train epoch 4
[E 4B0  |    320/60000 (  1%) ] Loss: 0.1690 top1= 94.6875
[E 4B10 |   3520/60000 (  6%) ] Loss: 0.1292 top1= 95.6250
[E 4B20 |   6720/60000 ( 11%) ] Loss: 0.1381 top1= 95.9375

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2207 top1= 93.5597


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2312 top1= 93.2492


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2339 top1= 93.0088

Train epoch 5
[E 5B0  |    320/60000 (  1%) ] Loss: 0.1029 top1= 96.8750
[E 5B10 |   3520/60000 (  6%) ] Loss: 0.0663 top1= 97.5000
[E 5B20 |   6720/60000 ( 11%) ] Loss: 0.1046 top1= 96.8750

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2403 top1= 93.3494


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2533 top1= 92.8886


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2692 top1= 92.3878

Train epoch 6
[E 6B0  |    320/60000 (  1%) ] Loss: 0.1049 top1= 95.6250
[E 6B10 |   3520/60000 (  6%) ] Loss: 0.0416 top1= 99.0625
[E 6B20 |   6720/60000 ( 11%) ] Loss: 0.0301 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2397 top1= 93.3894


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2614 top1= 92.8385


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2623 top1= 92.7784

Train epoch 7
[E 7B0  |    320/60000 (  1%) ] Loss: 0.0674 top1= 97.5000
[E 7B10 |   3520/60000 (  6%) ] Loss: 0.0462 top1= 99.0625
[E 7B20 |   6720/60000 ( 11%) ] Loss: 0.0456 top1= 98.1250

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2295 top1= 93.8902


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2436 top1= 93.4595


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2502 top1= 93.3594

Train epoch 8
[E 8B0  |    320/60000 (  1%) ] Loss: 0.0546 top1= 99.0625
[E 8B10 |   3520/60000 (  6%) ] Loss: 0.0379 top1= 97.8125
[E 8B20 |   6720/60000 ( 11%) ] Loss: 0.0275 top1= 99.0625

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2095 top1= 94.6114


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2413 top1= 94.1406


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2226 top1= 93.9303

Train epoch 9
[E 9B0  |    320/60000 (  1%) ] Loss: 0.0415 top1= 98.7500
[E 9B10 |   3520/60000 (  6%) ] Loss: 0.0248 top1= 99.3750
[E 9B20 |   6720/60000 ( 11%) ] Loss: 0.0318 top1= 99.3750

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2335 top1= 93.9804


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2711 top1= 93.4395


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2557 top1= 93.2993

Train epoch 10
[E10B0  |    320/60000 (  1%) ] Loss: 0.0379 top1= 98.4375
[E10B10 |   3520/60000 (  6%) ] Loss: 0.0152 top1=100.0000
[E10B20 |   6720/60000 ( 11%) ] Loss: 0.0137 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2144 top1= 94.8417


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2261 top1= 94.5913


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2371 top1= 94.0104

Train epoch 11
[E11B0  |    320/60000 (  1%) ] Loss: 0.0110 top1= 99.6875
[E11B10 |   3520/60000 (  6%) ] Loss: 0.0185 top1= 99.6875
[E11B20 |   6720/60000 ( 11%) ] Loss: 0.0058 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2091 top1= 94.9319


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2192 top1= 94.6014


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2282 top1= 94.2107

Train epoch 12
[E12B0  |    320/60000 (  1%) ] Loss: 0.0068 top1=100.0000
[E12B10 |   3520/60000 (  6%) ] Loss: 0.0045 top1=100.0000
[E12B20 |   6720/60000 ( 11%) ] Loss: 0.0083 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2219 top1= 94.7716


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2386 top1= 94.5413


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2197 top1= 94.5913

Train epoch 13
[E13B0  |    320/60000 (  1%) ] Loss: 0.0091 top1= 99.6875
[E13B10 |   3520/60000 (  6%) ] Loss: 0.0084 top1=100.0000
[E13B20 |   6720/60000 ( 11%) ] Loss: 0.0047 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2298 top1= 94.6715


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2405 top1= 94.4912


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2317 top1= 94.4111

Train epoch 14
[E14B0  |    320/60000 (  1%) ] Loss: 0.0068 top1=100.0000
[E14B10 |   3520/60000 (  6%) ] Loss: 0.0022 top1=100.0000
[E14B20 |   6720/60000 ( 11%) ] Loss: 0.0051 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2215 top1= 95.0621


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2297 top1= 95.0120


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2220 top1= 94.8317

Train epoch 15
[E15B0  |    320/60000 (  1%) ] Loss: 0.0022 top1=100.0000
[E15B10 |   3520/60000 (  6%) ] Loss: 0.0018 top1=100.0000
[E15B20 |   6720/60000 ( 11%) ] Loss: 0.0019 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2213 top1= 95.0921


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2283 top1= 95.1322


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2201 top1= 95.0220

Train epoch 16
[E16B0  |    320/60000 (  1%) ] Loss: 0.0020 top1=100.0000
[E16B10 |   3520/60000 (  6%) ] Loss: 0.0017 top1=100.0000
[E16B20 |   6720/60000 ( 11%) ] Loss: 0.0020 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2237 top1= 95.0821


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2283 top1= 95.1923


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2236 top1= 95.0120

Train epoch 17
[E17B0  |    320/60000 (  1%) ] Loss: 0.0016 top1=100.0000
[E17B10 |   3520/60000 (  6%) ] Loss: 0.0018 top1=100.0000
[E17B20 |   6720/60000 ( 11%) ] Loss: 0.0024 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2250 top1= 95.1022


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2283 top1= 95.2324


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2253 top1= 95.1122

Train epoch 18
[E18B0  |    320/60000 (  1%) ] Loss: 0.0019 top1=100.0000
[E18B10 |   3520/60000 (  6%) ] Loss: 0.0020 top1=100.0000
[E18B20 |   6720/60000 ( 11%) ] Loss: 0.0027 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2254 top1= 95.1923


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2278 top1= 95.2324


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2261 top1= 95.1122

Train epoch 19
[E19B0  |    320/60000 (  1%) ] Loss: 0.0021 top1=100.0000
[E19B10 |   3520/60000 (  6%) ] Loss: 0.0022 top1=100.0000
[E19B20 |   6720/60000 ( 11%) ] Loss: 0.0029 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2256 top1= 95.2524


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2273 top1= 95.2825


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2268 top1= 95.1322

Train epoch 20
[E20B0  |    320/60000 (  1%) ] Loss: 0.0024 top1=100.0000
[E20B10 |   3520/60000 (  6%) ] Loss: 0.0024 top1=100.0000
[E20B20 |   6720/60000 ( 11%) ] Loss: 0.0031 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2259 top1= 95.2524


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2267 top1= 95.3325


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2274 top1= 95.2224

Train epoch 21
[E21B0  |    320/60000 (  1%) ] Loss: 0.0027 top1=100.0000
[E21B10 |   3520/60000 (  6%) ] Loss: 0.0026 top1=100.0000
[E21B20 |   6720/60000 ( 11%) ] Loss: 0.0033 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2256 top1= 95.2825


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2261 top1= 95.3225


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2272 top1= 95.3025

Train epoch 22
[E22B0  |    320/60000 (  1%) ] Loss: 0.0029 top1=100.0000
[E22B10 |   3520/60000 (  6%) ] Loss: 0.0027 top1=100.0000
[E22B20 |   6720/60000 ( 11%) ] Loss: 0.0033 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2264 top1= 95.3025


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2265 top1= 95.3225


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2278 top1= 95.2925

Train epoch 23
[E23B0  |    320/60000 (  1%) ] Loss: 0.0031 top1=100.0000
[E23B10 |   3520/60000 (  6%) ] Loss: 0.0026 top1=100.0000
[E23B20 |   6720/60000 ( 11%) ] Loss: 0.0028 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2281 top1= 95.3425


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2276 top1= 95.3425


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2299 top1= 95.3125

Train epoch 24
[E24B0  |    320/60000 (  1%) ] Loss: 0.0025 top1=100.0000
[E24B10 |   3520/60000 (  6%) ] Loss: 0.0021 top1=100.0000
[E24B20 |   6720/60000 ( 11%) ] Loss: 0.0023 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2302 top1= 95.3626


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2295 top1= 95.3826


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2317 top1= 95.3225

Train epoch 25
[E25B0  |    320/60000 (  1%) ] Loss: 0.0021 top1=100.0000
[E25B10 |   3520/60000 (  6%) ] Loss: 0.0017 top1=100.0000
[E25B20 |   6720/60000 ( 11%) ] Loss: 0.0020 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2323 top1= 95.4026


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2314 top1= 95.4427


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2337 top1= 95.3626

Train epoch 26
[E26B0  |    320/60000 (  1%) ] Loss: 0.0018 top1=100.0000
[E26B10 |   3520/60000 (  6%) ] Loss: 0.0015 top1=100.0000
[E26B20 |   6720/60000 ( 11%) ] Loss: 0.0016 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2348 top1= 95.4127


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2339 top1= 95.4627


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2362 top1= 95.3325

Train epoch 27
[E27B0  |    320/60000 (  1%) ] Loss: 0.0016 top1=100.0000
[E27B10 |   3520/60000 (  6%) ] Loss: 0.0013 top1=100.0000
[E27B20 |   6720/60000 ( 11%) ] Loss: 0.0014 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2369 top1= 95.4026


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2360 top1= 95.4527


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2383 top1= 95.3526

Train epoch 28
[E28B0  |    320/60000 (  1%) ] Loss: 0.0014 top1=100.0000
[E28B10 |   3520/60000 (  6%) ] Loss: 0.0012 top1=100.0000
[E28B20 |   6720/60000 ( 11%) ] Loss: 0.0013 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2389 top1= 95.4127


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2380 top1= 95.4728


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2402 top1= 95.3526

Train epoch 29
[E29B0  |    320/60000 (  1%) ] Loss: 0.0013 top1=100.0000
[E29B10 |   3520/60000 (  6%) ] Loss: 0.0011 top1=100.0000
[E29B20 |   6720/60000 ( 11%) ] Loss: 0.0012 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2409 top1= 95.4227


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2400 top1= 95.4728


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2421 top1= 95.3626

Train epoch 30
[E30B0  |    320/60000 (  1%) ] Loss: 0.0011 top1=100.0000
[E30B10 |   3520/60000 (  6%) ] Loss: 0.0010 top1=100.0000
[E30B20 |   6720/60000 ( 11%) ] Loss: 0.0010 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2427 top1= 95.4327


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2418 top1= 95.4627


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2439 top1= 95.3926

