
=== Start adding workers ===
=> Add worker SGDMWorker(index=0, momentum=0.9)
=> Add worker SGDMWorker(index=1, momentum=0.9)
=> Add worker SGDMWorker(index=2, momentum=0.9)
=> Add worker SGDMWorker(index=3, momentum=0.9)
=> Add worker SGDMWorker(index=4, momentum=0.9)
=> Add worker SGDMWorker(index=5, momentum=0.9)
=> Add worker SGDMWorker(index=6, momentum=0.9)
=> Add worker SGDMWorker(index=7, momentum=0.9)
=> Add worker SGDMWorker(index=8, momentum=0.9)
=> Add worker SGDMWorker(index=9, momentum=0.9)

=== Start adding graph ===
<codes.graph_utils.Dumbbell object at 0x7f47d7c9e6d0>

Train epoch 1
[E 1B0  |    320/60000 (  1%) ] Loss: 2.2959 top1= 10.0000

=== Peeking data label distribution E1B0 ===
Worker 0 has targets: tensor([4, 8, 8, 6, 9], device='cuda:0')
Worker 1 has targets: tensor([5, 3, 6, 0, 9], device='cuda:0')
Worker 2 has targets: tensor([2, 9, 9, 3, 1], device='cuda:0')
Worker 3 has targets: tensor([6, 9, 8, 1, 2], device='cuda:0')
Worker 4 has targets: tensor([5, 8, 9, 1, 8], device='cuda:0')
Worker 5 has targets: tensor([6, 7, 5, 2, 3], device='cuda:0')
Worker 6 has targets: tensor([3, 2, 8, 7, 9], device='cuda:0')
Worker 7 has targets: tensor([3, 8, 7, 8, 7], device='cuda:0')
Worker 8 has targets: tensor([8, 0, 2, 4, 8], device='cuda:0')
Worker 9 has targets: tensor([5, 3, 4, 6, 3], device='cuda:0')


[E 1B10 |   3520/60000 (  6%) ] Loss: 1.9214 top1= 46.5625
[E 1B20 |   6720/60000 ( 11%) ] Loss: 0.8657 top1= 72.1875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.4913 top1= 84.8357


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.5029 top1= 83.5036


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.5207 top1= 84.7857

Train epoch 2
[E 2B0  |    320/60000 (  1%) ] Loss: 0.6843 top1= 81.2500
[E 2B10 |   3520/60000 (  6%) ] Loss: 0.4189 top1= 86.2500
[E 2B20 |   6720/60000 ( 11%) ] Loss: 0.3872 top1= 89.0625

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2899 top1= 91.4964


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2939 top1= 91.3361


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.3057 top1= 91.0056

Train epoch 3
[E 3B0  |    320/60000 (  1%) ] Loss: 0.3524 top1= 88.7500
[E 3B10 |   3520/60000 (  6%) ] Loss: 0.2485 top1= 91.2500
[E 3B20 |   6720/60000 ( 11%) ] Loss: 0.2605 top1= 92.1875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2343 top1= 93.1190


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2343 top1= 93.0589


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2431 top1= 92.7885

Train epoch 4
[E 4B0  |    320/60000 (  1%) ] Loss: 0.2044 top1= 94.3750
[E 4B10 |   3520/60000 (  6%) ] Loss: 0.1544 top1= 95.3125
[E 4B20 |   6720/60000 ( 11%) ] Loss: 0.1654 top1= 95.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2056 top1= 93.8001


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2091 top1= 93.6298


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2107 top1= 93.8201

Train epoch 5
[E 5B0  |    320/60000 (  1%) ] Loss: 0.1318 top1= 95.3125
[E 5B10 |   3520/60000 (  6%) ] Loss: 0.1017 top1= 96.8750
[E 5B20 |   6720/60000 ( 11%) ] Loss: 0.1134 top1= 95.9375

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2009 top1= 94.0605


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1998 top1= 94.0004


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2125 top1= 93.8201

Train epoch 6
[E 6B0  |    320/60000 (  1%) ] Loss: 0.1005 top1= 97.8125
[E 6B10 |   3520/60000 (  6%) ] Loss: 0.0628 top1= 98.4375
[E 6B20 |   6720/60000 ( 11%) ] Loss: 0.0811 top1= 97.5000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1966 top1= 94.3409


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2041 top1= 94.0705


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2035 top1= 94.3209

Train epoch 7
[E 7B0  |    320/60000 (  1%) ] Loss: 0.0846 top1= 97.5000
[E 7B10 |   3520/60000 (  6%) ] Loss: 0.0366 top1= 99.3750
[E 7B20 |   6720/60000 ( 11%) ] Loss: 0.0415 top1= 98.7500

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2126 top1= 94.1406


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2211 top1= 93.8401


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2270 top1= 93.6999

Train epoch 8
[E 8B0  |    320/60000 (  1%) ] Loss: 0.0961 top1= 96.2500
[E 8B10 |   3520/60000 (  6%) ] Loss: 0.0306 top1= 99.3750
[E 8B20 |   6720/60000 ( 11%) ] Loss: 0.0375 top1= 98.7500

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1988 top1= 94.5513


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1912 top1= 94.6915


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2180 top1= 93.9303

Train epoch 9
[E 9B0  |    320/60000 (  1%) ] Loss: 0.0472 top1= 98.4375
[E 9B10 |   3520/60000 (  6%) ] Loss: 0.0373 top1= 98.7500
[E 9B20 |   6720/60000 ( 11%) ] Loss: 0.0211 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1883 top1= 94.8417


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1965 top1= 94.8518


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1949 top1= 94.8017

Train epoch 10
[E10B0  |    320/60000 (  1%) ] Loss: 0.0355 top1= 99.0625
[E10B10 |   3520/60000 (  6%) ] Loss: 0.0205 top1= 99.6875
[E10B20 |   6720/60000 ( 11%) ] Loss: 0.0290 top1= 99.3750

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1988 top1= 94.7316


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2080 top1= 94.6414


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2020 top1= 94.6815

Train epoch 11
[E11B0  |    320/60000 (  1%) ] Loss: 0.0275 top1= 99.6875
[E11B10 |   3520/60000 (  6%) ] Loss: 0.0153 top1= 99.3750
[E11B20 |   6720/60000 ( 11%) ] Loss: 0.0114 top1= 99.6875

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1829 top1= 95.4227


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1929 top1= 95.0120


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1870 top1= 95.3325

Train epoch 12
[E12B0  |    320/60000 (  1%) ] Loss: 0.0193 top1= 99.3750
[E12B10 |   3520/60000 (  6%) ] Loss: 0.0189 top1= 99.6875
[E12B20 |   6720/60000 ( 11%) ] Loss: 0.0061 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1843 top1= 95.4627


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1864 top1= 95.2925


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1921 top1= 95.2925

Train epoch 13
[E13B0  |    320/60000 (  1%) ] Loss: 0.0074 top1= 99.6875
[E13B10 |   3520/60000 (  6%) ] Loss: 0.0093 top1= 99.6875
[E13B20 |   6720/60000 ( 11%) ] Loss: 0.0048 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1943 top1= 95.5128


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.1928 top1= 95.5529


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1987 top1= 95.3425

Train epoch 14
[E14B0  |    320/60000 (  1%) ] Loss: 0.0040 top1=100.0000
[E14B10 |   3520/60000 (  6%) ] Loss: 0.0054 top1=100.0000
[E14B20 |   6720/60000 ( 11%) ] Loss: 0.0037 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2051 top1= 95.2624


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2017 top1= 95.3225


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2117 top1= 95.0821

Train epoch 15
[E15B0  |    320/60000 (  1%) ] Loss: 0.0067 top1=100.0000
[E15B10 |   3520/60000 (  6%) ] Loss: 0.0026 top1=100.0000
[E15B20 |   6720/60000 ( 11%) ] Loss: 0.0040 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2000 top1= 95.4227


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2029 top1= 95.4127


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1986 top1= 95.5028

Train epoch 16
[E16B0  |    320/60000 (  1%) ] Loss: 0.0055 top1=100.0000
[E16B10 |   3520/60000 (  6%) ] Loss: 0.0047 top1=100.0000
[E16B20 |   6720/60000 ( 11%) ] Loss: 0.0030 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.1992 top1= 95.5629


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2002 top1= 95.5329


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.1995 top1= 95.6430

Train epoch 17
[E17B0  |    320/60000 (  1%) ] Loss: 0.0021 top1=100.0000
[E17B10 |   3520/60000 (  6%) ] Loss: 0.0016 top1=100.0000
[E17B20 |   6720/60000 ( 11%) ] Loss: 0.0021 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2048 top1= 95.5829


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2020 top1= 95.6631


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2096 top1= 95.5128

Train epoch 18
[E18B0  |    320/60000 (  1%) ] Loss: 0.0019 top1=100.0000
[E18B10 |   3520/60000 (  6%) ] Loss: 0.0012 top1=100.0000
[E18B20 |   6720/60000 ( 11%) ] Loss: 0.0027 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2027 top1= 95.8333


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2017 top1= 95.8834


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2039 top1= 95.8033

Train epoch 19
[E19B0  |    320/60000 (  1%) ] Loss: 0.0011 top1=100.0000
[E19B10 |   3520/60000 (  6%) ] Loss: 0.0011 top1=100.0000
[E19B20 |   6720/60000 ( 11%) ] Loss: 0.0010 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2052 top1= 95.7332


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2045 top1= 95.8033


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2061 top1= 95.6931

Train epoch 20
[E20B0  |    320/60000 (  1%) ] Loss: 0.0009 top1=100.0000
[E20B10 |   3520/60000 (  6%) ] Loss: 0.0008 top1=100.0000
[E20B20 |   6720/60000 ( 11%) ] Loss: 0.0010 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2076 top1= 95.7833


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2069 top1= 95.7632


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2084 top1= 95.7432

Train epoch 21
[E21B0  |    320/60000 (  1%) ] Loss: 0.0008 top1=100.0000
[E21B10 |   3520/60000 (  6%) ] Loss: 0.0007 top1=100.0000
[E21B20 |   6720/60000 ( 11%) ] Loss: 0.0009 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2100 top1= 95.7833


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2093 top1= 95.7532


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2108 top1= 95.7532

Train epoch 22
[E22B0  |    320/60000 (  1%) ] Loss: 0.0007 top1=100.0000
[E22B10 |   3520/60000 (  6%) ] Loss: 0.0007 top1=100.0000
[E22B20 |   6720/60000 ( 11%) ] Loss: 0.0008 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2121 top1= 95.7632


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2115 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2129 top1= 95.7432

Train epoch 23
[E23B0  |    320/60000 (  1%) ] Loss: 0.0007 top1=100.0000
[E23B10 |   3520/60000 (  6%) ] Loss: 0.0006 top1=100.0000
[E23B20 |   6720/60000 ( 11%) ] Loss: 0.0008 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2141 top1= 95.7332


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2135 top1= 95.7131


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2149 top1= 95.7432

Train epoch 24
[E24B0  |    320/60000 (  1%) ] Loss: 0.0006 top1=100.0000
[E24B10 |   3520/60000 (  6%) ] Loss: 0.0006 top1=100.0000
[E24B20 |   6720/60000 ( 11%) ] Loss: 0.0007 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2160 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2153 top1= 95.7131


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2167 top1= 95.7332

Train epoch 25
[E25B0  |    320/60000 (  1%) ] Loss: 0.0006 top1=100.0000
[E25B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E25B20 |   6720/60000 ( 11%) ] Loss: 0.0007 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2177 top1= 95.7031


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2171 top1= 95.7131


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2183 top1= 95.7332

Train epoch 26
[E26B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E26B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E26B20 |   6720/60000 ( 11%) ] Loss: 0.0006 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2193 top1= 95.7131


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2187 top1= 95.7031


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2199 top1= 95.7432

Train epoch 27
[E27B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E27B10 |   3520/60000 (  6%) ] Loss: 0.0005 top1=100.0000
[E27B20 |   6720/60000 ( 11%) ] Loss: 0.0006 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2207 top1= 95.7232


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2201 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2213 top1= 95.7532

Train epoch 28
[E28B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E28B10 |   3520/60000 (  6%) ] Loss: 0.0004 top1=100.0000
[E28B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2222 top1= 95.7332


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2216 top1= 95.7131


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2228 top1= 95.7232

Train epoch 29
[E29B0  |    320/60000 (  1%) ] Loss: 0.0005 top1=100.0000
[E29B10 |   3520/60000 (  6%) ] Loss: 0.0004 top1=100.0000
[E29B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2235 top1= 95.7332


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2230 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2241 top1= 95.7031

Train epoch 30
[E30B0  |    320/60000 (  1%) ] Loss: 0.0004 top1=100.0000
[E30B10 |   3520/60000 (  6%) ] Loss: 0.0004 top1=100.0000
[E30B20 |   6720/60000 ( 11%) ] Loss: 0.0005 top1=100.0000

=> Averaged model (Global Average Validation Accuracy) | Eval Loss=0.2248 top1= 95.7232


=> Averaged model (Clique1 Average Validation Accuracy) | Eval Loss=0.2243 top1= 95.7232


=> Averaged model (Clique2 Average Validation Accuracy) | Eval Loss=0.2254 top1= 95.7031

