{
  "epoch": [
    1,
    2,
    3,
    4,
    5,
    6,
    7,
    8,
    9,
    10,
    11,
    12,
    13,
    14,
    15,
    16,
    17,
    18,
    19,
    20,
    21,
    22,
    23,
    24,
    25,
    26,
    27,
    28,
    29,
    30,
    31,
    32,
    33,
    34,
    35,
    36,
    37,
    38,
    39,
    40,
    41,
    42,
    43,
    44,
    45,
    46,
    47,
    48,
    49,
    50,
    51,
    52,
    53,
    54,
    55,
    56,
    57,
    58,
    59,
    60,
    61,
    62,
    63,
    64,
    65,
    66,
    67,
    68,
    69,
    70,
    71,
    72,
    73,
    74,
    75,
    76,
    77,
    78,
    79,
    80,
    81,
    82,
    83,
    84,
    85,
    86,
    87,
    88,
    89,
    90,
    91,
    92,
    93,
    94,
    95,
    96,
    97,
    98,
    99,
    100,
    101,
    102,
    103,
    104,
    105,
    106,
    107,
    108,
    109,
    110,
    111,
    112,
    113,
    114,
    115,
    116,
    117,
    118,
    119,
    120,
    121,
    122,
    123,
    124,
    125,
    126,
    127,
    128,
    129,
    130,
    131,
    132,
    133,
    134,
    135,
    136,
    137,
    138,
    139,
    140,
    141,
    142,
    143,
    144,
    145,
    146,
    147,
    148,
    149,
    150,
    151,
    152,
    153,
    154,
    155,
    156,
    157,
    158,
    159,
    160,
    161,
    162,
    163,
    164,
    165,
    166,
    167,
    168,
    169,
    170,
    171,
    172,
    173,
    174,
    175,
    176,
    177,
    178,
    179,
    180,
    181,
    182,
    183,
    184,
    185,
    186,
    187,
    188,
    189,
    190,
    191,
    192,
    193,
    194,
    195,
    196,
    197,
    198,
    199,
    200,
    201,
    202,
    203,
    204,
    205,
    206,
    207,
    208,
    209,
    210,
    211,
    212,
    213,
    214,
    215,
    216,
    217,
    218,
    219,
    220,
    221,
    222,
    223,
    224,
    225,
    226,
    227,
    228,
    229,
    230,
    231,
    232,
    233,
    234,
    235,
    236,
    237,
    238,
    239,
    240,
    241,
    242,
    243,
    244,
    245,
    246,
    247,
    248,
    249,
    250,
    251,
    252,
    253,
    254,
    255,
    256,
    257,
    258,
    259,
    260
  ],
  "train_loss": [
    2.168850806451613,
    2.0854334677419355,
    2.024445564516129,
    1.950100806451613,
    1.859375,
    1.7512600806451613,
    1.6446572580645162,
    1.5395665322580645,
    1.4357358870967742,
    1.3409778225806452,
    1.2515120967741935,
    1.1633064516129032,
    1.0773689516129032,
    1.006804435483871,
    0.9344758064516129,
    0.8605090725806451,
    0.7907006048387096,
    0.7279485887096774,
    0.6657006048387096,
    0.6111391129032258,
    0.5542464717741935,
    0.5045362903225806,
    0.4515498991935484,
    0.4039188508064516,
    0.3623991935483871,
    0.3248487903225806,
    0.2916141633064516,
    0.26052167338709675,
    0.2332094254032258,
    0.20901587701612903,
    0.1878150201612903,
    0.16724420362903225,
    0.1509734122983871,
    0.13734879032258066,
    0.12671685987903225,
    0.11797505040322581,
    0.1100680443548387,
    0.10280682963709678,
    0.09929435483870967,
    0.09409652217741936,
    0.08924521169354839,
    0.08500819052419355,
    0.08233051915322581,
    0.07952683971774194,
    0.07576234879032258,
    0.07239163306451613,
    0.0672331779233871,
    0.063232421875,
    0.06092489919354839,
    0.05920803931451613,
    0.056396484375,
    0.054443359375,
    0.05242723034274194,
    0.05188382056451613,
    0.05103326612903226,
    0.05024571572580645,
    0.05032447076612903,
    0.051001764112903226,
    0.0537109375,
    0.058349609375,
    0.0692414314516129,
    0.08168472782258064,
    0.0884734122983871,
    0.08488218245967742,
    0.07306892641129033,
    0.06287014868951613,
    0.055931829637096774,
    0.049710181451612906,
    0.04570154989919355,
    0.04290574596774194,
    0.041070753528225805,
    0.03968466481854839,
    0.03884986139112903,
    0.03840883316532258,
    0.03818831905241935,
    0.03780241935483871,
    0.03777091733870968,
    0.03788117439516129,
    0.03861359627016129,
    0.038416708669354836,
    0.03724325856854839,
    0.03658171622983871,
    0.03633757560483871,
    0.035943800403225805,
    0.03560515372983871,
    0.03554214969758065,
    0.03557365171370968,
    0.035424017137096774,
    0.035211378528225805,
    0.03539251512096774,
    0.03721175655241935,
    0.04129126764112903,
    0.04924552671370968,
    0.08145633820564516,
    0.1266538558467742,
    0.12172379032258064,
    0.0896389868951613,
    0.06501228578629033,
    0.05000157510080645,
    0.043433404737903226,
    0.04014931955645161,
    0.03847183719758065,
    0.037542527721774195,
    0.03681798135080645,
    0.03655021421370968,
    0.03599892893145161,
    0.03572328629032258,
    0.03543976814516129,
    0.035140498991935484,
    0.03483335433467742,
    0.03480185231854839,
    0.03466796875,
    0.03438445060483871,
    0.03434507308467742,
    0.03430569556451613,
    0.03450258316532258,
    0.03486485635080645,
    0.035211378528225805,
    0.03520350302419355,
    0.034691595262096774,
    0.03508537046370968,
    0.03536101310483871,
    0.035668157762096774,
    0.03626669606854839,
    0.038487588205645164,
    0.04178742439516129,
    0.05134041078629032,
    0.06696541078629033,
    0.07924332157258064,
    0.07673891129032258,
    0.06639837449596774,
    0.055160030241935484,
    0.047032510080645164,
    0.04139364919354839,
    0.038141066028225805,
    0.036392704133064516,
    0.03556577620967742,
    0.034896358366935484,
    0.03437657510080645,
    0.034156060987903226,
    0.03386466733870968,
    0.03373865927419355,
    0.03365202872983871,
    0.03348664314516129,
    0.03333700856854839,
    0.03324250252016129,
    0.03321100050403226,
    0.03310074344758065,
    0.03299836189516129,
    0.03299048639112903,
    0.032982610887096774,
    0.03285660282258065,
    0.03285660282258065,
    0.03277784778225806,
    0.03270696824596774,
    0.032628213205645164,
    0.03259671118951613,
    0.032557333669354836,
    0.03253370715725806,
    0.03249039188508065,
    0.03246676537298387,
    0.03241951234879032,
    0.03243132560483871,
    0.032415574596774195,
    0.03271878150201613,
    0.03291566910282258,
    0.03401430191532258,
    0.03892074092741935,
    0.06252362651209678,
    0.11315524193548387,
    0.11175340221774194,
    0.08040889616935484,
    0.05694776965725806,
    0.04479586693548387,
    0.03927513860887097,
    0.036865234375,
    0.03536888860887097,
    0.03462071572580645,
    0.033935546875,
    0.033541771673387094,
    0.03334488407258065,
    0.03317162298387097,
    0.03296685987903226,
    0.03291960685483871,
    0.03281722530241935,
    0.03267546622983871,
    0.03265183971774194,
    0.03254552041330645,
    0.032454952116935484,
    0.03246282762096774,
    0.03238801033266129,
    0.03226594002016129,
    0.03227775327620968,
    0.0323486328125,
    0.032198998235887094,
    0.032159620715725805,
    0.03216749621975806,
    0.03208480342741935,
    0.032037550403225805,
    0.03206117691532258,
    0.031966670866935484,
    0.03202967489919355,
    0.03200211063508065,
    0.03201392389112903,
    0.03191154233870968,
    0.0328369140625,
    0.03406549269153226,
    0.03411668346774194,
    0.03393948462701613,
    0.03357327368951613,
    0.034329322076612906,
    0.035888671875,
    0.044457220262096774,
    0.07054088961693548,
    0.08716607862903226,
    0.07768397177419355,
    0.06083039314516129,
    0.047867313508064516,
    0.040621849798387094,
    0.03722750756048387,
    0.03504599294354839,
    0.03382528981854839,
    0.03336851058467742,
    0.033116494455645164,
    0.03279359879032258,
    0.032608524445564516,
    0.032470703125,
    0.03250220514112903,
    0.03227775327620968,
    0.03223443800403226,
    0.03214386970766129,
    0.032080865675403226,
    0.03203361265120968,
    0.03199423513104839,
    0.03202967489919355,
    0.03190366683467742,
    0.031836725050403226,
    0.03186428931451613,
    0.03176978326612903,
    0.03174615675403226,
    0.03178159652217742,
    0.03170677923387097,
    0.031683152721774195,
    0.03169890372983871,
    0.031631961945564516,
    0.03162014868951613,
    0.03160833543346774,
    0.03157683341733871,
    0.031580771169354836,
    0.031517767137096774,
    0.031529580393145164,
    0.031517767137096774,
    0.03158470892137097,
    0.031631961945564516,
    0.03157683341733871,
    0.03200211063508065,
    0.03323462701612903,
    0.034266318044354836,
    0.03845608618951613,
    0.052261844758064516
  ],
  "learning_rate": [
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05,
    8e-05
  ],
  "grad_norm": [
    0.12857918796778953,
    0.10375172093987656,
    0.1529353061016207,
    0.23533531383598408,
    0.357323743450601,
    0.5072932394881289,
    0.6885610407239666,
    0.8216777422799213,
    0.9797590790996452,
    1.1523747940415048,
    1.2733610992039537,
    1.3175054998476745,
    1.43829960682509,
    1.6098300995442434,
    1.7054864152940328,
    1.6772609949970765,
    1.7951754438335497,
    1.8490047707812145,
    1.964747102221256,
    2.0734222678203778,
    2.0331227678481834,
    2.158576703535263,
    2.037905651399258,
    2.0522903494914435,
    2.0752193382799677,
    2.0623002445951792,
    1.9624735301785043,
    1.9401118174073977,
    1.940305926007318,
    1.845913701883383,
    1.7969543918645894,
    1.6493202668862876,
    1.6090122654466044,
    1.5260814896064077,
    1.5275056860237526,
    1.502749838927928,
    1.437183467191447,
    1.4345202665699335,
    1.4013777215063883,
    1.420590118239239,
    1.3736446641272764,
    1.3026816903883336,
    1.3095337808158696,
    1.3006773041489579,
    1.2359298947696142,
    1.1695601686293717,
    1.0088566493535451,
    0.9584852864640265,
    0.9172784123906347,
    0.8999112400583893,
    0.8123407390222205,
    0.7617851188435449,
    0.7459298216851208,
    0.7296083729327411,
    0.7353133327784308,
    0.7111283395718634,
    0.7677166332404634,
    0.8540033320705298,
    1.0345894783032477,
    1.272817258577277,
    1.642154964833459,
    1.9189818092208069,
    1.9906457111393143,
    1.8159641860388045,
    1.4765162886943963,
    1.200337962969704,
    0.9750817285284444,
    0.7128052288002958,
    0.5321032181785235,
    0.39090592193892776,
    0.3134676500573347,
    0.258494036636618,
    0.23653100485986042,
    0.24140163591334335,
    0.24341165210663307,
    0.2467841145080671,
    0.26242342788679435,
    0.2986921201992225,
    0.35008129442969665,
    0.31225237969680725,
    0.24486285283459266,
    0.2223768978460437,
    0.2147676823465841,
    0.19752554449925572,
    0.19436467281050887,
    0.19427201785045034,
    0.20957628651153448,
    0.19673769742598154,
    0.20124053879506684,
    0.28499641315660723,
    0.4348695393753986,
    0.7352823811168617,
    1.157776194065677,
    2.1494060153564827,
    2.554813749620186,
    2.274764733140268,
    1.710324510215203,
    1.2032531731568932,
    0.7409642740697792,
    0.48754199619567623,
    0.3476634704609284,
    0.2688304461135096,
    0.24396594808938474,
    0.23229553842557668,
    0.22145571725611998,
    0.21163839366216472,
    0.19854557156278507,
    0.19264820131703378,
    0.1868812763644587,
    0.17830506659993345,
    0.18513971177483657,
    0.17803494877259465,
    0.16452983608723148,
    0.17075147735282842,
    0.18417846581945926,
    0.21092312091464308,
    0.24414942215660582,
    0.2669411514950536,
    0.2509742858438865,
    0.22816503242626704,
    0.26757777126176424,
    0.29474921995220105,
    0.3389925931882303,
    0.43793260895080577,
    0.5682154979287757,
    0.8128219963591987,
    1.1776908607869996,
    1.5867756367050647,
    1.717691475343049,
    1.6226383416096466,
    1.328230157137506,
    1.0272513585164267,
    0.731034559317762,
    0.5289811276107532,
    0.3411889274495544,
    0.25926660057890855,
    0.21820209987277592,
    0.1709523423315586,
    0.1608493177427193,
    0.1532003477036415,
    0.14467350273048335,
    0.1432341520514158,
    0.14258963690347232,
    0.13923789391193794,
    0.1353723094654687,
    0.13313600526706842,
    0.13387481646734098,
    0.13195437794404324,
    0.13055344154323836,
    0.13190280477315666,
    0.12821400383263706,
    0.13390339762948855,
    0.13012414805461398,
    0.12912697829352066,
    0.12507640944229229,
    0.12350109873584889,
    0.12583805775393073,
    0.12194865059421041,
    0.12300593645038947,
    0.12167896335382454,
    0.1176935507940848,
    0.11716815247848508,
    0.11943368109216437,
    0.13378015055021233,
    0.1561541291500479,
    0.19433233763702196,
    0.34955223478908387,
    0.710624790198789,
    1.4943177241011907,
    2.128979115013519,
    1.8651490807008202,
    1.4048752594069494,
    0.9460499325017804,
    0.6171543402234285,
    0.42120632077873604,
    0.2993306122247478,
    0.20247165005522788,
    0.1808400097197748,
    0.14617091323651135,
    0.13360643072655468,
    0.12688770860046636,
    0.1262654214638479,
    0.12096589688049077,
    0.12080118557666507,
    0.12142929875261142,
    0.11927834839224312,
    0.11537022722476702,
    0.11358090282915091,
    0.11371155149779581,
    0.1149788098680281,
    0.11270816256419135,
    0.1132576851614555,
    0.11093688728105894,
    0.11002943898529638,
    0.10948629096447017,
    0.1089523449918571,
    0.10773118134933395,
    0.10921807368902724,
    0.10776000830100445,
    0.10654878401660374,
    0.10560902842756173,
    0.10617611170763214,
    0.1065562961180093,
    0.10545668484455434,
    0.10370797513053823,
    0.18645522538283613,
    0.22700804347721115,
    0.21071559600820414,
    0.21364106655927143,
    0.2149578028341367,
    0.2933603389583003,
    0.4611047091967173,
    0.9039007226401712,
    1.4504611677786419,
    1.573052175648438,
    1.4019684103884718,
    1.0389566949781224,
    0.7283455608880033,
    0.5052718447983007,
    0.35666731613345154,
    0.21855769628607666,
    0.16309056413849152,
    0.13282178872807454,
    0.11928786438115886,
    0.11765410300571222,
    0.11109365305784107,
    0.11512495062755171,
    0.11219512750544369,
    0.10769651686644612,
    0.10432330507347802,
    0.10395921808873983,
    0.10207917593592052,
    0.1001428046828946,
    0.10607439363002909,
    0.0999894018680204,
    0.09829335217448648,
    0.09853599544326004,
    0.09645934806221156,
    0.09590077050292647,
    0.09743106293692617,
    0.09565112330205355,
    0.09328558661965808,
    0.09407228611847718,
    0.0950655634515329,
    0.09335466838028496,
    0.0923298510872809,
    0.0938720351743834,
    0.09089007284695325,
    0.09120159862832358,
    0.08944833084509382,
    0.0918135440982345,
    0.08933347825845395,
    0.09040487602762705,
    0.0993037588761332,
    0.10108473907594287,
    0.15178084324332522,
    0.22807620823835797,
    0.3381758999152076,
    0.5815036738374635,
    1.024266689115443
  ],
  "validation": [
    {
      "train_loss": null,
      "update_counter": 0,
      "epoch": 0.0,
      "val_loss": 2.171875,
      "val_entropy": 2.12841796875,
      "val_ttr": 0.3108723958333333,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the results of the comparison of the whole timeline configuration.\n\n\\begin{figure}[H]\n\\centering\n\\includegraphics[width=0.8\\textwidth]{figures/comparison_whole_timeline_configuration_1.png}\n\\caption{Comparison of the whole timeline configuration}\n\\label{comparison_whole_timeline_configuration_1}\n\\end{figure}\n\n\\begin{figure}[H]\n\\centering\n\\includegraphics[width=0.8\\textwidth]{figures/comparison_whole_timeline_configuration_2.png}\n\\caption{Comparison of the whole timeline configuration}\n\\label{comparison_whole_timeline_configuration_2}\n\\end{figure}\n\n\\subsection{Comparison of the whole timeline configuration}\n\nIn the first figure we can see that the whole timeline configuration is the best configuration in terms of the number of nodes and the number of edges.\n\nIn the second figure we can see that the whole timeline configuration is the best configuration in terms of the number of nodes and the number of edges.\n"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " with equal time step size.\n\n\\begin{table}[h]\n\\centering\n\\begin{tabular}{c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ".\n\nThe first problem is a simple fluid-structure interaction problem. The fluid is a Newtonian fluid with a constant viscosity. The solid is a rigid body with a constant density. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the solid is incompressible. The fluid is incompressible and the"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "10 m.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization error is the difference between the actual position of the robot and the position estimated by the localization algorithm.\n\nThe localization"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "xua} it was shown that the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a solution of the original CME equation because the CME is not a solution of the original CME equation. The reason is that the CME is not a"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher2010,Kaltenbacher2011,Kaltenbacher2012,Kaltenbacher2013,Kaltenbacher2014,Kaltenbacher2015,Kaltenbacher2016,Kaltenbacher2017,Kaltenbacher2018,Kaltenbacher2019,Kaltenbacher2020,Kaltenbacher2021,Kaltenbacher2022,Kaltenbacher2023,Kaltenbacher2024,Kaltenbacher2025,Kaltenbacher2026,Kaltenbacher2027,Kaltenbacher2028,Kaltenbacher2029,Kaltenbacher2030,Kaltenbacher2031,Kaltenbacher2032,Kaltenbacher2033,Kaltenbacher2034,Kal"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the local minima of the objective function.\n    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates of the local minima of the objective function.\n    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates of the local minima of the objective function.\n    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates of the local minima of the objective function.\n    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates of the local minima of the objective function.\n    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates of the local minima of the objective function.\n    \\item \\emph{The spurious first"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of the sequence of iterates to a local minimizer.\n    \\item \\emph{No convergence in the case of non-convexity.} The convergence theory in the convex case is not applicable to the non-convex case.\n    \\item \\emph{No convergence in the case of non-differentiability.} The convergence theory in the convex case is not applicable to the non-differentiable case.\n    \\item \\emph{No convergence in the case of non-smoothness.} The convergence theory in the convex case is not applicable to the non-smooth case.\n    \\item \\emph{No convergence in the case of non-Lipschitz continuity.} The convergence theory in the convex case is not applicable to the non-Lipschitz case.\n    \\item \\emph{No convergence in the case of non-smoothness.} The convergence theory in the convex case is not applicable to the non-smooth case.\n    \\item \\emph{No convergence in the case of non-differentiability.} The convergence theory in the convex case is not applicable to the non-differentiable case.\n   "
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " the boundary of $\\Omega$ (e.g. $g_j = \\partial \\Omega$).\nI am trying to solve the following problem:\n$$\n\\begin{cases}\n\\nabla \\cdot (a(x)\\nabla u) = f(x) & \\text{in } \\Omega \\\\\nu = g_j & \\text{on } \\partial \\Omega\n\\end{cases}\n$$\nwhere $a(x)$ is a positive definite matrix.\nI am trying to solve this problem using the finite element method. I have a mesh $\\mathcal{T}_h$ and a set of basis functions $v_i$. I am trying to find a function $u_h$ such that\n$$\nu_h = \\sum_{i=1}^N \\alpha_i v_i\n$$\nwhere $\\alpha_i$ are the unknown coefficients.\nI am using the following formulation:\n$$\n\\begin{cases}\n\\int_{\\Omega} a(x) \\nabla u_h \\cdot \\nabla"
        }
      ]
    },
    {
      "train_loss": 1.8735625,
      "update_counter": 250,
      "epoch": 8.03225806451613,
      "val_loss": 2.396484375,
      "val_entropy": 1.7998046875,
      "val_ttr": 0.3798828125,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the performance of the different configurations for the whole timeline. The first figure shows the results for the case where the server caches the entire set of items, while the second figure shows the results for the case where the server caches only the items that are already requested by the users. In both cases, the results are shown for the case where the server is aware of the requests of the users (i.e., $\\lambda=1$), and the case where the server is not aware of the requests of the users (i.e., $\\lambda=0$). In the case where the server is aware of the requests of the users, the optimal configuration is the one that caches all the items, since this configuration achieves the best performance in the system. In the case where the server is not aware of the requests of the users, the optimal configuration is the one that caches only the items that are already requested by the users, since this configuration achieves the best performance in the system. In both cases, the optimal configuration is the one that caches all the items, since this configuration achieves the best performance in the system. In"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the problem \\eqref{problem_fluid} with $u_0$ from~\\eqref{initial_data_u} and $f$ from~\\eqref{f_fluid}. The first line of the table shows the error of the a posteriori estimator for the solution to \\eqref{problem_fluid} with the right-hand side $f$ from~\\eqref{f_fluid}. The second line shows the error of the a posteriori estimator for the solution to \\eqref{problem_fluid} with the right-hand side $f$ from~\\eqref{f_fluid} and the source term $s$ from~\\eqref{s_fluid}. The third line shows the error of the a posteriori estimator for the solution to \\eqref{problem_fluid} with the right-hand side $f$ from~\\eqref{f_fluid} and the source term $s$ from~\\eqref{s_fluid} and the boundary conditions $u = 0$ on $\\Gamma_D$. The fourth line shows the error of the a posteriori estimator for the solution to \\"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The main challenge is to ensure that the time-step sizes in the two problems are compatible. In the following, we describe how this is achieved. The overall algorithm is summarized in Algorithm \\ref{alg:overall}. The algorithm is initialized by calling the \\texttt{Init} function of the \\texttt{MultiRateProblem} class. The \\texttt{Init} function initializes the problem data and the time-stepping schemes for the two problems. The time-step sizes for the two problems are initialized to the same value. The \\texttt{Solve} function is called repeatedly. In each iteration of the \\texttt{Solve} function, the time-step sizes for the two problems are updated. The time-step sizes for the two problems are adapted to be compatible. The time-step sizes for the two problems are then used to solve the two problems. The \\texttt{Solve} function is called until the stopping condition is met. The stopping condition is met when the time-step sizes for the two problems are compatible and the maximum number of iterations is reached. The \\"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "10\\% in $2.4$ GHz, they can be reduced to 1.5\\% in $5$ GHz. This is because the $5$ GHz band has a higher line-of-sight path loss than $2.4$ GHz, which makes it more sensitive to the distance to the transmitter. The results in Fig. \\ref{fig:loc_errors} also show that the localization errors are higher in the first few frames, which is due to the fact that the received signal strength is not stable yet. The localization errors stabilize after a few frames and the errors are very low in the later frames. This is because the received signal strength is stable and the received signal power is proportional to the distance to the transmitter. The results also show that the localization errors are higher in the first floor than in the second floor. This is because the first floor has a higher number of walls and obstacles than the second floor. The results also show that the localization errors are higher in the bedrooms than in the other rooms. This is because the bedrooms have a higher number of walls and obstacles than"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": ",Valgushev:2016}, it is shown that the existence of the equilibrium of the system (1) is equivalent to the existence of the equilibrium of the system (2) with the potential (3) for any $n\\in\\mathbb{N}$. In the latter case, the equilibrium is defined by the condition $\\frac{\\partial V}{\\partial \\varphi_i}=0$ and $\\frac{\\partial^2 V}{\\partial \\varphi_i \\partial \\varphi_j}=0$ for all $i,j\\in\\{1,\\ldots,n\\}$. In the former case, the equilibrium is defined by the condition $\\frac{\\partial V}{\\partial \\varphi_i}=0$ and $\\frac{\\partial^2 V}{\\partial \\varphi_i \\partial \\varphi_j}+\\frac{\\partial^3 V}{\\partial \\varphi_i \\partial \\varphi_j \\partial \\varphi_k}=0$ for all $i,j,k\\in\\{1,\\ldots,n\\}$. In the latter case, the potential (3) is defined by"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher:2010:JMM:1834013.1834020}), the study of the stability of the set of solutions to \\eqref{eq:system} is still a very recent development. The first results in this direction were obtained in \\cite{Kaltenbacher:2010:JMM:1834013.1834020,Kaltenbacher:2011:JMM:1834013.1834021}. In \\cite{Kaltenbacher:2010:JMM:1834013.1834020} the authors study the stability of the set of solutions to \\eqref{eq:system} in the case of a single space dimension $d=1$. In this case, the set of solutions is a smooth submanifold of the space of all $(p,q)$ tuples. In \\cite{Kaltenbacher:2011:"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient. The Landweber iteration is a special case of the hybrid method \\cite{10.1016/S0021-9995(00)00060-8}, and the hybrid method is a special case of the hybrid projection method \\cite{10.1016/S0021-9995(00)00060-8}. The hybrid method is a generalization of the Landweber iteration, and the hybrid projection method is a generalization of the hybrid method. The hybrid method and the hybrid projection method are both iterative methods for solving the projection problem. The hybrid method is a special case of the hybrid projection method, and the Landweber iteration is a special case of the hybrid method. The Landweber iteration is a special case of the hybrid projection method, and the hybrid method is a special case of the Landweber iteration. The Landweber iteration is a special case of the hybrid projection method, and the hybrid method is a special case of the Land"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of the sequence of solutions to a certain accuracy, which is independent of the initial data.  This is in contrast to the linear case, where the convergence is global.  The reason for this is that the nonlinear term in the Navier-Stokes equations is not a continuous function, and thus the nonlinear term in the weak formulation is not a continuous function either.  This means that the nonlinear term cannot be integrated against a test function, and thus the nonlinear term cannot be used in the weak formulation.  This is the main reason why the nonlinear case is more difficult than the linear case.  However, the local convergence theory in the nonlinear case is still very powerful.  In the current paper, we show that the convergence rate in the nonlinear case is the same as the convergence rate in the linear case.  This is a very important result, because it means that the accuracy of the numerical solution is independent of the initial data, and thus the numerical solution is independent of the initial data.  This is a very important result, because it means that the numerical solution is independent of the initial data.  This"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " the inter-particle forces. The system is coupled to a boundary value problem (BVP) for the corresponding boundary conditions, which are given by $u=0$ on $\\partial \\Omega$. The system is subject to a time-dependent forcing term $f(t)$, which is usually chosen to be a time-dependent external field. The system is driven by an external time-dependent control input $u(t)$ which is usually chosen to be a time-dependent temperature. The system is subject to a time-dependent energy constraint $E(t)$ which is usually chosen to be a time-dependent total mass. The system is driven by an external time-dependent control input $u(t)$ which is usually chosen to be a time-dependent temperature. The system is subject to a time-dependent energy constraint $E(t)$ which is usually chosen to be a time-dependent total mass. The system is subject to a time-dependent energy constraint $E(t)$ which is usually chosen to be a time-dependent total mass. The system is subject to a time-dependent energy"
        }
      ]
    },
    {
      "train_loss": 1.1266875,
      "update_counter": 500,
      "epoch": 16.096774193548388,
      "val_loss": 3.1064453125,
      "val_entropy": 1.49462890625,
      "val_ttr": 0.51171875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the comparison of the prediction errors for the different configurations for a sunny and an overcast day of the {\\emph{PSP}} first orbit. The results for the {\\emph{PSP}} Time-tagged flux estimates are important insights for studying the performance of the different algorithms. The results for the  ``good\" and ``both\" configurations are almost identical, showing that the difference in the number of the estimated weekly cycles (see Table~\\ref{table:configuration_1_and_2}) does not have a significant impact on the estimation errors. The only exception is shown in Figure~\\ref{comparison_whole_timeline_configuration_2} for the Weekly average while the Weekly high and Weekly low results are almost identical. The results for the  ``good\" and ``conservative\" configurations are similar for sunny and overcast days. The conservative estimates tend to underestimate the actual weekly cycle duration. This is shown in the Weekly high and Weekly low results for both sunny and overcast days. The results for the ``good\" and ``conservative\" configurations are similar for the Time-tagged flux estimates"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the problem with a semi-smooth right-hand side. We first note that the error in the maximum of the a posteriori residual is much larger than the error in the minimum of the a posteriori residual. This is due to the fact that the solution is not very large in the domain where $\\nu=1$ and therefore the maximum of the solution is not very large. We also note that the error in the maximum of the a posteriori residual is still small in comparison to the right-hand side. This shows that the error in the solution is still small even if we use the semi-smooth right-hand side. This is due to the fact that the discrete solution is very smooth and therefore the error in the discrete solution dominates the error in the right-hand side. We also observe that the error in the maximum of the a posteriori residual is independent of the mesh size $h$ and only depends on the time step $\\tau$. This is due to the fact that the solution is constant in time and therefore the error in the maximum of the a posteriori residual only depends on the"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall solution of the system of coupled problems is obtained by a time-stepping of the most rapidly evolving subproblem, i.e. the fluid. The time-step for the solid is obtained by a time-step adaptation method of order 2 \\cite{Order2Adaptation}, which is shown to be robust against possible inexact preconditioners. The preconditioner for the solid is a direct solver based on the discontinuous Galerkin method. The time-step for the fluid is obtained by a time-step adaptation method of order 1, which is based on the solution of the rapidly evolving subproblem. The preconditioner for the fluid is a multilevel preconditioner based on a hierarchy of multirate PISO/PReCON subproblems. The overall solution of the system of coupled problems is obtained by a multirate partitioned \\textit{time-splitting} algorithm. The partitioning is based on the use of separate subproblems for the fluid and solid, and the adaptation of the time-step for the solid is performed without knowledge of the solution of the solid"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "10\\% down to 3\\% for the first $1000$ frames. The average channel power in the $2.4$ GHz band is $19.5$ dB, and the results in Fig. \\ref{fig:coml} show that the proposed method can reduce the mean power by $1.5$ dB. The maximum mean power reduction is $3.5$ dB. The maximum and minimum power difference is $1.5$ dB. The maximum and minimum relative error in the number of floors is $10\\%$ and $3\\%$, respectively. The maximum and minimum relative error in the relative position of the vehicles is $10\\%$ and $3\\%$, respectively. The maximum and minimum relative error in the relative velocity of the vehicles is $10\\%$ and $5\\%$, respectively. The maximum and minimum relative error in the relative position and velocity of the vehicles is $10\\%$ and $8\\%$, respectively. The maximum and minimum relative error in the relative position and velocity of the vehicles in $2.4$ GHz is $10\\%$ and"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "atm}, the non-existence of the original equilibrium of a CME is explained by the non-conservative nature of the transport process of the CME. A CME moves outward from the Sun driven by the strong solar wind velocities but the speed of the CME is not sufficient to overtake the acceleration region and hence the acceleration region remains stuck behind the HCS. This explains why the left-hand side of the CME (the side facing the Sun) crosses the HCS at a speed lower than the solar wind speed: the left-hand side of the CME is slowed down by the HCS. The same explanation applies to the right-hand side of the CME since the right-hand side is accelerated by the wind flowing from the acceleration region into the CME deflector. Thus, the right-hand side of the CME crosses the HCS at a speed greater than the solar wind speed and the original equilibrium of the CME would be unstable. This non-conservative nature of the transport process of the CME is also responsible for the disappearance of the original equilibrium of a CME in the solar system's outer regions \\citep"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher:2012,Kilburn:2013,Buoninfante:2014} and references therein), the problem of determining the ground states of such constraints remains in general very difficult. In this paper we will consider the tangential cone condition \\eqref{tangentialconecondition} at the critical point, and show that the relevant scalar product is the $\\mathcal{E}$-product. We will show that the critical point $(\\ref{tangentialconecondition})$ is a critical point of the $\\mathcal{E}$-product and discuss how we can obtain the correct zero-order correction by using the trace formula. However, we will see that the $\\mathcal{E}$-product does not satisfy the necessary decrease property in the energy functional \\eqref{energyfunctional}. We will show that the decrease property is satisfied at the correct order of the trace formula, but that higher order terms are needed to obtain the correct energy functional. We will give the required terms for the energy functional at order $s^2$ and $s^4$ and conjecture that they hold at any order"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the function value. The Landweber iteration is a heuristic and it is designed to find the global minimum of the energy. However, it can generate a spurious local minimum that is not the global minimum. This happens because the initialization of the iteration is not guaranteed to produce the global minimum. In the two-dimensional case, the Landweber iteration can generate a spurious local minimum that is not the global minimum. This happens because the initialization of the iteration is not guaranteed to produce the global minimum. However, we show that the iteration can still find the second minimum and reach a point that is better than the second minimum. We illustrate this phenomenon in the 2D setting. The initial minimum is shown in Figure \\ref{fig:firstlocalminimum}. The Landweber iteration generates a spurious local minimum at $1.0\\times 10^{-3}$, which is not the global minimum. The Landweber iteration generates a second minimum at $1.0\\times 10^{-6}$, which is closer to the global minimum than the first minimum. The Landweber iteration generates a"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in a neighborhood of a given critical point, which is naturally a local information. The critical point here is the solution of the linear problem at stage $T$, which is a priori given. We first show that the solution of the linear problem is a good initial guess for the nonlinear problem. Then, we prove that the nonlinear problem with this initial guess converges locally to the solution of the nonlinear problem at the later stages. The linear problem is solved by the Newton-Raphson method, which is a second order method. In the nonlinear problem, we use the generalized Newton-Raphson method, which is a second order method in the sense of Chapter 5 of \\cite{epp10001}. The generalized Newton-Raphson method is a composite method of the Newton-Raphson method and the extrapolation method. The extrapolation method is a first order method. The advantage of the generalized Newton-Raphson method is that it can use the linear information given by the linear problem to avoid the extrapolation step. The linear problem is solved by the GKB method, which is a"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " the rough edges of the data. In particular, for $i=1,2$, $g_i$ is a $C^{\\ell-1}$ rough potential defined as the composition of a sequence of translations, multiplications by $j_k\\in \\mathbb{S}^{d-1}$, and mollifications with $\\mathbb{B}_{\\frac{1}{2}}$ (the unit ball in $\\mathbb{R}^{d}$) as the center. The precise definition of $g_i$ is given in Section \\ref{section-models} and we refer to \\cite{FR-LS} for a detailed study of this model. The data $f$ is assumed to be a solution of the fractional diffusion equation \\eqref{eq: fractional diffusion equation} with initial data $f_0$ which is a $C^{\\ell}$ function. The fractional diffusion operator $\\mathcal{A}$ is defined in \\eqref{eq: fractional diffusion operator} and the fractional Laplacian $\\mathcal{L}^s_N$ is defined in \\eqref{eq: fractional Laplacian}. The space $V"
        }
      ]
    },
    {
      "train_loss": 0.579328125,
      "update_counter": 750,
      "epoch": 24.161290322580644,
      "val_loss": 4.1845703125,
      "val_entropy": 1.086181640625,
      "val_ttr": 0.5999348958333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the time series of the obtained statistics for the two configurations. We provide the statistics for the main hand (right-handed in case of right-handed chirality and left-handed in case of left-handed chirality) and for all fingers. The detected events are recorded by the dedicated experimentalists at an experimental facility (e.g.~\\textit{iTEM} at DESY in Hamburg or a TEM-LV at a synchrotron) and are provided in the form of videos. In order to process the recorded events we propose a simple pipeline consisting of two steps. In the first step, we extract the selected region of the recorded movie with the help of the {\\emph{in~situ}} phase contrast or {\\emph{in~situ}} dark field tool. The diameter of the considered region depends on the particle size and varies for different centers. The time series of the particle counts is then obtained using a sliding window of width $2\\times\\mathrm{N}$ with $\\mathrm{N}\\approx$10-20.000 frames. Here, $2\\times\\"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple scale invariant problem of homogenous Dirichlet boundary conditions from Section~2.3. The large spectrum in the last row results in a flat region in the probabilistic a posteriori error estimator's output, as a posteriori information is only provided at the solution's nodes. As expected, the a priori error estimator works best when the error is smallest, i.e. on the coarser $I_k$ meshes in the first two rows. However, on the finest $I_{k+1}$ meshes, its output is still about $10\\%$ times too small. This is due to the fact that we solve for $H_0$ in order to obtain $H_0 + \\mu$ from the \\texttt{fineMesh} procedure, but the estimator only knows about $H_0$ finite-element approximation. This is also seen in the probabilistic a posteriori error estimator's output, as it too is about $10\\%$ times too small. We note, however, that it provides an improvement over the a priori estimator when the mesh $I_{"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The introduction of a third time scale, for communication, is motivated by the following argument. In the serial versions, the computational cost of a multirate time-stepping scheme is dominated by the communication cost of the step, which is due to the large amount of data exchanged during the solution of the problems that are solved in parallel. In order to handle this problem, a more efficient approach is to split the communication into several messages of smaller size that are exchanged at each time step. However, in order to avoid having to perform a very time-consuming initialization step to set the different subproblems and to track the state of the state-space, a fixed partition of the state is used. This, however, implies that the corresponding time step will not be adapted to the associated problems. Thus, what is needed is a fast and simple way to send messages between the subproblems such that the different time scales can be handled separately. In order to do that, a simple strategy is to reserve one step for communication, and to reserve a specific step for the solid problem and a different one for the fluid problem"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "10\\% on the second floor. The purple solid and dashed curves in the second column of \\ref{fig: fig1} show the correlation between a user's estimated position and the requested location. The correlation is used to determine the correlation time for that location, and to estimate the correlation time-based COMPASS information. The estimated correlation time is used to estimate the local RSU's capacity and mobility characteristics. The estimated correlation time for a user (a.k.a., Comander) located on the first and second floor of the two storey building are $1.08$s and $1.23$s, respectively. The results show that the proposed method not only improves the average error rate on the two different floors from $40\\%$ to $10\\%$, but also provides more accurate estimation of the correlation time for the corresponding RSU. The resulting mean capacity and mean mobility characteristics are shown in the last row of \\ref{fig: fig1}. The requested rate distance images in Figure \\ref{fig: fig1} show that the requested rate varies"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "atn} the magnetic field structure observed during the CME event (see Fig.~\\ref{fig:cmeclearly}) was analyzed and it was shown that at the time of the eruption the prominence structure has moved towards the west at approximately $5.5\\times10^{-3}$~AU~per~s. This waviness, combined with the observation that the CME has a low speed (around $6.7\\times10^{-3}$~AU~per~s)  leads to the conclusion that the prominence structure is extended and the CME is not a single structure capable to completely reproduce the observed geometry. Instead, a multi-ring structure capable to resolve the prominent ends (cells) is required. Looking at observations, we realize that the majority of observed CMEs (especially those which are not self-interned) are compatible with such a scenario. In \\cite{Valgushev:2015atn}, the multi-ring model was proposed and a simple  model able to reproduce the observed waviness was proposed. In this model, each ring is"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKrawcewicz, NO:39} and the discussion below), the lack of a sharp description of the bundle $F$ in terms of the metric on $N$ (except for a few special cases) hamper the use of this condition as a tool for proving existence results for quasi-Einstein metrics on $N$. In order to overcome this problem, Yau proposed the idea to drop the requirement that the Riemannian metric on $N$ be the canonical metric of the bundle. This is the idea behind \\cite{Miao2} where a fixed circle bundle $\\mathcal{C} \\in \\Omega^2(E^* \\otimes E)$ is allowed to accompany the quasi-Einstein metric on $N$. Then existence and existence with normalizability for quasi-Einstein metrics on $N$ are obtained by including the allowed bundle $\\mathcal{C} \\in \\Omega^2(E^* \\otimes E)$ into the energy functional, and by using Yau's theorem on existence of minimal hypersurfaces in Euclidean spaces \\cite{Miao1, Miao2}. This"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the target function. In our example, the Landweber iteration does a double break of the local bifurcation, which appears due to inefficient approximations of the target function. This is due to the spurious first local minimum of the Landweber iteration. In this local minimum, the domain of the target function is $[-1,0]$, thus the local bifurcation is hidden and the algorithm converges to the first correct local minimum. However, this minimum is far from the boundary. In order to enforce the rule to explore the space, the algorithm needs to pass through the boundary. Therefore, the algorithm explores the space and the correct bifurcation is re-discovered. In this way, the Landweber iteration not only explore the space, but also avoid the local bifurcation. This is in contrast to the unrolling algorithm which only explore the space at the cost of computing more rules. The Landweber iteration also break the bifurcation at the boundary. However, the target function has a discontinuous in the point $d<1/4$. The Landweber iteration does not capture such a behavior and the target function is continuous at the"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of the solution curve in a neighborhood of a given solution curve. The method is completely general and can be applied to all solution curves, including the original solution, established in Theorem \\ref{thm:NLP_0_0} (note that the nonlinear solution advected by $\\B@), \\boldsymbol{\\beta} = 0$ is one of them), at any given time $t$ or at any given time instant. We emphasize that the estimated solution, which is a solution in a material point approximation, may not be the same as the estimated solution established in this work. Given the fact that the solution constructed in this paper is again established by using the least square method on the measured quantities, we believe that the solution constructed in this and in the established only by using measured quantities we believe that the solution constructed in this and in the established in this work are both approximate but not necessarily equal. We believe that further improvement of the solution will come from improving the estimated measured quantities and further developing the solution manifold.  Since the established convergence theory is a local one, one can apply it locally to"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a harmful environmental agent for all $j=1,\\ldots, 5$. In environmental risk assessment, the term ``toxicity ``' is used to denote the tendency of the agent $j$ to cause harmful effects in the body, and is modeled by the function $h_j(\\cdot)$ in Eq.~\\eqref{Eq: Only Hazard Dose }. It could be either a well-defined set of diagnostic indicators (also called ``toxic biomarkers ``') or a set of clinical indicators (such as high blood pressure or kidney damage) that indicate the presence of the adverse effect of agent $j$. The function $h_j(\\cdot)$ is typically assumed to be linear or non-linear of the exposure to the agent $j$, i.e., the dose-response relationship is given by $h_j(x_j)=d_j x_j$ or $h_j(x_j)=d_j \\varphi\\left(x_j\\right)$ with $d_j$ denoting the minimal risk (or ``no-risk``"
        }
      ]
    },
    {
      "train_loss": 0.249173828125,
      "update_counter": 1000,
      "epoch": 32.225806451612904,
      "val_loss": 4.845703125,
      "val_entropy": 0.91015625,
      "val_ttr": 0.6070963541666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for each line, the timeline of the administrator's (i.e., the collector) configuration values versus time, for the first and the second configuration, respectively. The curves for the first configuration are colored with the set of variables that are used to build the time-correlation graphs in Sections~\\ref{fig:execution_time_while_querying} and \\ref{fig:execution_time_while_propagating_results}, while the second configuration's curves are the actual values that are used to build those graphs. Notice that in both cases the first three hours are silent in terms of messages, due to the fact that the system is initialized and the first messages are sent at the beginning of that period. After that, the remaining of the first day is quiet as well, for both configurations, due to the registration rules that we mentioned above. The effect of the \\textit{Partitioner} component is to read the incoming messages that inform about the value of a particular node and randomly split them into submessages. The splitting is done between submessages according to the number of partitions that"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple linear problem $\\Omega=L^{-1}\\Omega$ with $L=\\text{id}$ size $32$ and $64$ and the exact solution $\\Omega=u\\Omega$ with $u=e^{j\\pi x_jx_j/2000h^2}$ near $x_j$. In this case the a posteriori estimator works almost perfectly, especially on the refined meshes. The log-log plot of the residuals in Table~\\ref{fluid_residuals_non_uniform_equal} shows that the error estimator works almost identically on all the time meshes and converges to the constant $10^{-4}$ around $T=5/4$ on the uniform grid. We note that we could not run the simulation with $N_T=N_\\mathrm{mesh}$ because the numerical solution grew too large. In this case, the estimator works well but develops a large residual. We observe that the error estimator is nearly optimal in this simple test. However, we expect that a refined time mesh near the transition layers would improve"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall time step $\\tau$ is given by $\\tau=\\min \\left\\{ \\frac{\\tau_{f}}{N_{t_{f}}} , \\frac{\\tau_{s}}{N_{t_{s}}} \\right\\}$, with $N_{t_{f}}=30$ and $N_{t_{s}}=100$ as illustrative values. The overall objective is to avoid an unacceptably slow solution by employing a large overall time step. The overall idea is to work towards a ``still'' boat with gradually applied forces. The formulation of such a sequence of ``suitable steps'' to reach the steady state is complicated by the uncertainty of the problem due to the open-loop nature and the possible incoherence of the forcing. In both problems, the initial time step is $\\tau_0 = 1$ s. 100 time steps are considered by simulating a time step sequence of length $\\tau_0, \\frac{\\tau_0}{2},...,\\tau_{max}$ with a maximum overall time step $\\tau_{max} = 5$ s"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "2\\% on the ground floor of the building in $2.4$ GHz, while the floor error on the 3$^{rd}$ floor is about $10\\%$ in $1.3$ GHz, which implies that scanning breaks the consistent correlation between the images and the building model in $1.3$ GHz due to the high-quality insulation of the building. In $4.8$ GHz, the floor resolution is very high and the floor size is very small, therefore the floor error is around $15\\%$ and the error between the ground floor and the estimated ground level is about $5\\%$. The results in Figure \\ref{fig: evaluation} show the correlation of the estimation and the real height as the building height increases from $0$ meters to about $40$ meters. The correlation for all bands is about $3.5\\times 10^{-3}$ to $4.5\\times 10^{-3}$ per meter of building height, while the correlation between the real height and the Coronavirus mortality data has only the order of magnitude. As"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "Feb15} it is shown (see Fig.~\\ref{fig:iccmecmis}) that if the ACP position coincides with a transport cycle (i.e., $RT=8$), a the core of the envelope is located at $RT=10$, and a wing at $RT=6-8$). The CME crossing category, hence, is $E$, with a speed $S=4\\cdot\\frac{8}{10}-1=3.2$, which corresponds to a $T=2.4$ and $RT_{corr}=9.2$. Since the dynamic pressure outside (where the CME is traveling) follows the solar wind variability ($P_{dyn}=P_{90\\%}=P_{dynamic}$, $P_{99\\%}=P_{envelope}$, $P_{91\\%}=P_{conf}$). One finds that the contact discontinuity facing the solar wind has a velocity $v_{ld}$+$V_{ps}=S+RT_{corr}=3.2+9.2=12.4$~km~s$^{-1}$"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,Jockusch:10} and the discussion below), in this paper we will focus on the case in which the coLegendrian submanifold $\\Lambda$ is a (2-sphere) embedded in a symplectic manifold $(M^{2n}, \\omega)$, where the contact 1-form \\(\\alpha_{\\Lambda}\\) is given by $\\alpha_{\\Lambda}(x)~=~\\lambda \\theta(\\tau)$ for a real function $\\tau(x) \\in M$ such that $\\theta$ is the function of a particular vector field on $M$ (the Hamilton vector field of a particular Liouville metric on $\\mathbb{R}{}^{2n}$) and $\\lambda}$~is determined by $\\Lambda$. Here, we refer to \\(\\alpha_{\\Lambda}\\) as the _tangentorial_ contact 1-form of $\\Lambda$; the term \"tangential\" refers to the fact that this contact 1-form is defined directly from the symplectic metric, rather than by a Levi-Civita connection of some compatible connection on the symplectic manifold. We will first give a purely algebraic construction of"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the functions main parameters. For Landweber iteration this efficient parameter is the gradient, while for the GKB iteration it is the Hessian matrix, which we pointed out to be purely non-degenerate in our example. In this case, the GKB iteration has a unique local minimum while the effective parameters for the Landweber iteration come from a non-degenerate gradient that has two local minimum. The difference of the effective parameters for Landweber iteration produces a path that connects two different local minima. This phenomenon was pointed out in \\cite{LIU}, but the fact that the iterative method jumps between local minimums is not a problem as long as they find the current minimum that contains the current solution seed. We generate the iteration method starting from the correct solution at $k=1$ with stepsize $\\tau_k=\\frac{1}{4}(1-6\\epsilon)$ and find it is finds the first local minimum at $k=6$. However, this local minimum is produced by jumping to the second local minimum that corresponds to the initial effective parameters for Landweber iteration. The result"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each direction independently. It is straightforward to have a look at the linear case (the linear problem (1.1)\u2013(1.2) with the Banach space interpolation conditions), the iterative scheme converges uniformly in each direction. Thus the linear case legitimates the linear convergence in the nonlinear case by exploiting the discrete Banach space-norm similarity. In the nonlinear case, there are nonlinear terms in the iteration equation. The linear terms cancel against the nonlinear terms, thus the exact rate of convergence of the underlying linear subproblem is lost in the iterative scheme. However, as we show in a short appendix later, this does not matter in practice. Indeed, due to the normality of the convergence, the rate of local convergence of the solution does not depend on the exact rate of the underlying solution of the linear subproblem. In other words, the linear case does not influence the local nonlinear convergence in practice, and thus the local convergence rate in the nonlinear case is indeed only local with respect to the solution of the linear subproblem. \\emph{Thus the local convergence rate of the iterative scheme under consideration is a"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a harmful environmental agent for all $j\\in \\set{1, \\ldots, \\ell}$. Note that $\\D_\\Omega$ denotes a smooth version of the domain boundary $\\partial \\Omega$. We assume that there are $k >0$ healing forces on the boundary of the domain, namely $D_{j}=d_j\\mathbb{I}_{\\D_\\Omega}$, for $j=1,\\ldots,k$. Here, $\\mathbb{I}_{\\D_\\Omega}$ is the characteristic function on $\\D_\\Omega$, and we set $d_j>0$ in Section \\ref{sec:MathematicalSetUp}. The environment includes not only the harmful agent $g_j$'s but also a large number of beneficial environmental agents that form $f_j\\in \\mathcal{B}_{\\Omega,U}$, for $j\\in \\set{1,\\ldots,K}$. Note that, $U$ is a compact subset of $\\R^d$ denoting a space of beneficial environmental agents. We define the set of safe states as $S_q=\\bigcap_{"
        }
      ]
    },
    {
      "train_loss": 0.115556640625,
      "update_counter": 1250,
      "epoch": 40.29032258064516,
      "val_loss": 5.3515625,
      "val_entropy": 0.85205078125,
      "val_ttr": 0.5973307291666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the comparison of the timelines in total energy measure for all particles with different selections (see Section~\\ref{sec:configuration_choices} for the details) for the two configurations discussed in Section~\\ref{sec:configuration_choices}. We choose the run with a diameter $8$ (configuration a) as an example for the central stopping condition and for the frequency of $10$ frames per second. The selection for the figure is $80$ particles such that both the simulation data and the reconstructions using the data from the colleague have the same size. The figure shows four reconstruction curves for the two models $M_{1}$ and $M_{2}$ such that the horizontal black lines of the individual reconstructions are the mean values of the original data $F_1^{(sim)}(t)$ and $F_2^{(sim)}(t)$. The reconstructions using data from colleague haven't been performed including different colleague selections since the simulation data has a fixed size such that the colleague selections have different sizes. Therefore, we use these reconstructions for comparison and obtain advantage and disadvantage of the two models"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple one-dimensional problem $\\frac{du_i}{dt}=u_i(t)+u_{i-1}(t)u_i(t)u_{i-2}(t)u_i(t)u_{i-3}(t)u_i(t)u_{i-4}(t)u_i(t)u_i(t)u_i(t)$ for $i\\in\\{1,...,20\\}$. The time step is $\\tau=1$. The nonlinear function $f_i(u_i) = u_i(u_i-1)u_i(2u_i-1)u_i(3u_i-2)u_i(4u_i-3)u_i(5u_i-4)u_i(6u_i-5)u_i(7u_i-6)u_i(8u_i-7)u_i(9u_i-10)$ is used"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is relatively smooth and propagated mainly using the lowest order scheme of third order (3rd O) while the solid problems are more erratic and alternating between using the higher order (5th and 6th O) schemes for rigid body motion and friction related problems respectively as shown in Fig.~\\ref{fig:Multirate}. For problems where all the time evolutions are required as the contact setup is observed (e.g. kicking behavior in soccer), a consolidated scheme of third and fifth order (3rd and 5th O) is used. This scheme is a direct descendant of the classic centralized multirate schemes like Pace and is used only for fluid coupled to rigid body motion. Otherwise, each rate level is considered as independent and there is no mixed blending of levels during the computation time. The upper bound constraints for the step sizes are derived by using the convergence analysis presented in \\cite{BrovelliArXiv}. Although the convergence is of order $n\\pm 2$, where $n$ is the order of the applied time step, the actual step sizes are around "
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the B/V-bands and around 6\\% on the V/Q-bands on the $244$ MHz resolution, and around $15\\%$ on the $244$ MHz bands and $10\\%$ on the $57/92$ MHz on the testing time. On the $244/57/92$ MHz bands, the differential delay \\cite{BAM03} between the $15$ direction finding antennas as $25\\%$, $30\\%$, and $35\\%$, respectively. Figure \\ref{fig:difference} shows the bias between the maximum delay values of thea power spectra of the received signals at $f_1$ and $f_2$ bands. For different bands, it is observed that the bias varies between $1$ dB and $8$ dB. The average bias is $4.0\\%$, $3.4\\%$, and $6.0\\%$ for $15$, $25\\%$, and $30\\%$, respectively. The bias on the $244/"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "April25}, the authors propose an alternative explanation for the observed structure of the CME-cirrus pair and argue that such structures are generated by simultaneous events, one being the SBO-CME and the other a delayed subsequent SBO-CME, where the second occurs because of an interplanetary delay-dynamical mechanism well outside the influence zone of Mars. Such an elaborate an explanation is required because the original one -- the non-existence of the same delayed CME outwards -- is simply not viable. To force the observations to match the hypothetical ``double event'', one has to ignore a large number of physical processes including the interaction of the two ``single'' CME neighbors, the interaction of the second with the heliosphere, which is not able to be justified by any the present model of\u8fdf\u53d1\u673a\u5236. Furthermore, such an explanation requires high there is a very high occurrence rate of delayed SEP events, which may not be true -- at least for small angle events -- as we show below.\\\\  Another explanation for the simultaneous occurrence is pointed out in \\cite{2017ApJ...8"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKrawcewicz,BO,MK} and the discussion below), it is not the most natural condition at higher-order. The natural order $3n$ condition to be used by some physicists is the so-called ``dual cone condition\", which requires that the $(n-1)$-jet representation of a solution of the $R$-equation of \\eqref{modifiedregulatoryequation} has radius of $r$ larger than the dual of $r$, i.e. the inverse of the coefficient $r$ is smaller than the coefficient $r-3$ (all multiplied by some common factor $\\mu^d$). This is actually equivalent to the classical cone condition in terms of the radial function (see \\cite[Equation (1.3)]{KaltenbacherKrawcewicz}), but the reason for introducing the $3\\to n$ dual cone condition was to make the $R$-equation work \"better\" in more spaces, for instance the room of power expansions of solutions \\cite{Hansen}. For our purpose it does not matter which cone condition is"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the function value. For Landweber iteration, it depends on the choice of $h>0$ fixed at each iteration. For the local minimum $(\\boldsymbol{x}_{k},\\boldsymbol{f}_{k})$ obtained at the $k$-th iteration, the best estimate for the function value of the global minimum is given by $h\\cdot(\\boldsymbol{x}-\\boldsymbol{x}^{*}$)$+\\boldsymbol{f}_k$, where $\\boldsymbol{x}$ is any solution satisfying the conditions of the local minimum. Now since $h$ depends on the previous global minimum, this estimate becomes dependent on the starting solution for Landweber iteration. To make the estimate independent of the starting solution, we should use $h\\approx h_\\infty$, where $h_\\infty$ depends only on the global minimum $(\\boldsymbol{x}^{*},\\boldsymbol{f}_{*})$. For the global minimum $(\\boldsymbol{x}^{*},\\boldsymbol{f}_{*})$, we have $h_\\infty \\approx h^{*}\\approx (\\boldsymbol{f}_{*}/\\boldsymbol{u})^{-\\eta}$, which was estimated for the"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " conditions that can be derived from the linear case by applying linear approximations about a zero-flow solution. The convergence condition is independent of the time derivative of the reconstructed solution. This means that provided the input data are consistent, given a sufficiently small initial norm of the initial conditions, the reconstructed solution converges to the correct solution in a closed loop control with no external time derivatives. This is in stark contrast to the nonlinear schemes analyzed in \\cite{anim2,aqua,DPC_nonlinear}, which contain all terms in the total variation expression, but are still dependent on external time derivatives for convergence bounds to hold. For comparison with the methods of \\cite{cd,fcd,ncd}, the argument in Section \\ref{sec:conv_ne_NL} indicates that the dispersion of the system would be larger than that of the model presented in this paper. The implicit method in this work is also simpler to implement than any of the methods proposed in \\cite{cd,fcd,ncd}. The convergence analysis in this paper is valid for inexact continuation method without any modification of the solution norms"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a binary segregation in the phase space $\\Omega$. Without loss of generality, the basic structure of the social dilemmas can be summarized in an ABM as follows: for all $i \\in \\set{1, \\ldots, n}$, agent $i$ chooses $\\xi_i \\in \\set{A, B}$, where $P_{ij}(a,b) > 0$ denotes the probability that agent $i$ chooses action $a$ and agent $j$ will choose action $b$. We assume that $\\xi_i (=\\xi_j)$ a social agent chooses in a given social context follows a leading effect (leakage and persuasion), an ambient effect (the behavior of surrounding agents), and an stabilising effect (theAgents' preferences), which are defined by the function $f_i(\\xi_1, \\ldots, \\xi_n) : \\set{A, B} \\to \\set{A, B}$. Note that specific functions $f_i(\\cdot)$ depend on detailed information like the socio-economic background, culture, or"
        }
      ]
    },
    {
      "train_loss": 0.07547265625,
      "update_counter": 1500,
      "epoch": 48.354838709677416,
      "val_loss": 5.708984375,
      "val_entropy": 0.781494140625,
      "val_ttr": 0.6041666666666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the obtained results for configurations C_{1} and C_{2} as a function of time in the form of colored curves. The horizontal axis represents time in years. The vertical axis represents the number of solar wind streams with a specific selection of attributes. The mean and the standard deviation of the results, obtained in different runs, for each hour are shown in the Supplementary Material. The mean value of the proportion of the accelerated / decelerated solar wind streams for each hour, among all the channels, for each of the two configurations, is shown in Figure~\\ref{proportion_accel_dec_hour_yuri_table}. It can be seen that acceleration / deceleration of the solar wind speed is more likely to occur in intervals when the solar wind speed was $>$ 300 ~km~s$^{-1}$, w.v.c. (both from the Sun to {\\emph{PSP}} and in the opposite direction), when $v_r$ was small (for both directions) and when $\\theta_r$ was large for both the Sun to {\\emph{PS"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple flow by itself, with symmetric FLIP flow and hybrid flow with discontinuous VEC pressure. The results are similar for different values of $\\Omega$. Comparing the last two rows with the first one, we see that including the FLIP flow makes the a posteriori error estimate inaccurate earlier due to the higher absolute velocity. This leads to more incorrect time steps in the solution of the system \\ref{system} and, consequently, the fluid problem is solved less often. This happens also for the FLIP flow and symmetric FLIP flow together, although the error is smaller. This demonstrates the importance of using an enhanced Navier-Stokes solver for flows with discontinuous velocity. Also clearly shown is that the a posteriori error estimator correctly estimates the a priori error for the simple flow, but with the FLIP flow the error is underestimated by almost two orders of magnitude. This is a problem, but contrary to Pichavant \\cite{pichavant_as_problems} observed for flows with continuous velocity, he argued that this does not happen and leads to convergence of the method. However, we have previously"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The adaptation of the time steps is done by employing a lower bound for the relative residual or the L-BFGS-Q algorithm \\cite{Q2011}, which is used for the stabilization of the solution. The model of joint biplane flexure is computationally too demanding for a single multirate framework. Therefore, a separate single-rate subproblem for the fluid is used as part of a dual-rate scheme, as explained now. The overall framework is flexible, can be easily scaled, and can handle different problem parameters, as shown in Section~\\ref{sec:exampleSolves}. At each time step, the solution of the single-rate problem (dual rate) is used to drive the next steps of the higher-order solution of the biplane problem (primary rate). In the model of joint biplane flexure, on applying a loading to the model, the solution to the problem of fluid first changes and this is transmitted (via the mechanics) to the solid via step sizes adapted using the procedure described above. Because of this transmission, a higher scalability and higher accuracy of"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3% on the second floor and 6\\% on the third floor. After closing the stores started to be scanned, the accuracy of the location information on each floor increases as well, from $1.7\\%$ to $2.2\\%$ on the first floor and from $2.6\\%$ to $3.2\\%$ on the second floor. This might be due to the fact that the items on the respective floors are running out of fast moving goods and there are low probability of incorrect scanner reads. However, accuracy remains high on the third floor likely because of the character of the goods sold which are possibly not easy to find after several purchasing. Table \\ref{tab:fig_loc_in_time} shows the evolution of accuracy of the estimation on each floor in terms of true root mean square (tRMS) over the course of one day. It can be observed that even though the accuracy on each floor varies between $8\\%$ and $12\\%$, the average tRMS on each floor is below $10\\%$. This shows that the error rate of recommendations in this store is actually quite high"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "acq} it is shown that if the CME charge is initially positive, the instrument quality factor should be larger than one if one wants to continue observing the CME in {\\emph{STEREO}} observations past $300~R_\\odot$. Here, we consider a more realistic, physical scenario in which the charge of the CME is varied according to a Gaussian distribution with a mean charge sign and a standard deviation representing the dispersion in charge due to friction during the eruption.  In \\cite{Valgushev:2015acq} the mean charge sign was 0.5. In this work we consider a broader distribution with a mean of 0 and a 1-\\sigma deviation of 0.4. An example of the evolution of the CME QSO with time is shown in \\cite{Valgushev:2021cyg} by YouTube video \\cite{valgushev2021altimetatcmete}. The video shows the evolution of the CME in {\\emph{STEREO}} observations as well as theQT-2 QSO constant"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,Kosevich1998,Kosevich2010} and the discussion below), the lack of a scale invariant description of it, as well as the lack of a true invariant measure on the space of the parameters (see e.g.~\\cite{Iliopoulos1997,Iliopoulos2001,Kaltenbacher2006}) hinder a systematic study of higher order corrections to the HOC. For instance, it is not unknown cases of HOCs with the cone condition of order $n$ (see e.g.~\\cite{HoriHosonoKoma,BorisovSteinhauer2013,ElvangDai2014}). In addition, observation of HOCs are often enough to give insights about the dynamics of the plasma (see e.g.~\\cite{Sako2011,ElvangDai2014,BorisovSteinhauer2015,BorisovSteinhauer20"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of forces and gradients. For the gradient estimated by the Chavdar rule, its efficiency is unknown. In fact, for the Landweber iteration the Chavdar rule is applied at each iteration. From what we saw in Section \\ref{sec:test}, we are afraid that similar problems may occur for more sophisticated iterative methods if they are combined with the effective heuristic Rule 410. This rule recommend transitions, when the particle is in the boundary close to the exit $c_{\\alpha}$, regardless of the efficient running of the particle along the -theoretically- optimal direction. If the effective running is disordered, then Rule 410 recommends transitions of the particle to make it progress to the boundary $D_{\\alpha}$ instead of $c_{\\alpha}$. It is in this sense that we call the recommended transitions \\emph{unefficient} in Section \\ref{sec:theory}. If one is willing to apply Rule 410, then one has to guarantee that the effective running of the particle is always order optimal. We do not know a iterative method which would not have this problem"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each direction independently. The nearest linear approximation has the same parameters as the real function, i.e., $\\lambda_i = 1$ and $\\mu_i = 1$ for all iterations. As the image is captured, it is observed that the linearization converges to a small region. As the iteration increases, the region gets narrower and narrower. An example with $d = 4$ and $c(0) = [-0.13,  0.12,  0.11, -0.11]$ is shown in the top left of Figure \\ref{fig:nl_img_example}, and the projection of the points onto each coordinate direction is shown at the top right of the figure. Even though the primal parameters $\\bm\\Theta$ is not constrained to be symmetric, as can be seen in the bottom left of the figure, the neural network model is GPC-oriented and therefore the minimum area has by far a symmetric shape. In fact almost perfectly symmetric, as shown in the bottom right of Figure \\ref{fig:nl_"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a binary micro-structure, which can take three general forms in different models: (i) \\emph{smooth} structure with $g_j(\\xi)=1$ if $\\xi\\in \\mathcal{B}_j$, $g_j(\\xi)=0$ elsewhere $\\xi\\in \\Omega$, $\\mathcal{B}_j\\subset \\Omega$ is a set of size $r_j>0$; (ii) \\emph{singular} structure with $g_j(\\xi)=\\xi^2\\mathbb{I}_{\\xi<0}$, $\\xi>0}$, $\\mathbb{I}_{\\xi<0}$ being the indicator function; (iii) \\emph{ dual \"`anti-\"} structure with $g_j(\\xi)=\\xi^2\\mathbb{I}_{\\xi>0}$, $\\xi<0}$, $\\mathbb{I}_{\\xi>0}$ being the indicator function. If $g_j$ is smooth, $Q_j$ is defined in the classical sense and if $g_j$ is singular, $Q_j$ is defined"
        }
      ]
    },
    {
      "train_loss": 0.05398828125,
      "update_counter": 1750,
      "epoch": 56.41935483870968,
      "val_loss": 5.953125,
      "val_entropy": 0.750244140625,
      "val_ttr": 0.6018880208333333,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both configurations, the evolution of the optimal solution of (\\ref{ Pontryagin maximum conditional value functions})  over time  for the first and the second controller, respectively. The solution for control problem (\\ref{trading_decision_problem}) is shown in the \\textit{Namely} format\\footnote{For an introduction to the Namely format and for a comparison with other graphical representations of optimization problems see \\cite{mahutka_comparison_of_representation_2019}.} in Figure~\\ref{comparison_whole_timeline_configuration_1}. As we can see from Figure~\\ref{comparison_whole_timeline_configuration_1}(a), the first controller is always fully satisfied with the allocation choice of the agent and hence the solution of the optimization problem (\\ref{ Pontryagin maximum conditional value functions}) is constant. This, however, does not allow the controller to optimize the value functions. On the other hand, the second controller, as shown in Figure~\\ref{comparison_whole_timeline_configuration_2}(\\ref{Namely} format), gets"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple one-dimensional problem of solving $u' = 0$ on $[0, T]$ using initial condition $u_0 = e^T$ on $u_T = 0.5$. The time step is $\\tau = T/(1000c)$. For the estimated error index we use the output of the P-MPC estimator on these meshes. In this problem the flow is constant on the time mesh intervals, and the estimated time-derivative therefore incorrect. However, it does not affect the estimation of the overall error. The estimator still finds a reasonable approximation of the error. In particular, we note that its convergence rate towards zero is order of magnitude $O(h^2)$ suggesting that the a posteriori estimator based on the P-MPC model is strongly consistent. Furthermore, the solution itself is simple and we observe it to be well suited for the test of the estimator. It is accurate and sensitive to small error even in solution. This translates to accurate estimation by P-MPC. Indeed, the solution has absolute global error $E_A"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid step size is determined by the time scale of the shortest supervision signal, i.e., $t_{\\text{ref},_F}$, while the solid step size is determined by $t_{\\text{ref},_S}$. As we shall see in Section~\\ref{sec:stability}, this choice ensures stability during the simulation. A detailed derivation of the optimal step sizes can be found in Section~\\ref{sec:multirate}. The overall simulation time is split into \\textit{fluid}, \\textit{solid} and \\textit{communication} parts. The time-steping scheme is called Multirate Explicit (MAE) \\cite{Lowder1997}, which is one element of the Family of Multirate Algorithms (FMA). The overall time complexity of a communication, fluid and solid phase has been analyzed in Section~\\ref{sec:timecom}, and amounts to $\\mathcal{O}(nf\\_sol+ nf\\_com)$, where $n$ is the total number of nodes (elements in this case), $f\\_sol$ and $f"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the B floor and around 6\\% on A floor during the last 30 frames (see in Fig.~\\ref{fig:freemata2}). Moreover, the fine details of the numbers of scanning frames needed to reduce the errors for different frequencies are shown in Fig.~\\ref{fig:freemata3}. The number of frames for $50$ kHz is not high enough to reach any reduction of errors. The greatest change in the numbers of the tabs corresponding to the frequencies and their proportions this algorithm achieved is shown in Fig.~\\ref{fig:tabnums1}. This algorithm not only reduces the errors on the same ground but also finds some tabs that are missed by other algorithms. One of the reasons for this is that this algorithm takes into account the updates regarding current found tab postcodes into consideration and thus, local tabs which are found by the TomTom device on the ground are not considered as possible overlaps for other tabs. This means that, not only do the local tabs on different grounds not have any effect on one another but also, the same local tabs on the same"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "April25} it is shown (see Fig.~\\ref{fig:CME_radius}) that the radius of the CME continuously changes with time. Specifically, at what it is yet known a unique duration of a solar wind interval during which the agent of the eruption was a CME. In this time period the agent's radius continuously decreases, loweing the amount of the ejected mass (by the area bound), thus becoming underproductive.  In this situation the event's signature does not change much from one to the other such events, and the SACCESS rate would work in the same way if the active values of the parameters would be changed by a small amount.  Moreover, such events happen rarely, and sometimes even not at all given the chance threshold.  In contrast, the version with repair (Fig.~\\ref{fig:repair}) shows that such events run into the population, combining different episodes. This combination results in various agent properties. The chance threshold is effective for such a variety of possibilities, and hence the SACCESS rate works badly in this version.  Such a failure is due to"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,BO,Zee,Erber} and references therein),  the appearance of physical applications has focused attention on the strong impact of the choice of the magnetic field on the evolution of the magnetic field itself (both at leading and next-to-leading order in the expansion in the velocity), which is contained in the value of the magnetic source term $\\mathbf{B}_S$ (see \\eqref{magneticsource}). The source term is determined via the so-called ``magnetic dipole approximation'', which assumes that the evolution of the magnetic field is mainly influenced by interactions between the magnetic field and its own tail, which is triggered by the acceleration in the plasma. The initiation of the acceleration occurs close to the sun where turbulence is prominent, thus the magnetic source term is determined in terms of a specific type of turbulence. In \\eqref{magneticsource} the Mellin transform of 5-minute averaged global magnetic fluctuations $\\delta \\bm{\\mathrm{B}}(t)$ is used, where the 5-minute interval was chosen on the basis of various studies (see e."
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the function main derivatives. For Landweber iteration, it depends on the accuracy of the generated iterations. In the original paper~\\{\\cite{LW},}\\footnote{In fact, the method is general to restore a original shape from any quantity of successive generated iterations.}, it is stated that the Landweber iteration maintains a sequence $\\{I(b_k)\\}_{k=0}^\\infty$ with a fixed gap from the original function $f$ such that $d_L(f,v_k)\\geq 1/k$ for some fixed $k$ and $d_L(f,v_k)=0$ means $f=v_k$. Accordingly, the authors suggest to take as accurate estimate of the main derivatives the minimum distance between the $k$-th generation iteration $b_k$ and the original function $f$ such that $d_L(f,b_k)\\geq 1/k$. In this way, the effective convergence of Landweber iteration is guaranteed for any gap between the original function and its first generation iteration and for any accuracy"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " with respect to a tolerance, which is localized inside the space to contain the nonlinear term. The localized tolerance can be zero. Thus, it is possible to converge the solution of the algorithm to $0$ at any time. \\cite{A86,AS96} is the classic paper for the establishment of this kind of convergence. Let us briefly summarize the theory there. Consider the algorithm of line (\\ref{subproblem}), and let $w_{i+1}$ be any solution of it. We apply a linear rank-one perturbation to $w_{i+1}$, that is, $w'_{i}$ is obtained from $w_{i+1}$ by simultaneously decreasing its value and increasing its value by the same fraction of its value. Then we apply the algorithm to $w'_{i}$. We prove that if the iteration number $i$ is larger than a certain number $iStar$, then $w_{i}$ is within $\\sqrt{d/m}$ fraction of $w_{i+1}$, where $m$ is the rank of $A^{T} A$"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a binary micro-roughness, i.e., $g_j=\\begin{cases} 1,& \\text{if $|K_i-K_j| \\leq h_j$};\\\\ 0,& \\text{otherwise,}\\end{cases}$ where $K_j$ is the $j-\\text{th}$ contact point on the shape, and $h_j$ is the length of the $j-\\text{th$^*$}$ th harmonic component of $g_j$. In <Reference here> we use implicit differentiation to find the local contact forces at the contact point by using the chain rule. That is to say, at $K_j$, the local contact force $F_{K_j}$ is derived as $\\frac{\\partial }{\\partial x_i} \\left(\\frac{1}{2} \\sum_{j=1}^n \\gamma_j h_j \\right) = \\sum_{l=1}^n \\frac{1}{2} \\gamma_j^{l-1} h_j^{l}"
        }
      ]
    },
    {
      "train_loss": 0.068642578125,
      "update_counter": 2000,
      "epoch": 64.48387096774194,
      "val_loss": 5.501953125,
      "val_entropy": 0.790283203125,
      "val_ttr": 0.60546875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the comparison between the maximum overlap area obtained with: the configuration discussed in Sec.~\\ref{configuration}; and the others, for different values of $N$. In this figures we also plot the area of the best configuration with $N=20$ in red. In the first figure  we can see that, no matter the $N$ value, the second configuration (green and blue curves) obtains, when $N=40$, a peak smaller than the one obtained with the first one (orange curve). This means that the reconstruction error using the $N=40$ subset is bigger with the second configuration. Moreover, the second configuration obtained for $N\\geq40$ shows peaks greater than the one obtained with the third configuration, with $N=20$. As a result, we can say that the third configuration with $N=20$ provides a better reconstruction for higher $N$ values, while it is better than the second one with lower $N$ values. In addition, the first configuration it is better than the second one with any value of $N$. From"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy problem of solving $-\\Delta u_i = 10$ in $D_i$ for $i \\in \\mathcal{I}$, where $\\mathcal{I} = \\{1,2,4,7,9\\}$ with each grid containing $n_i = 9$ points. In this problem, we have $N=9$ so that our estimated resolution is precise and we can use the a posteriori error estimator directly. For comparison, we also provide results of a number of a posteriori error estimators for a heterogeneous mesh in~\\cite{CA2010} and a priori error estimators in~\\cite{Hosono_PFl_2009}. Table~\\ref{fluid_residuals_uniform_equal} clearly shows that our estimator outperforms existing methods. In case of~\\cite{CA2010} the estimator is based on a direct solution of a local problem of the same form as the global problem, by using the surrounding nodes on the grid; the estimator is conservative by construction. In our problem there is no interference"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall time step $\\tau$ is computed as $\\tau=\\min \\left\\{ \\frac{50 \\frac{ \\Omega_s }{C_s} \\gamma \\gamma -52 \\gamma +6}{M}, \\frac{ \\tau_{fluid} + \\tau_{solid}}{M} \\right. + \\sum_{j=1}^M \\frac{\\omega_j}{2} + \\frac{\\tau_{solid}}{M} \\left. + \\sum_{k=1}^M \\frac{\\omega_k}{\\omega_1 \\omega_2}\\right\\}$, where $M$ is the number of multirate steps involved in the solution of the overall system \\eqref{eq:final}. The method is implemented by using a Low Rank and Sparse (LARS) multirate framework \\cite{Ciaraldi2009LARS}. The time steps $\\tau_{fluid}$ and $\\tau_{solid}$ are obtained using a classical third-order Runge-Kutta scheme which is, however, adapted to the problems at hand."
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3% in some frames on the $2.4$ GHz band, as shown in the last row of Fig.~\\ref{fig:freq}. Next, the average distance between the estimated locations of the past $1000$ frames on the $2.4$ GHz band is shown in Fig.~\\ref{fig:dist2}. Green squares show the actual distance between the two images. The estimation is optimal, since we ignore the floor. However, a $10-20\\%$ error for the $2.4$ GHz band seems to be always present possibly due to the high similarity in the frequency range. Indeed, from $12.5$ GHz to $13.0$ GHz there is only one band carrier, and from $14.5$ to $14.7$ GHz there are only two band carriers, which are very close in frequency, as shown in Fig.~\\ref{fig:image}. We can also notice that the estimated distance on the $1.9$ GHz band is significantly smaller than on the $2.4$ GHz band"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "Natur,Valgushev:2015NaturSA} it is shown that the Euler-Lagrange equations for the functional (\\ref{eq:cons}) are not invariant to scale transformations $\\psi(x)$ for the parameters $\\xi$ and $\\phi$. This means that (shallows as it may seem at first glance) the system is not conservative. Moreover, even if the scalar $\\xi$ solves the Euler-Lagrange equations for the modified functional $\\tilde{H}(\\phi,\\xi)=\\left[H(\\phi)+\\xi^2/2\\right]M^2$, the position vector $\\bold{\\mathbf r}(\\xi)$ will not be geodesics for the modified potential $H'(\\phi)+\\xi^2/2M^2$. The reason is that the scale transformation $\\xi\\rightarrow \\xi'\\left( \\xi \\right)$ will transform the position vector $\\bold{\\mathbf r}\\left( \\xi \\right)$ according to $\\bold{\\mathbf r}\\left(\\xi'\\left( \\xi \\right) \\right) =\\tilde{\\bold{\\mathbf r"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKovnerVardi}), the study of WSPCs as a tool for computing the dynamical cone generally starts with the example of quantum dots (QDs) (see e.g.~\\cite{Wspc1, Yang, Li, Xin, Dudarev1, Dudarev2, Fang, Oussifard, Post, Panichev, Kovalevskaia, Kovner, Kaltenbacher14, Kaltenbacher16, Rastegaee, Rastegaee16, Hammid, Si, Shi, Tang, Qin, Guo, Yi, Mao, Jiao, Jiao16}). QDs are a well-known example of embedded carriers in a semiconductor medium (see e.g.~\\cite{Sze, Baulch}). Typically, such carriers are semi-free-standing nanostructures like islands, islands after insertion of electrochemical dots, or junctions of artificial diamond layers (see e.g.~\\cite{Sivaprasad, Suryawanshi, Bansal,"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the target function. The efficiency of the Landweber iteration does not depend on the accuracy of the written function, but its rival the hybrid method does. Therefore, for the Landweber iteration, the figure of local search could be taller than shown in Section \\ref{section:localsearch} since it involves another inaccurate estimate namely the Subspace Projected target. In this case, part of the improvement comes from the decrease in the performance of the Subspace Projected method. Also, from the summary statistics in the\u9644\u4ef6, we can see that the resulting function for the Landweber iteration looks slightly worse that the first hybrid method result. This is expected since the hybrid method uses two iterations while the Landweber uses only one. However, as we discussed in the section for the hybrid method, the Landweber method finds a minimum that is the first local minimum of a series of local minima.  Since that first minimum estimated by the Subspace Projected method; it generates a spurious local minimum around (Z,-func(Z)) that is far from actual minima. While the summary statistics in"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " relative to a uniform norm approximation of the objective function. This implies that one can prove convergence of the sequence even if it is affected by \\emph{removable functions}, the removal of which would not change the linear-linear convergence statement. Besides, the local convergence theory can be easily generalized to the nonlinear case when the family of nonlinear functions $L^{(m)}$ in \\eqref{nonlinearwhole} involves a closed form expression of the perturbation $m$. In the nonlinear case, the convergence rate depends on both the initial power and the perturbation. For fixed initial power and perturbation, the smaller the norm of the perturbation, the faster the local convergence. By this argument, we recommend that the perturbation $m$ should be as small as possible. However, in practice it may not be reasonable to pursue smaller and smaller perturbation since it may make the linear-linear prediction function $L^{(m+1)}$ even less accurate. In this case, we suggest adopting a slightly larger perturbation that is sufficient to guarantee a reasonably accurate prediction function $L^{(m}}$. In our experiments, we choose the perturbation to be equal to the"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbations in the boundary of a small tumor. In layman's terms, we consider a healthy region $\\Omega$, where the potential function $\\phi_0$ is smooth and a small perturbation $\\phi_1$ is applied only on the boundary $\\partial\\Omega$. We also allow $\\phi_1$ to be modeled by a simple small function $g_j$. Such models are created for several applications including drug delivery \\cite{Chung}, designing optimal implants \\cite{Busse}, etc. For such models, we want to find the largest $J(\\phi_{1})$ values under the constraint $\\phi_{x_j}=0$ on the boundary $\\partial\\Omega$. Typically, the potential $J(\\phi)$ is found by way of the $L$-functional \\cite{CA}, whereby we minimize the $L$-functional with respect to $\\phi$ using standard finite element methods. For tumors, models like \\eqref{eq:model} have been studied en masse in \\cite{Cahuc,Layton,Hernandez,Pius,Bra"
        }
      ]
    },
    {
      "train_loss": 0.049060546875,
      "update_counter": 2250,
      "epoch": 72.54838709677419,
      "val_loss": 6.130859375,
      "val_entropy": 0.70703125,
      "val_ttr": 0.6129557291666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for each node, the evolution of provided and queried values over time at the two configurations. We highlight that in Figure~\\ref{comparison_whole_timeline_configuration_1}\u2014where $n$ is equal to 16\u2014the average delay of a node is relatively high and it increases significantly when providing the value ``1\" ($94$\\% of incorrect delays). We also notice that its average distance is very high\u2014$$5.2\"$ with the highest $20\\%$ of distances falling between $6.5-7.5$. This is due to the high variability in the shape of the distribution, which is not very smooth\u2014as shown in Figure~\\ref{fig:mu_image_configuration_1} in the supplementary material\u2014resulting in small queries. As for the mean distance, we observe that $95$$\\%$ of the distances are within $7$. The average distance is a good indicator of the proximity of the nodes within the graph $G$. In contrast, for the results obtained at $n=32$\u2014as shown in Figure~\\ref"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy problem of evolving a given initial data by a one-dimensional wave flow. The initial data are created according to~\\eqref{eq:Nfld_initial_data} with $v_0 = -\\left( x^2 + 1 \\right) / 2$ and $u(0) = 1$. The parameter $a$ is chosen such that the second phase $\\Omega_i = \\mathcal{O}(10^{-3})$ for all levels $t_i = 0$, for $i \\in \\set{_0}^4$, and the initial data are given equivalently by $u(0) = 1$ and $a = -4 \\left( x^2 + 1 \\right) / 5$. In this example the solution is clearly symmetric under $i \\to -i$ because the phases evolve under identical conditions. We fixed the resolution of levels by setting the volume of level $i$ to be $\\mathcal{O}( t_i, v_1^2 + v_2^2 + v_"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid time-stepting scheme is used for all flows and the solid time-stepping is used specifically for model $M_1$ (no twisting) and $M_2$ (no twisting but with stabilizing frame). The scheme for solid is divided into subschemes, one for each motion involving contact (picking up from the fact that solid outputs are computed as a sum of integrals over motions), such that the scheme for $bu_i$, $i=1,...2N$, is a second order integration scheme for $bu_\\text{out}(t)=bu_\\text{in}(t+t\u518d\u52a0\u4e0a2N-1\u6bb5\u5fae\u5206\u6bb5\u201d), where $bu_\\text{in}(t)$ is computed as the output from $A$ multiplied by the intray values of the integral equation for $bu_\\text{in}(t)$. Similarly, the scheme for fluid is a second order integration scheme for $fu_\\text{out}(t)=fu_\\text{in}(t+t\u518d\u52a0\u4e0a2N-1\u6bb5\u5fae\u5206\u6bb5\u201d), where $fu_\\text{"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $4.7\\%$ on the $3.5$ GHz frame without the scanning, and reduces to $2.7\\%$ after scanning, as shown in the table $9$. Also, the errors on the second floor mainly due to not having the received signal strong enough to be reliably localized, this could be solved by adding several weak reflectors on the ground of that floor. As for the third floor, it is noticeable that $2.4$ GHz floor $2$ has the highest errors for the without-scanning configuration, which is a normal phenomenon as it is the closest frequency to the $50$ Hz frequency component. Therefore, the errors on this floor on the $2.4$ GHz floor are usually high. However, as can be seen in the table $9$, as a result of the scanning, the errors on this floor reduce from $13.1\\%$ to $4.6\\%$. As for the $9.2$ GHz band, the results show that the scanning in the power delay line (PDL) improves the localization errors on all the floors on"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "April22} it is shown that if the SBO condition is satisfied, then a moving CME charge gradually fills the equatorial Heliosheath with plasma. This process continues while the charge remains attached to the CME. However, in \\cite{2021ApJ...914...61Y} the authors show that once the CME has accelerated beyond 300~km~s$^{-1}$, the flow speed of the charge reaches the regional value in the heliosphere and the charge ceases to expand. As a result, the radial profile of the charge mass gradually decreases, although the charge remains structured. According to \\citet{2021ApJ...914...61Y}, such a scenario is consistent with sub-Alfv\\'enic plasma with a very small amplitude humped magnetic field lines. To resolve such a field line topology, the CME charge must compress the background plasma along the heliocentric distance of the eruption. Although such a scenario is consistent with observations, it is much more complicated than the original SBO theory. The SBO concept originally was presented to"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKippenhahn,PfeiferAlbrecthe,BenmermazSteinhaus,BenmermazThesis,Benmermaz2} and references therein), the achievement of such a constraint may present significant numerical challenges. For the classical dynamical system $\\vec{\\dot{x}}=F(x)$, describing the evolution of the position $x$ in time when acted upon by a deterministic force $F$, the tangential cone condition is an automatic constraint that is naturally imposed on the position $x$ at every time step by means of numerical integration methods. However, the existence of an tangential cone also has significant mathematical implications for the evolution of the system over time, significantly increasing the discoverability of hidden structure in the system (see e.g.~\\cite{KaltenbacherKippenhahn,PapadopoulosJfraen} and references therein). In contrast, for a large number of contemporary dynamical systems, including those described by mechanical systems with hybrid dynamics, the force $F$ is not deterministic, but given by an uncertain parameter $\\lambda$ (i.e.~\\ $"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the target function. The efficiency of the Landweber iteration does not depend on the accuracy of the solved quadratic problem, but on the size of the quadratic problem solved. In particular, by construction, at the first local minimum, the solved problem has  a size of 0 and thus, in certain cases, the Landweber iteration produces a spurious local minimum for the effective energy function associated to the problem (L). Below, we prove that the direction of motion generated by the Landweber iteration requires changing the threshold only in a region where it is known to produce a solution. This is done by shifting the threshold starting from the highest value. More precisely, note that the Landweber iteration requires solving (L) with a modified threshold. However, since the effective performance of the Landweber iteration depend on the number of solved quadratic problems, in certain cases, this modified problem (L) can have a nonzero size. Thus, at the first local minimum there is a solution with a nonzero energy. However, the direction of motion is indeed first found at the minimum. To prove this,"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " based on a finite neighborhood of each point in the direction of the linear coefficient matrix. In this case, one can argue the convergence of the algorithm by randomly putting points in the ambient space and testing them in the linear subspace under the linear coefficient matrix. The algorithm demonstrates local convergence in this sense. More details of the local convergence are included in Section \\ref{The_Proof_NA}. It is worth mentioning that the local convergence in the linear case was shown in various ways in other algorithms \\cite{Steinbach,Tanaka_2009,Lepski_,Ax,Par_rec}, but the rate is not known in most cases. Reference \\cite{C_view} is the first paper to apply the linear convergence in the nonlinear case. However, the numeric value of the rate is not provided. In \\cite{C_view} , the authors argue the algorithm converges in a neighborhood of the target function, but do not provide any guarantee. The linear linear convergence with a correct rate is proved in this paper and I prove the algorithm converges in a neighborhood of the target function."
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small random perturbation such that $Y$ is a standard normal random variable if $g_j = 0$, and $Y - g_j$ is a standard normal random variable if $g_j > 0$. We consider in particular the case where $N=2$ and the domain is a rounded rounded rectangle $K_\\mu := \\ball{x_0} \\subset \\R^2$, $\\mu >0$ with center $x_0 \\in \\R^2$ and width $\\mu$, where the coordinate system is such that $x_0=(0,0)$ and the top left corner of $K_\\mu$ is $(0,0)$. We assume that the perturbation $g$ is of binary type, that is there exists a universal constant $\\theta>0$ so that $g_{k}(x)=\\theta$ whenever $|x|=k$, $0< g_{k}< \\theta$ in a neighborhood of $k>0$, and $g_{k}=0$ otherwise. Such a model is described in \\cite{DANI"
        }
      ]
    },
    {
      "train_loss": 0.038125,
      "update_counter": 2500,
      "epoch": 80.61290322580645,
      "val_loss": 6.376953125,
      "val_entropy": 0.67138671875,
      "val_ttr": 0.6272786458333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $1$-out-of-2 scheme and the $2$-out-of-3 scheme, the correlation between the $P_{T}$ and the $E_{T}$ figures and the predicted missing particle rates based on the Bayesian inference method (\\ref{ Bayesian  )} and the frequentist method from our bootstrapping analysis \\cite{Chakravarty_2018}, for the two configuration choices previously described. We first focus on the $1$-out-of-2 scheme. In the upper figure, the green solid and dashed curves correspond to the predicted missing particle rates using Bayesian inference, and the frequentist method, respectively. The blue vertical bands show the $P_{T}$ and $E_{T}$ combinations of data that are expected to appear in the LHC detectors. Notice that they are the same bands for the $E_{T}$ figure in both the $P_{T}$---$E_{T}$ $(1:\\ 2)$ scheme and the $P_{T}$---$E_{T}$ $(2:\\ 1)$. In"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy problem of evolving a given initial data by a one-dimensional wave equation ($p \\in \\HTAS(N_t=50,N_t=20)$) with respect to a second-order discretization of the Laplacian. The computed residuals are at the order of $10^{-2}-10^{-4}$, except for the case of $u_T = 0$ (table row \"expected error\"). This simple problem is used to test the a posteriori error estimator in the same way as in the literature, see e.g.~\\cite[Experiment 2]{RudinEtAl13}. We would like to make the following comparisons: a) to the grid-based a posteriori error estimators in Table~\\ref{uniform-fpas} shows the estimates performed by Grid FPAS for the same problem as well as by Grid FPAS and Lag FPAS for the grid problem implemented in~\\cite{LagliaOtt} (b) in Table~\\ref{uniform-gap} compares the new Gap FPAS for the problem defined by standard discretization of"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is solved using a single line of code featuring advanced output statistics, and provides a `channel' through which it is hoped the solved solution will act as a `seed' for the initialized solid initial value problem. This fluid routing is done using a multirate time-stepping scheme with adapted step sizes and data replication to cope with the separate time scales of the problems. The time scale corresponding to a particle's motion through the solved solution, the solution of the mechanical problems and also the stopping events between fluid injection and grain growth/coalescence events are all adapted using a simple idea: a test for the difference of levels of convergence between the fluid problem and each of the solid problems. Given that we know the solution of the fluid problem, the multirate time-stepping scheme using adapted step sizes is used to solve the solid problems. This test is done every time step, and allocates the smallest step size between adaptations to solve the solid problem with the lowest convergence level. The remaining adapted step sizes are allocated in any form required by the number of parallel solved problems. This approach"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $3.2\\%$ $ (see Figure \\ref{fig:ThickWallL12}), when the scanner is placed on the overlap between $30$-degree and $45$-degree floors. Also, the results from Figure \\ref{fig:ThickWallL12} show that using a multi-level path-loss model and extended Kalman filters not only reduce the errors on $30$-degree floors but, also, the impact of the floors near the horizon. The multi-level path-loss model considers $N$ divided suburbs such that each suburb consists of $N_{suburb}$ small buildings. There are $7$ subdivisions and $7$ buildings per subdivision and the path-loss equation considers, directly, and through reflecting walls. Thus, the path-loss model considers all possible paths from a source to a receiver through $N_{r}$ number of reflecting walls. Also, using the KE-based wave-number selection and by using thekalmanTab ($<0.5$) in Eq. (\\ref{eq:WLC}) for the"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "Apr05} it is shown (see especially Fig.4) that if the CME is static and has a radius that is larger than 38~$^m$, then the shock strength at the cloud edge is the same as the quiet-region shock strength at the same distance from the Sun, not the much weaker shocks associated with a CME. This means that no static, one-sided deflection by the EPs under the considered conditions can lead to a disappearance of the CME. Furthermore, if the trajectory of a CME crosses the magnetic pole, then at some moment of its motion there is no deflection of the S/C along the magnetic field, what means that the consideration of the interaction of the CME with the magnetic pole is also inappropriate. Therefore, in all cases, the CME will be accelerated and integrated into the CM-flux. The flux intrusion also occurs through the magnetic pole, what means that it must also be saying about the occurrence of a cross-Parker spiral behind the CME. Moreover, the cross-Parker spiral decay must lead to acceleration of the CME through the process of energy injection"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKippenhahn,PfeiferHuxley,IshikawaSadayoshi}, this paper), there are other situations where the tangential cone condition happens to be particularly important, where the evolution of the tangent vector is easily implemented and observed (see the examples of this in \\cite{FuchsSibon,RangayaniSibon,Sibon}). This is due to the fact that for the numerical simulation of the system, the evolution of the tangent vector only involves difficulties when features of the curve that have period $2\\pi$ or $p\\pi$ (for compact intervals of the unit interval $[0,1]$ and for rational values of $p<1$ or $p>1$), which means that objects with period $2\\pi$ (e.g. periodic orbits) or $p\\pi$ (objects with chaotic dynamics as the H\u00e9non map with parameters $p=1.0 and a=0.25$ (see e.g. \\cite[Fig. 1]{FuchsSibon}) are"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the target function. The efficiency of the Landweber iteration does not depend on the accuracy of the solved quadratic problem, but on the accuracy of the initial value. In fact, by starting with any initial value equivalent to the solution of problem (\\ref{eq:genq}) with $b=b_{max}$ and $a$ fixed to the value of the Landweber iteration, one obtains the local well-posedness for the following $Z$-generative use rule: if $Q_{k+1} > T_k \\max_{i} (f_i(x_k))$ then solve problem (\\ref{eq:genq}) for $b=b_{max}$ and $a$ fixed to the value of the Landweber iteration and use $b$ as the next iterative value of $Q(f(x))$. This rule in turn gives birth to the spurious local minimum found by the Landweber iteration. Indeed this first minimum is present even in the unaccurate model where $f^\\prime (x)=\\frac{d}{dt} f(x"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " based on a finite neighborhood of each point in the direction of the linear coefficient matrix. We prove that the optimal solution gets into a closed form when the linear large neighborhood is given. The expression of the optimal solution only contains the solution of the linear case and the initial value, and does not involve the nonlinear function. Therefore, we also indicate the linear convergence in Figure \\ref{as1}, and will give the detailed procedure of extraction in Section \\ref{se:nu}. In the linear case, we can also use the standard Cauchy criterion for convergence for the constructed Q-function \\eqref{eq:quatenational}. See Appendix \\ref{ap:as1} for the detailed convergence criterion. Moreover, we also present the optimal solutions of some selected examples in Appendix \\ref{as1_app}. Note that the optimal solutions of the nonlinear case always are better than the linear ones. In addition, some initial values which are not regular to the given function may be preprocessed by our preprocessor (see Section \\ref{sec:preprocessor}) to obtain the optimal solutions for comparison with the N"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbations in the self-consistent equations. The study of self-consistent equations is usually focused on the stability of the stationary solutions, when the $g_j$'s are assumed to be small. However in reality, outside of the domain $\\Omega$, the background medium interacts with the embedded embedded boundary and affect the solution profile at $\\Omega$, causing spontaneous perturbations even when $g_j$'s are neglected. Such effect is captured here by including a source term in the Navier-Stokes equations $\\sdg = \\sdg_{j}$, describing the incoming perturbations due to the interaction of the domain with the background medium. The source term is formulated using the domain border layer (DBL) theory of McBride and Goldstein, recently extended to the $N$-dimensional case by the present author \\cite{MR2426933}. When the background medium is air, the DBL theory provides rather sharp prediction of the incoming perturbation, rather than merely stability analysis. Earlier studies on stability of embedded embedded boundaries included interaction with the background medium outside the DBL region \\cite{MR"
        }
      ]
    },
    {
      "train_loss": 0.0358515625,
      "update_counter": 2750,
      "epoch": 88.6774193548387,
      "val_loss": 6.5390625,
      "val_entropy": 0.6470947265625,
      "val_ttr": 0.60546875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $20$ $K_\\odot$ and $10$ $K_\\odot$ cases, the evolution of various quantities corresponding to the two configurations described in Section~\\ref{compareConfigurations} along the lifetime of the systems. These figures aim at providing a general overview of the evolution of the relevant features of the stellar interiors and surfaces, presenting both density, core mass fraction, surface helium and hydrogen concentration and surface luminosity from the beginning to the end of the stars' lives. To do so we showed the time-evolution of the following quantities for both $10$ $K_\\odot$ and $20$ $K_\\odot$ stars following the configurations identified in Section~\\ref{compareConfigurations}: $(i) $density$\\ $(11)$; $(ii) $core mass fraction$\\ $(13)$; $(14) $surface helium concentration$\\ $(17)$; $(15) $surface hydrogen concentration$\\ $(16)$; and $(16) $surface luminosity$\\ $(22)$. In the case of the $20$ $K_\\odot$ stars"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy problem of evolving a given initial state of a fluid on $[0:T]$ by a finite-dimensional solver, resulting in models $\\Omega \\in \\mathbb{R}^{T}_w$ defined by the Euler equations. The resulting solution is $\\in \\mathcal{S}_\\Omega$ with $|u_T^2-1|$ small. In this case the finite-dimensional solution is able to predict the same underlying state on future meshes $T \\leq i \\leq T + \\mathcal{T}_{\\text{a priori}}$ with $|u_i^2-1|$ small even for $i = T + \\mathcal{T}_{\\text{a priori}}$. In particular, we observe that the a posteriori error estimator works even if we integrate over a grid which is not specifically chosen with respect to $\\Omega$. We observe in Table~\\ref{fluid_residuals_uniform_neq} that if we allow the solution to run in two directions (i.e., to evolve in time as well as in $\\mathbf{u} \\rightarrow \\mathbf{u} +"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". First, the problems have to be solved in order to obtain desired solution accuracy. Second, the order of accuracy of the problems differ and hence, it is recommended to use a time-stepping scheme for the fluid problem of order $6$ or higher and a multirating scheme for the solid problem such that the low-rate subproblems are of order $5$ or lower. As shown in the section on adaptive schemes, the output of the solid problem is obtained as a ``mixing output'', i.e., it contains information about the motion of the body as well as the body's position. While such features are a consequence of the problem, the multirate time-stepping scheme allows adapting the time step for fluid and solid such that the error in the time-varying position of the body reduces with each time-step. This is shown in Figure \\ref{fig:multirate_output}, for a simulation of a roadside vehicle monitoring scenario. The upper left panel depicts the time evolution of the error in the body's position for one time step. The lower left"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the ground floor. After \\(600\\) scans, the error still remains this high. However, on the second and third floors the errors on the ground floor are reduced on the PRD model from \\(9.3\\%\\) and \\(11.75\\%\\) to \\(2.85\\%\\) and \\(2.37\\%$,\\) on the SAIC model. On the CT model, the errors on all the floors are reduced but on the second and third floor the errors are still high such that the average error is \\(6.9\\%\\) and \\(10.85\\%,\\) respectively, for the SAIC and CT models. However, by \\(600\\) scans on the second and third floors on the PRD model \\(95\\%\\) of the items are located correctly. After \\(600\\) scans, the average path coverage for the CT model is \\(95\\%$ of the paths but only \\(31.3\\%$ of the routes. As the path coverage increases to \\(100\\%,\\) the route coverage varies between"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "Apr05} it is shown (see especially Fig.4) that if the structure of the CME is preserved and it is observed once by {\\emph{STEREO}}-A, then it has several different sources, and the \\ifmmode\\mathrm{Ly}\\alpha\\else{}Ly$\\alpha$\\fi{} source is the most likely one (see the Definition 2 in \\cite{Valgushev:2015Apr05}). In such a case, the CME is better described as a remote-source CME, which travels totally different along both 1p-CMEs in such a way that {\\emph{STEREO}}-A observes them only after a different trajectory. In \\cite{2019ApJ...864...21C}, a different technique is used to distinguish remote-sourced and one-source CMEs, i.e, examining the evolution of magnetic fields rather than the trajectory. In this case, the magnetic field configurations resembling those of a CME and those of a subsequent 2p-CME are found to be distinct, which also confirms"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,BO,Zang,Lin_2011,Rinfor,Marin,QIU} and the references therein),  the shape and content of this condition may be drastically different with respect to the space on which it is defined.  In this view, the smoothness assumptions of models 1 and 2 are not sufficient to guarantee a unique coalition selection with respect to a different (or not fixed in advance) choice of space $\\mathcal{X}$. In the following, we will systematically remove the subspace $\\mathcal{X}$ on which the smoothness assumptions 1 and 2 are defined, and show that this space is actually a random phenomenon.  Consider the general model 2 with independently identically exponentially distributed potential benefits $u_j(t) \\sim exp(t)$ and assume that the human subjects are healthy when admitted to the MNAS. If the only developments are treatment, stay in the MNAS and elevate to higher floors, it is guaranteed that the functional range of the indicators $X_j(t) = g(t(), \\theta"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gamete fitness values. For the coevolutionary game we explore here, the best rules are those that exploit even if this behavior is not beneficial according to standard behavior. This unexpected dependence of iteration performance on \\emph{dynamics} (how rules are transmitted from one generation to another) is emblematic of the nature of social evolution and has attracted much recent interest (see \\cite{Balans2015,Balans2016,Balans2018,Balans2020} and references therein). In our previous work \\cite{DT2011}, we noticed that the Landweber iteration has a spurious local minimum in the fitness landscape of our game. This arises due to regularizing terms in the mathematical formulation of the algorithm (persistency across generations for a fixed proportion of genes) which badly model the social dynamics (which transmit rules but lose some genes in the process). The result is a rule that never inefficiently \\emph{defends} its own territory but nonetheless has a local peak in fitness (fig.~\\ref{fig:"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " based on a finite subset of the basis vector evolving in a local neighborhood. In this way, one can be less sensitive to the noise and the chosen subdomain in the sense of not including the whole space. We have proven that the local convergence theory is still effective when the local neighborhood contains \\emph{important volumes} of the basis with termination of the convergence from the rate $\\sqrt{T}$ at any time $T$ (see Fig. \\ref{fig:local convergence}). This conclusion is better than some existing convergence theories in the linear case or the nonlinear case without the use of noise-resistant basis. For example, the convergence theory in the linear case includes the noise and the termination of the convergence rate is rate $1/T$ (see Fig. \\ref{fig:linear convergence}). In contrast, only using the basis set shown in Table \\ref{tab: polybasis }, the noise resistance property of the P-BFS can achieve the termination of the convergence rate $\\sqrt{T}$ (see Fig. \\ref{fig:linear noise resilience}), which is faster than the rate"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small random perturbation such that $g_{N-n}$ is the number of points lying in $\\Omega$ close to the boundary. If $P$ is a ball in $\\mathbb{R}^N$, then $g_{N-n}=0$ and $P$ is simply a perturbation free ball of radius $R$. However, in our model we assume that $g_2>0$ only in a subdomain $\\Omega_1$ of $\\Omega$ where the perturbation $P-R$ is far from the boundary.  In the following, we define the basic perturbation-free problem from which our semi-discrete problem in Eq.~\\eqref{eq:nal-problem} is derived, as well as the problem to which our discrete solutions apply. We consider $P$ to be \\emph{far} from the boundary when $P$ and $\\partial\\Omega$ have different normals, i.e. when the orientation of the boundary changes when looking at the interior of $\\Omega$ from outside $\\Omega$. We set $P$ to be \\emph{near} the"
        }
      ]
    },
    {
      "train_loss": 0.071015625,
      "update_counter": 3000,
      "epoch": 96.74193548387096,
      "val_loss": 5.29296875,
      "val_entropy": 0.82666015625,
      "val_ttr": 0.599609375,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $\\ell$ algorithm and the $\\omega$ algorithm, the correlation between the PVI estimated from the configuration of the signal detected at each sensor and the PVI calculated directly from the signal itself. In Figure~\\ref{comparison_whole_timeline_configuration_1} the colored dots show the correlation for each pair of simulations obtained by varying the detection duration $D$ at each sensor (here $D = [0.25, 0.5, 0.75]$h) while the lines correspond to the correlations that would be obtained by simulating all the possible combinations of $D$ and the measurement noise. Notice that only the correlations higher than 0.7 are shown (green circles with indicator above 0.7). In Figure~\\ref{comparison_whole_timeline_configuration_2} the duration of the measurement $D$ at each sensor is shown, but also the direction of the signal ($\\theta$). Here, the detected signal was notices with either no additional information (no noise) or including noise. In the case of $\\ell$, it is observed"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple model of problem~\\ref{eq:problem_1D} with a fixed boundary condition and periodic solution. The left column is for the case that the time step is uniform, while the right column is for hybrid time stepping. As expected, the a posteriori error is highest at the end of the time step as the solution is not smooth at this point. It decreases quickly on the first hybrid time step and then again slightly on the second hybrid time step due to a stabilization of the error. However, in this problem model there is only one critical point and hence no second hump in the a posteriori error. It thus concludes that our estimator is indeed robust and even quite reliable even if we use uniform time meshes. In Table~\\ref{fluid_residuals_uniform_neq} we compare results for the same problem with uniform and unequal time meshes. The time steps are such that the estimator is valid. This leads to a large number of very small iterations and thus quite some computation cost even if we observe that the solution is in a corner quickly. Note that we perform the estimator at each"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". A multirate time-stepping scheme with adapted step sizes for fluid and solid is presented in section \\ref{sec:multirate}. Based on the analysis in section \\ref{sec:problems} and simulation results shown in figures \\ref{fig:experiment_time} and \\ref{fig:experiment_pressure}, a multirate algorithm is designed in section \\ref{sec:algorithm}. The algorithm is implemented as a concurrent program using the shared memory approach. In the approach, an arbitration layer is added between the central processing unit (CPU) and concurrent data structures to read and write data held by multiple cores/threads. Each thread is assigned to solve a problem of one type (fluid or solid) at a certain rate. Based on this approach, every thread solves its problem at a time scale corresponding to one fluid time step. The algorithm is further optimized in section \\ref{sec:optimization} by applying several novel techniques, including a number of scalar temporary variables, probe and target memories with bidirectional copyable streams, novel helper chains for frequently used operations, and extraction into low"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "26\\% on the second floor, after scanning. In $3.6$ GHz, however, there is no such difference between one-frame and scanning acquisitions, as shown in Fig. \\ref{fig:fig4} because the main localization error for that channel is between floors. In $5.8$ GHz, however, significant differences are observed, as shown in Fig. \\ref{fig:fig5}, because the main localization error type in this channel is between symbols. Thus, one should note that although scanning improves localization throughout the building, there is a channel in which it actually increases the average error. This is channel $2.4$ GHz, shown in Fig. \\ref{fig:fig6}, where we plot the correlation between the GPS timestamp and the average number of measurements that are required to localize the signals. We can observe that even though the measurements on the first and third floor are sufficient for reliable localization, the measurements on the second floor is Corridor dependent and therefore, requires scanning. We can also observe that the overall correlation is very high on low and medium"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "April22} it is shown that if the SEP enhancement is sustained for the duration longer than the transport time (in the event the S/C is fixed) or transit time (if the S/C is mobile) then the optimization procedure would have produced a shorter duration but equally energetic SEP enhancement. This would result in an incorrect control function profile and incorrect statistics of SEP intensities during the event. Moreover,  in several previous studies including \\citet{2020ApJS..246...29G}, the duration has been found to be significantly shorter than the solar cycle duration of 11 years. Since the SEP intensities well follow magnetic field strength both during the CME eruption and during the subsequent SEP events, it is evident that the occurrence rate of solar wind events plays an important role in the decision making process and sets the probability of an S/C receiving an energetic particle event. An empirical formula of this occurrence rate has been found through an extensive statistical analysis \\citep{2017SoPh..292...63H,2019"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKOE,Charkhal:2019oyc,Karakash:2020mrs,Karakash:2021ppv,Rabanal:2021pge}, in greater generality than presented below), there is a more elegant geometric interpretation of this condition, which was introduced by Bia\u0142y and Kaltenbacher \\cite{Blyakhal:2015yto}, and further studied in \\cite{KaltenbacherKOE,Tolley:2017eua}. This point of view leads to new insights and BLAK has become an important tool in quantum topology. A part of the paper will be dedicated to this approach and it will be shown that the constraints of the tangential cone condition can be derived naturally from a minimal area problem. As in the algebraic approach, the first non-trivial singularity of $SU(n)$ type is found and observed to have a special shape. For a given even value $t$ of the parameter $y$, the coordinate dependent group parameter $x="
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the J-function derivatives. For the Landweber iteration the efficiency of the derivative estimates depends on how quickly the rule iterates move between different local minima. For the starting initial hypothesis this efficiency is very high because the rule iterates move quickly between different local minima. However the landing point of rule iterate always has access to all the information available on the hypothesis state, including the matrix and Hungarian problems that define it. This landing point always has a much more accurate hypothesis than the rule iterate left to the first local minimum. This is most clearly observed by replacing the European migration rate with a much faster rate for the initial hypothesis. In this way a spurious first local minimum is generated early in its history. However, each time rule application selects the landing point as its estimate for the best possible state, it also performs a heuristic rejection of the smaller values of $\\ell(\\hat {h}_k) / \\ell({h}_k)$. As a result each subsequent Landweber rule application selects a hypothesis state that is clearly better than the one returned by the naive application of the rule to the modified hypothesis with a reduced migration"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each function of a vector-valued sequence, meaning that one can neither consider the convergence of the vector as a whole or a specific function in the vector. The establishment of convergence conditions and convergence speeds are thus strictly local in the nonlinear case, i.e., the norms of the involved matrices are not changed and the coefficients of the polynomials only change in a finite neighborhood of the original one. See the Related Work for more discussions. Based on the established local convergence theories, we also present several proofs of the convergence theories in the Appendix. Considering the nonlinear case is the most challenging part of this paper since the basic framework does not apply. To solve the problem, we first observe that the basic framework can be reformulated by replacing the diagonal matrix in the linear case with the identity matrix in the nonlinear case. This new formulation keeps the linear framework but keeps the linear norms while changing only the polynomial functions from linear to nonlinear. Thus the convergence theories in the linear case can still hold this new formulation. We can then apply the convergence theories in the linear case to prove the convergence theories in the nonlinear case. However, there"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noisy measurement. The domain may include time-dependent parameters and be split into several environments, each of dimension $n$. We assume that $u$ and $\\Delta u$ are periodic in the environment dimensions and the measurement values $g_j$ are sampled from some noise distribution and corresponding covariance matrix. We model the environment as influenced by $N-1$ independent simple stochastic processes and a single unknown dynamic process, respectively. Similar models have been studied in problems such as \\cite{Adler2009,Ming2012,Zhou2015,Hosono2016,Okita2017,Wu2021}. Under our system configuration, we identify the environment as a multi-dimensional Brownian motion, $B_{j,a}$, $j \\in \\set{1, \\ldots, N_a}$, $a \\in \\set{1, \\ldots, n}$ and the unknown process as fractional Brownian motion, $B_{i,s}$. In each dimension $j \\in \\set{1, \\"
        }
      ]
    },
    {
      "train_loss": 0.04494140625,
      "update_counter": 3250,
      "epoch": 104.80645161290323,
      "val_loss": 6.177734375,
      "val_entropy": 0.673828125,
      "val_ttr": 0.583984375,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $\\ell$-loop configuration and the $\\lambda$-controlled configuration, the evolution of the state for the system initialized in the state $\\frac{1}{n}$ along with the performance of the TRUKE algorithm for two different choices of the waveform parameter $p$. In the $\\ell$-loop configuration (see Figure~\\ref{comparison_whole_timeline_configuration_1}), for $p = 10$ we observe that the algorithm is able to recover the initial state for $\\ell = 15$ and $T = 100$ even for high $n$. However, for $n = 10$ and $p = 10$, the algorithm is not able to recover the initial state for all the loops. When the waveform is set to $p = 100$, the algorithm is able to recover the initial state for all the loops and for both values of $n$. A graph that shows the error against the number of loops for $n = 10$ and $p = 10$ is shown in Figure~\\ref"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy problem of solving $-\\nabla \\cdot \\alpha \\nabla u = 0$ on $\\mathbb{T}_i = \\left[-\\frac{i-1}{2}, \\frac{i}{2} \\right]^2$, $i = 1 \\cdots 4$, all squares with area $2^4 = 25$ meeting the assumption that the time mesh is uniform. In this case the solution is linear and we know it exactly. In this case the a posteriori estimators are almost optimal. However, the estimators are not simple quantifications of the $H_0$ error, but additions of residual norms. We explain this as follows. Since we use a generalized eigenvalue problem, our solution can be any function of the linear problem. Thus there are no error terms between the actual solution and the one corresponding to $\\Omega = \\mathbb{T}_0$. However, there is an error between the solution in the new solution space $\\mathcal{T}$ and the solution in $\\mathcal{S}$. This is particularly annoying on the last time mesh because the residual norms"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall time step is calculated as follows: $| \\mathrm{overall} \\: \\mathrm{time} step |_T = \\frac{T_\\mathrm{fluid}}{M_{\\mathrm{fluid}}} \\cdot \\frac{T_\\mathrm{elonga}} {M_{\\mathrm{elonga}}} \\in [\\frac{1}{M_{\\mathrm{fluid}}} , T]$, where $M_\\mathrm{fluid}$ and $M_\\mathrm{elonga}$ are the number of fluid and elongation steps respectively and $T = \\frac{T_\\mathrm{max}}{N_\\mathrm{steps}}$ is the time which the simulation is to be finished within. $|_T$ is any suitable normalization of the overall time step such that $ |_T 1 = T_\\mathrm{fluid} $ and $ |_T M_\\mathrm{fluid} = 1$. This is bounded by the time limits that impose elongation step size as $ |_T \\mathrm{elonga} time |_T \\le 1$ and impose fluid time as $ |_T T_\\"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "36\\% on the second floor in $2.4$ GHz when noise is present. This error is reduced to 11\\% when the \\ac{DMA} is enabled in $5.9$ GHz. Similarly, in $1.7$ GHz, the error on the second floor without the \\ac{DMA} is 30\\% and it is reduced to 9\\% when the \\ac{DMA} is enabled. Thus, the \\ac{DMA} significantly improves the localization accuracy on different floors in \\textit{pulsed Infill} configuration. On comparing the performances of different frequencies, the combined use of \\textit{2.4 Infill 5.9 reserve} and \\textit{2.4 Infill 5.9 reserve 1.7 infill} configurations, yields the best results on different floors in \\textit{infill} configuration with and without the \\ac{DMA}. However, these two configurations are not equal in performance so that \\textit{2.4 Infill 5.9 Reserve 1.7 Reserve} which is"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " AcceptedJGR} it is shown that the equilibrium of CMEs with a source has a generality of manifestation in the dynamics of the structure changes rather than in a trajectory permanence condition. Indeed, the position in the solar system changes and the structure changes enough during the passage from the Sun to Mars, so the same configuration with a different position is possible in the different epochs. Moreover, the configuration is likely to correspond to a sphere-shaped structure with a probability rate higher than half above $18 \\rm MK$ \\citep{Shi:2007ApJ,Valev:2013ApJ}, since, with the same initial volume, a larger constraint from the\u78b0\u649e effect (counter-pressurization) against expansion leads to a more stable configuration for a high pressureemic structure. Thus, the much more likely scenario for this event is a spontaneous termination and subsequent collapse conversion, which naturally removes the original pressurized shell  and launches a shock. Moreover, the signature of this scenario is a nice linear correlation between the radial velocity of the switchbacks and the distance from the sun (see"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,Kosevich,Donatelli,Zelditch}), there is a different condition that turns out to be very important in-situ during the evolution of a cosmological large-scale structure formation simulation: the transverse cone condition. The reason for this is that this condition allows one to establish a one-to-one correspondence between the cosmological large-scale structure formation process and a specific initial value problem for the Einstein equations (see \\cite{Fang:2007dm,Fang:2007gast,Fang:2008dma,Michler:2008zk,Sliwa:2009pq,Sliwa:2010zk,Voronov:2011zz,Voronov:2012gga,Roder:2012stz,Roder:2013nfa,Roder:2013jaf,Roder:2014pdat,Roder2014JASTP"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the R-values. For the Landweber iteration, it is worth noting that, in order to obtain the optimal update rule by breaking the symmetry of the condition number, we have to discard this first spurious minimum. Indeed, in Figure \\ref{fig:local_min_landweber} we compare the function $f_{\\text{Landweber}}$ with $f_x=x^2/2$. Obviously, $f_{\\text{Landweber}}$ has a spurious local minimum at $x_0=0$ while $f_x$ does not (for all $x$). Therefore we conclude that the optimal update rule $\\text{Landweber}$ leads to a better performance than the simple iterative method $\\text{Simple}$ explained in Section~\\ref{sec:operator} which has no symmetry breaking. For the global optimization problems, we want to solve the system of problems which has the maximum objective value. Therefore, we have to jump out of this local minimum which is far from the center of optimization problems. The effective performance of different iterative methods can be accurately estimated"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each direction independently. It is well-known that the convergence of the linear case is the global one. For instance, if the linear space $\\mathcal{V}_0$ is a vector space with addition and the linear function $F$, then not only $\\langle\\mathrm{P}\\mathbf{v},\\mathbf{v}\\rangle\\leq \\mathbf{v}^T\\mathbf{v}$ but also $\\langle\\mathrm{P}\\mathbf{v}+\\mathrm{P}\\mathbf{w},\\mathbf{v}+\\mathrm{P}\\mathbf{w}\\rangle\\leq (\\mathrm{P}\\mathbf{v}+\\mathrm{P}\\mathbf{w})^T\\mathrm{P}\\mathbf{v}+\\mathrm{P}\\mathbf{w}^T\\mathrm{P}\\mathbf{w}. $  Since the convergence of each direction independently, we also call this feature as only local convergence. In practice, we may consider a function $F$ which has some small difference from the linear case, i.e. $C\\sim0$ for some small number $C$. The theoretical results in"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a bi-directional driving force (the subscripts ``J`` and ``i`` are applied to the notation $g_j(\\mathbf{x},t)$ to indicate the presence of either a single-directional (destructive competition) or dual-directional (building cooperation) driving force on the survival and reproduction of the model in the corresponding direction). The letter $i$ denotes the direction where the unit $i$ is located, and the letter $j$ denotes the direction where the driving force is present. The letters $J$ and $i$ only apply to the bi-directional driving force component of $g_j$, which models the driving force on the survival and reproduction of the model in the corresponding direction. The mean density function $f_j(\\mathbf{x},t)$ satisfies Assumption \\ref{a1} with a  natural parameter condition like Assumption \\ref{a3} for models of the form \\ref{s1.i} and \\ref{s1.j}. Similarly, the mean function $f_j(\\cdot)$ satisfies Assumption \\ref{a2"
        }
      ]
    },
    {
      "train_loss": 0.035162109375,
      "update_counter": 3500,
      "epoch": 112.87096774193549,
      "val_loss": 6.3828125,
      "val_entropy": 0.650634765625,
      "val_ttr": 0.5940755208333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $1$-dimensional scenario and the $2$-dimensional one, the evolution of the 2 most important variables ${\\phi}_{\\rm cav}$ and ${\\phi}_{\\rm dam}$ as a function of time for the configurations $(c,k)=$(2,1), which correspond to the ones originally used in the simulation studied by \\cite{1}, and the two sets of configurations listed in (\\ref{new_c_k_1_1d}) and (\\ref{new_c_k_2_1d}), respectively. As can be seen from Figure~\\ref{comparison_whole_timeline_configuration_1}, although the time-evolution profiles of ${\\phi}_{\\rm cav}$ and ${\\phi}_{\\rm dam}$ for the set of parameters $(c,k)=$(2,1) and for the set of parameters constructed following the procedure described in \\ref{new_c_k_1_1d} are almost identical at the end of the simulation. This suggests that configurations $(c,k)=$(2,1) are not sensitive to the addition of"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple model of frictionless flow on a circular domain with a star-shaped node on the boundary. We initialize the solution $u$ from Table~\\ref{flock_init} and compute time meshes $T_{k+1}$ by LES method \\eqref{elastic_LES} with $\\Omega_{er}=0.5\\Omega_c$, $E_c=10$, $E_p=2.5\\times 10^{-2}$ and $T=0.25$ for $k=0,1,2$. In the 3rd line of the table we highlight the case when we set additional time mesh $T_{1,1} = 0.05$ to get a precise initial time mesh $T_{1,0} = 0.05$. As we can see the estimator is very accurate and covers the error $FE_k \\approx E_k$ for all time meshes. Its precision is almost independent on the refinement level $k$. In addition, we observe a series of high-accuracy point"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall time step is calculated as follows: $| \\mathrm{overall} \\: \\mathrm{time} step | := \\left( \\frac{H_2}{H_1 + H_2} \\right) \\cdot \\min \\left( |\\mathrm{time} \\: \\mathrm{step} |_{Flux} , |\\mathrm{time} \\: \\mathrm{step} |_{Extrud} \\right)$, where $H_1$ and $H_2$ are the total numerical Hamiltonian for fluid and solid respectively separated into real parts by basis conventions and their order. Notice that the time step is adapted depending on the amount of flux through the HL problem's boundary conditions and the extruder load as well as the cutting force on the workpiece. Furthermore, as the total time step is divided into equivalent time steps for fluid and solid such that each row of the HL problem has the same time step \\cite{Potjere2019InDOPT}, the time steps for fluid and solid must be adapted in a coordinated manner. Thus, while evaluating the total"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "36\\% on the second floor in $2.4$ GHz to as low as $1.3\\%$ on floors $1-4$ in combined training. Scannings also help in the algorithms to find the correct floors for new items. In the $2.4$ GHz scheme, it reduces the average iteration to $9.0$ from $30$ in static scheme. The number of subwindows on each floor remains the same. The number of channels in a subwindow is shown in Table \\ref{tab:frequency}. In the $90by90$ grid, we keep $9$ grids for ceiling and first floor and $8$ grids for each other floor. We calculate the average delay to reach a floor at the store. The maximum delay is $28$ seconds for second floor and $9$ seconds for other floors. The average delay for $2.4$ GHz scheme is $3.8$ seconds and for $50K$ scheme is $5.3$ seconds. The number of items that are detected in the store is $"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " AcceptedJASTP} it is shown (see there for details) that the equilibrium of the energy current flux cannot exist if the escaping CME energy is continuously injected into the Solar Wind. The reason is that the escaping CME energy causes the compression in the direction of the CME departure and heat up the region behind the eruption in the opposite direction \\citep{Tayfun:2011ApJ,Leslie:2011ApJ}. The latter effect is stronger if the compression scale is similar or smaller than the injection scale (instrument resolution), closer to the Sun. In this case, the heating is distributed evenly along the compression scale and the flux of the solar wind from the interior of the compression is higher than the external flux. Moreover, the internal solar wind speed is slower than the external one \\citep{Tayfun:2011ApJ,Leslie:2011ApJ,Valgushev:2015 AcceptedJASTP}. As a result, the inner boundary of the compression has a higher temperature (high solar wind temperature means high electric field) than the external boundary"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,Kosevich,Donatelli,Zelditch}), there is a different situation that is more relevant for our discussion, which is illustrated in \\cite{Zelditch}. There it is shown that the triviality condition \\eqref{conditionalJID} is not for every $d$ and $n$ necessarily true even when the external structure of the space is fixed. More precisely, for a sequence of connected topological spaces $\\mathbb{S} = (\\mathbb{S}_0, \\mathbb{S}_1, \\ldots, \\mathbb{S}_n)$ with $\\dim(\\mathbb{S}_0) = d-1$, $\\dim(\\mathbb{S}_i) = d$ for $i \\in \\{1,\\ldots,n\\}$, and $\\dim(\\mathbb{S}_n) = d+1$, where each $\\mathbb{S}_i$ has dimension $d$, there exists an $X \\subseteq \\mathbb{S}_0$ such that $\\mathop{\\mathrm{cd}} (X, \\mathbb{S}_d"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the R-values. For the Landweber iteration, it is worth noting that, in order to obtain the global minimum, it does not matter if the first R-value is spurious; in fact, at each iteration $i$, starting from the initial two-times population, generated by the first iteration of the method, the effective R-value at node $u_i$ at iteration $i$ is the redistricted population $P^{(i)}_{l(u_i)} + P^{(i)}_{l(u_i)}^\\partial$, where $l(u_i)$ is the set of neighbors of $u_i$. This is because, by the construction of $P$, the population $P_{l(u_i)}$ is strictly larger than $P_{l(u_i)\\cup\\{u_i)}$. Moreover, if $l(u_i)$ is disconnected then $P^{(i)}_{l(u_i)^\\partial}$ is strictly larger than $P_{l(u_i)^\\partial}$. Thus, the effective R-value of"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each parameter space dimension independently. It is convenient for implementation and analysis. Assume $s$ is the dimension without convergence and the dimensions with convergence are $r$ and $u$, i.e. $s+r+u=3$. Let $r=u$, then there is a collection of reference solutions respecting $(r,u)=1$, e.g. see Figure \\ref{fig:ref_image_nonlinear} for the nonlinear case. We seek the guarantee that the algorithm outputs are closer to the reference solution $x^{(k)}$ in dimension $s>0$. The algorithm outputs in dimension $r$ and $u$ are clearly decreasing and the one in $s$ is increasing. We can define the function with respect to the dimension $s$ to be the difference of the initial data and the reference solution in this dimension, negative if smaller and positive if larger. It can be guaranteed that the direction of change of this function is consistent with the direction of the output dimension $s$ using a proper correction strategy. This is implemented in Algorithm \\ref{alg:correction_"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a bi-directional driving force (the subscripts ``$\\bf{j}$'' refer to the direction of the driving force, and the superscript ``$q$-periodic\" signifies that the function is $Q$-periodic). The boundary value condition for the coefficients $K_{j^{\\bf{x}}}$ and $K_{j{\\bf{t}}}$ is defined in Eq.~\\eqref{eq:defBoundCondition} below. The value of the driving force $g_j$ corresponds to a physical situation where $q$ is the measurement frequency, and the direction of the driving force is determined by the measurement protocol. The parameter $k$ in Eq.~\\eqref{eq:Hamiltonian} denotes the intensity of the driving force $g_j$. The value of each driving force is dependent on the result of data processing operations obtained at the boundary of the domain $\\Omega$. For example, the data processing operation ``1 second average\", obtained from human measurements, is defined as $g_{j^{\\bf{x}}}=k \\sum_{t=1}^{1\u79d2\u949f} u_{j^{\\bf{x"
        }
      ]
    },
    {
      "train_loss": 0.034779296875,
      "update_counter": 3750,
      "epoch": 120.93548387096774,
      "val_loss": 6.546875,
      "val_entropy": 0.64599609375,
      "val_ttr": 0.6077473958333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 9) (14, 9) (15, 9) (16, 9) (17, 9) (18, 9) (19, 9) (20, 9) (21, 9) (22, 11) (23, 11) (24, 11) (25, 9) (26, 9) (27, 9) (28, 9) (29, 11) (30, 11) (31, 11) (32, 9) (33, 9) (34, 6) (35, 6)"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both configurations, the evolution of the optimal solution of (Ref.~\\cite{Coradelli2006}) for the initial values that are at the initial point $q_{initial}$ (see Table~\\ref{tables}) and for the value of the control variable $\\mathrm{control}$ = 1.5 that corresponds to the optimal control of the system as given in Table~\\ref{tables}. In this way, even though the DFS is not implemented, the evolution of the solution of (Ref.~\\cite{Coradelli2006}) is shown, from the initial point where the observations are made, in order to be able to analyze the evolution of the optimization over the whole time path. Notice that in Figure~\\ref{comparison_whole_timeline_configuration_1}, the solution of (Ref.~\\cite{Coradelli2006}) is quickly optimized, but in Figure~\\ref{comparison_whole_timeline_configuration_2} this is not the case, because the PFC is not implemented. In both figures, the solution starts at the initial"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple model of problem~\\eqref{eq:problem_arE}_3 when $\\lambda = 10^4$ and $\\lambda = 5\\cdot 10^4$. In this case the a posteriori estimator is computed using the same fluid domain with the time discretization by \\textit{Pietermaritzburg} and \\textit{Lagrangian} methods. On grids, which are large by relative standards (second row of Table~\\ref{fluid_residuals_uniform_equal}) the estimator performs well. In particular, the error reduction rate provided by the estimator is similar for both methods. This may be explained by the lack of noise in the discrete problem in this case. On grids, which are small by relative standards (first row of Table~\\ref{fluid_residuals_uniform_equal}) the estimator performs poorly. This is to be expected since the a priori estimates are not reliable in this case. In the last two rows of Table~\\ref{fluid_residuals_uniform_equal} results of the a posteriori estimator are shown for the \\"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall time step $\\tau$ is divided into $\\tau_F$ and $\\tau_S$ such that $\\tau = \\tau_F+\\tau_S$ is divided into steps of equal time in fluid and solid, i.e. one time step in solid is equal to $4$ time steps in fluid. The time steps $\\tau_F$ and $\\tau_S$ are adapted separately for fluid and solid such that the solution conditions on the boundary of the fluid-solid interface are used to determine them. This is possible because the solutions are solved in separate submodels with different times scales. Furthermore, a finite value $\\delta$ is defined for the scale measurement in solid. We have the estimate that $\\tau_F \\leq \\delta < \\tau_S$ should be satisfied. Indeed, we have $\\tau_F + \\tau_{S+1} < \\delta$ and $\\tau_F + \\tau_{S-2} > \\delta$. In general, higher the scale measurement time, slower the solid time step should be. A higher time scale"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the second floor. After \\(100\\) scans, the average error for all floors is found to be \\(1.85\\%\\) for the contextual information, and \\(1.65\\%\\) for the geometric information, showing that the number of scans has a significant impact on the performance of CFA. The with ``shift\" in Eq.~\\eqref{eq:fc_shift} on the mean of received frequency companders, i.e., the shift factor, is shown in Fig.~\\ref{fig:fc_shift}. Although the impact of this shift factor on the average error between the actual channel entries and the computed TFs is not significant, the accuracy of the computed shifts is critical for the accuracy of CFA. In Fig.~\\ref{fig:fc_shift}, the arrows represent the average power on each subcarrier. It can be seen that the average power is concentrated on the center two subcarriers (the last row of racks) for all frequencies, which means that the average power on different subcarriers differs, but the average power on the same sub"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " AcceptedJASTP} it is shown (see there for details) that the equilibrium of the energy current flux (caused by the equilibrium of the solar wind) cannot be posed with a single source at the Sun as was done by \\citet{2010ApJ...714..521T}. The so-called {\\emph{source-cutsource-cut}} model (where a CME starts as a source at the Sun and then cuts another CME which cuts by panels of {\\emph{PSP}}) is\u534a incorrectly predicted the energetic flux balances in the solar wind and therefore cannot exist in reality. Moreover, as shown in \\cite{Valgushev:2015 AcceptedJASTP}, the energetic flux equilibrium cannot be posed with a single CME, as was done by \\citet{2010ApJ...714..521T} either. Instead, the energetic flux equilibrium implies that all the meteors and CMEs in the solar wind populating packagings have to share the energetic flux from the Sun therefore, every CME has to cut other CM"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKosevich,KosevichSiegl,Mustafoev2012,MustafoevBook,Donat1980,F4,Zhidkov}, as well as in the study of the residue (see \\cite{MustafoevResidue})), there are many other aspects of the tangential cone which remain studied less frequently. A major issue in studying the tangential cone in a non-abstract way is to establish explicit representations for it, as it is done for the cone $\\mathcal{C}_1$ in refs. \\cite{KaltenbacherKosevich} and \\cite{Donat1980}. Here, we represent the tangential cone $\\mathcal{T}_{\\tilde k}$ in term of the $\\mathcal{C}_1$ profiles of the $k= \\tilde k$ copy of the subsystem $\\mathfrak{sl}(2,\\mathbb{R})$. We show that the coefficients of the $F$-series and the $\\overline{F}$-series of the multiplet $\\tilde k$ are"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the R-values, which are obtained by applying suitable interpolation or approximation methods. The exact first R-value $R^{(0}_{k)}$ for an individual piece of data $\\mathbf{x}$ has an accuracy that depends on the sampling procedure and the maximum dimension of $\\mathbf{x}$. By default, in our numerical experiments, we sample the effective data set (or super dataset) by applying a simple uniform sampling on the effective space of rules of the lattice $\\mathcal{L}_0 \\times \\mathcal{L}_1$, and the effective accuracy is that of a normal rule with $b=10$ rules per dimension and a step size $\\tau=0.01$. This means that the first R-values $R^{(0}_{k}$ for the effective data set are obtained by applying the system of equations \\eqref{eq:SA-system} with these parameters, and by default, for our preliminary tests, we notice that the system has just one positive stable limit cycle, which moves in the upper lattice $\\mathcal{L}_0$, and then finds the system of equations"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each neighborhood of a given reference solution. In contrast, the linear case is a global one. In Fig.~\\ref{fig:illuminated_surface}, we compare the nearest neighbor sets of the linear and the nonlinear cases with a given reference solution $y_i^{(0)}$ (the solution is public in the GLODIN computations). The sets are all of the same size but the nonlinear case is much larger. This illustrates the local convergence statement in the nonlinear case. \\emph{Only in the generalized case. } In the generalized case, the sequence $\\{asers(y_i^{(0)})_{i=1}^n\\}$ is a semi-algebraic set consisting of all collections of $asers(y_i^{(0)})_{i=1}^n$ that are consistent with the given finite approximations $y_i^{(k)}$ on each grid. It is worth emphasizing that the semi-algebraic set $\\{asers(y_i^{(0)})_{i=1}^n$ has only linear dependencies over semi-infinite grids. In contrast, in the finite dimensional case, the sequences"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a bi-directional forcing term, which can be interpreted as a driving force that generates different directions in the boundary. The $g_j$'s are often chosen to be positive in order to generate a growing phenomenon due to the fact that negative perturbations would lead to stagnation. The boundary conditions are listed as $u=\\nu$ on the boundary $\\partial\\Omega$, where $\\nu$ is some noise constant adapted to a Lebesgue measure on $\\partial\\Omega$. We assume that there exists a unique solution to (\\ref{eq: primal J-MQ}), which is obtained using a standard abstract theory (see \\cite[Theorem 1]{CIRS} or \\cite[Theorem 1]{COR) }, and the solution $u$ is continuous on $\\Omega$ and $\\Omega$ is considered to be sufficiently small so that $u$ is bounded on $\\Omega$. However, the above optimal value is only known up to order $2$ (order $3$ if one adds a two-weekly periodization \\cite{ACA1}), and it is not even known if there is a strategy that"
        }
      ]
    },
    {
      "train_loss": 0.0481728515625,
      "update_counter": 4000,
      "epoch": 129.0,
      "val_loss": 5.5703125,
      "val_entropy": 0.74609375,
      "val_ttr": 0.5895182291666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 9) (14, 9) (15, 9) (16, 9) (17, 9) (18, 9) (19, 10) (20, 10) (21, 10) (22, 10) (23, 10) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 9) (30, 9) (31, 9) (32, 10) (33, 10) (34, 10)"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both the $1$-out of $N$ configuration (Figure~\\ref{comparison_whole_timeline_configuration_1}) and the $1$-in $N$ configuration (Figure~\\ref{comparison_whole_timeline_configuration_2}), the evolution of the accumulated error for the base algorithm   and the algorithms using the proposed strategy in different configurations: 1) the $1$-in $N$ algorithm is run in the case where the prediction thresholds are set based on a single data (the largest or smallest one among all the samples)\\footnote{This is the configuration where the false detection rate is a concern and wants to be avoided.}, 1-out of N and the $1$-in $N$ configurations, respectively; 2) the $1$-in $N$ algorithm is run in the case where the prediction thresholds are equal to $\\xi_{\\rm out}=0.5$, 1-out of N and the $1$-in $N$ configurations, respectively. In both figures, the results are plotted against the number of time windows $M$ considered"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple model of problem~\\ref{eq:problem_fluid} when $\\mu = \\lambda = 1$ and $K=2U$ so that the flow is accelerated and therefore enhanced. As expected, the a posteriori error increases with increasing refinements, but in contrast to similar studies in solid mechanics (see~\\cite{ \\bfBalsano} and \\bfTruskinovski} for three examples of fluid flows and related studies, respectively), it is noticed that the a posteriori error of the time-stepping solver in mode $i=\\text{fluid}$ is negligible. This is also true for a sequence of refinements shown in Table~\\ref{fluid_residuals_uniform_inequal}. In the upper half of the table the speedup is about $10$ and in the lower half of the table it is about $100$. Main reason for the high speedup is enhanced global character of problem~\\ref{eq:problem_fluid} for which a simple expression for the P-F map \\eqref{eq:ff_map} cannot be"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For fluid problems, we use the multirate splitting method for Stokes problems in \\cite{Discacciati2002} with an intermediate output of the fluid-structure interaction force field. A detailed description of our solution strategy for problems of structural dynamics can be found in \\cite{Discacciati2010SolvingFluidInfiltrationInStructures}. Overall control of the execution of the overall solution procedure between the multirate time-stepping schemes for fluid and solid and the parallel sub-solvers can be achieved using a control loop similar to that described in \\cite{Discacciati2010SolvingFluidInfiltrationInStructures}. Overall solution control can be further refined using a procedure similar to that described in \\cite{Discacciati2009OptimizingMultirateSolvers} and in \\cite{Discacciati2012OptimizingMultirateSolversForControl}. Overall solution control can be further simplified using an output fusion algorithm at the highest multirate between \\cite{Discacciati20"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "36\\% on the 2.4 GHz due to the high level of background interference in the map. The reason for this high error is that although the vehicle was observed to be about the same rough location on both the 2.4 and 70/80 GHz maps, the height estimate on the 2.4 GHz image was higher by about $3$ m. The reference height map obtained from the GPS does not show such large variations. This shows that, even though the vehicle was observed to be on the same rough location, level of resolution of the shorter wavelength imagery does not allow to resolve the details of the surrounding structure and therefore fails to identify the actual \\textit{position} of the vehicle. On the other hand, this problem does not exist on the higher wavelength imagery as it provides a much better resolution to identify the actual structure around the vehicle. Therefore, the total error for the 70/80 GHz images is about $17\\%$, which is much less than the floor to floor transfer error for the corresponding $2.4$ GHz images. When we look"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " AcceptedJGR} it is shown that the process of energy transfer from the CME to the corona is non-linear and therefore a simple relation between the 3D velocity of the CME and that of the corona is not expected. Moreover, analysis of \\citet{2020ApJ...897..134V} showed that the small-scale CMEs have significant role in hauling the bulk CMEs. From a heliocentric distance of $30~R_\\odot$ on 4 Apr. 2018, the CME was measured at 200~km~s$^{-1}$ larger than the 33~km~s$^{-1}$ of the corona, while at $35~R_\\odot$ it was measured at 305~km~s$^{-1}$ larger than the 301~km~s$^{-1}$ of the corona \\citep{2020ApJ...897..134V}. According to \\S3 of \\citet{Valgushev:2015 AcceptedJGR"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKovacevic:2011,Kovacevic:2012,Banfield:2012p901,Hammer:2012p5010}, in the context of classical CR geometry and quantum CR mechanics, in the paper and in the commentary below we will focus on its story in quantum general relativity and we will comment on the related issues in classical general relativity and quantum general relativity on the background geometry \\eqref{PSp}. Note that the quantum Kerr effect on Lorentzian metrics was considered in \\cite{Banfield:2008p120606}.  The scalar product in a canonical coordinate system is non-degenerate but on a general Lorentzian manifold it could be degenerate on a small neighborhood of some spatial axes. To avoid this problem we have  introduced the canonical coordinates in the metric toponology as basic elements of a scalar product on a manifold with a given metric and on a manifold with a given Lorentzian metric, see \\cite{Kovacevic:2007p2"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient. For Landweber iteration, we effectively use the same estimate for both the gradient of the objective function and that of the stability criterion. As we shall see in a demonstration in this section, this estimate can spontaneously yield a spurious local minimum in the sense that generating a continuation procedure starting from this point leads to a path in direction of increasing the objective value. This does not affect our main results in the paper since we always continue the continuation procedure by starting from optimal continuation points. The reason why the results in the paper still hold is that the optimal continuation points are significantly far from the spurious local minimum. The spurious local minimum is also far from the optimal continuation points in terms of the energy. In fact, it follows from Lemma~\\ref{lev_on_ll_min} that the value of the objective function from applying 10 continuations starting from the optimal continuation point $\\bar{x}_{k,i}$ is the same for all continuation points $\\bar{x}_{k,i}$ that are produced by stepsizes $\\gamma=10^{-4}$ or $\\gamma=1"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each component of the parameters without considering the overall convergence status of the parameters. In the linear case, the convergence of the parameters can be guaranteed by projecting the parameters-limiting vector to the parameter space given in the initialization, which is indeed a local operation. In the nonlinear case, we can still use the \\emph{controller} to verify the overall convergence of the parameters in practice. See the examples in the demonstration of neural network acceleration in Section \\ref{sec:demo}. Our controller also can be seen as a verification tool for the convergence theory in the nonlinear case. In this case, we can have a convergence mechanism that is independent of the initialization and the learning process and is guaranteed by the controller. In this way, we can always control the parameters to be in a certain status for the correspondingaccelerator to decide its further behavior. In practice, this additional convergence is guaranteed by additional learning steps. In this way, we always have control over the accelerated kernel and the accelerator can decide when the accelerated kernel is bounded in terms of parameters in order to avoid configuration overhead caused by excessive parameters learning. An"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbations in the boundary of the domain, which can be seen in Figure \\ref{fig:perturbance}. Note that in the case of smooth boundaries, $g_j = 0$, and the problem (\\ref{eq:Hessenarf}) is still modeled by parameters $\\mu_j, \\lambda_j$. Therefore, in this paper, we also consider the more general case where the parameters $\\mu_j$, $\\lambda_j$ are no longer zero but small. The model of the perturbation $g_j$'s has the purpose to avoid solutions with nonzero values at the boundary. In Figure \\ref{fig:perturbance}, we can see two cases: the case with perturbed boundaries, the solution $u_1$ is injected into the smooth boundary $\\partial\\Omega^{(1)}$ by the right-hand side $\\mu_1$ and has two distinct different signs at the perturbed boundary $\\partial\\Omega^{(2)}$, and the case with no perturbed boundaries, the solution $u_2$ is injected into the smooth boundary $\\partial\\Omega^{(3)}$"
        }
      ]
    },
    {
      "train_loss": 0.0493466796875,
      "update_counter": 4250,
      "epoch": 137.06451612903226,
      "val_loss": 6.23828125,
      "val_entropy": 0.6728515625,
      "val_ttr": 0.6061197916666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the optimal shapes for configurations ~similar~ to \\ifmmode\\mathrm{Ly}\\alpha\\else{}Ly$\\alpha$\\fi{} QSOs and \\ifmmode\\mathrm{Ly}\\alpha\\else{}Ly$\\alpha$\\fi{} QSO-galaxy mixes, respectively. The red lines correspond to line-dominated galaxies, while the blue lines to X-ray-dominant galaxies. Both the optimal shapes are concave (triangles). This shape is consistent with the observations: the distributions of both the broad-line volumes and x-ray emission volumes should be U-shaped. This is more evident for the first row (configuration~\\ref{comparison_whole_timeline_configuration_1}), because the optimal shapes for line-dominant galaxies (red lines) are Rec(2,1) shapes with height $a=4.5$ and R.L. of $\\nu=0.6$ (black lines). According to Figure~\\ref{example_shapes_and_partial_pressures}, this corresponds to a shape similar to an Esteban et al. (2011)"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy example of Problem~\\ref{problem-secular} with $N_t=9$ uniform time meshes. The ground truth according to Table~\\ref{free-form-sol} is $\\sup_{0 \\leq t-t_k \\leq \\text{Td}} \\lVert U(\\cdot; \\cdot_k) - \\varphi_\\tau \\rVert_{L^2 (0, \\text{Td}; \\mathbb{R}^3)}^2 = 0$. Here, $\\phi_\\tau$ is given by~\\eqref{fluid-func} with $\\psi$ given in Table~\\ref{psi-table-uniform} and $\\tau = 1$. Due to the simple choice of $F_i$ in this example, we include $L^2$-residuals also for the deterministic estimator. We observe that the a posteriori error for the stochastic estimator converges to the true a posteriori error by the solution of the discrete problem~\\eqref{Nke-trans} obtained by grid steps of $0.25$ has $"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is relatively stable and therefore assigned a low rate. A simple rarefaction is used here to model a window in the solid problem, as illustrated in Figure \\ref{fig:adapt_step_hybrid}. The object of this experiment is to demonstrate that, even in such a simple scenario, the proposed algorithm is able to suggest step sizes that are sufficiently large to move the solver out of the local minimum of the energy function. Indeed, as shown in Table \\ref{tab:ads_hybrid}, the suggested step sizes are all several times larger than those that would be chosen using the numerical test from section \\ref{sec:ref_test}. This table also shows the minimum and maximum relative residuals during the solver search, which demonstrate that the suggested step sizes also help with moving the solver out of a local energy minimum. In addition, the suggested step sizes have $\\ell$ and $s.d.$ levels that are, respectively, 1E-04 and 5E-04 for both problems, where $\\ell$ is the relaxation factor from the fluid problem defined in"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ on the $32$th floor (see Fig. \\ref{fig:freqshift}, floor $32$), about twice of the high floor error on the $16$th floor in the uncompressed signal (see $2.4$ GHz row, floor $16$). This might be due to the in-service timing-biased scattering which is more prominent in the $2.4$ GHz band since there is no interference since, in contrast to the $5$ GHz band, there are no building-to-building interference in the $2.4$ GHz band. Furthermore, the noise from the receiver's coverage in the $2.4$ GHz band is lower than that in the $5$ GHz band as the receiver is a mobile receiver that is frequently switched off. Thus, the noise biased scattering shifts the frequency spectrum as shown in $2.4$ GHz row. However, the small changes in the frequency spectrum as observed in Fig. \\ref{fig:freqshift} does not justify the deep changes in the floor dependence in"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " AcceptedJASTP} it is shown (see there for details) that the equilibrium of the energy density function impasses the notion of continuous coordinate  functions and therefore the notion of a periodic CME as an evolving cycle.  The  angular coordinates  $\\theta$ and $\\phi$ are continuous and therefore cannot  form a periodic cycle, which would be composed of time-changing fragments with the same positions and radii in each epoch.  However, the positions and radii of the CME fragments give a natural definition of a \"period\" and  \"orbit\" in the solar system.  The above analysis also indicates that the positions and radii should be defined by equations of motion, and thus the evolution of a \"period\" and \"orbit\" changes with time. Both actually occur due to the presence of dynamics. Since the rate at which the fragments expand differs among the phases of evolution, the observed expansion rate at any time depends on the history of the S/C and those of the other fragments.  Thus, the S/C can have a preferred initial projection location (the Sun), have preferred portions of the"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozskimorphity,HeinrichKotozski,HerrmannSchoen,ShinudaHerrmann} and also Section~\\ref{sec:fourierAnalysis}),  analysis and construction techniques have been developed mainly for the Wess-Zumino-Witten (WZW) model~\\cite{ZuminoEsssay,Fang_Zumino_Witten,KatoReview,Cox_WZW,Buchel_Moduli,Hori_Fang_Witten,Buchel_Cohomological_Decompositions,Hori_Slope_integrals}.  The WZW model has an important mathematical structure as the gauge system is derived from a \\textit{conformal nanoSUSY field theory}, i.e., a supersymmetric nanoformal theory with a linear Calabi-Yau target space~\\cite{ZuminoEsssay,Fang_Zumino_Witten}.  The connection with SCFTs have also been established by other methods~\\cite{Eremenko_WZW_Scully,S"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the target functions. To demonstrate this effectiveness we have presented Landweber iteration, one of the best iteration for second order methods, with the heuristic rule \\textbf{LWE} using the learning result for second order methods. However, we notice that the learned function $\\widetilde{h}_i({y})$ from the effective child node ${y}$ by using the method of section \\ref{section: learning effective gain} is significantly different from the target function $h_i({y})$ at the node ${y}$. This becomes mainly due to the noise in both input space and output space. However, we easily correct this problem by learning a new function $\\widetilde{h}_i({y})$ at the effective node ${y}$ using a third order polynomial, such that $\\widetilde{h}_i({y})=h_i({y})+p_3({y})$ with $p_3({y})$ being a third order polynomial on ${B}$. Using this learning results, we can easily replace $\\widetilde{h}_i({y)}$ by $h_i({y})+p_3"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each direction independently. The nearest linear approximation has U-shaped stability in the linear case. However, in the nonlinear case, the nearest nonlinear approximation has L-shaped stability. This means that, even if the nonlinear terms are turned on, the resulting convergence nature is similar to the linear case. Also, in the nonlinear case, if any one of the three directions (any one of $\\{RF, WC, $IS\\}$, $\\{WC, RF, IS\\}$ and $\\{IS, RF, WC\\}$) is eliminated, the convergence nature of the remaining directions \\textit{remains unchanged}. That is indicated in Figure \\ref{fig:reg_img_nonlinear}(c,g,k) where the red, blue and green lines correspond to the cases $p=10$, $p=50$ and $p=100$ respectively. Moreover, we can also prove that when any one of the three directions (any one of $\\{RF, WC, $IS\\}$, $\\{WC, RF, IS\\}$ and $\\{IS, RF, WC\\}$) is eliminated, the corresponding optimal values of"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noisy evaluation of some numerical scheme approximates the true solution $u$ to some elliptic problem $\\triangle u = f$ in the domain $\\Omega$. Often, $N=2$ will denote shallow water equations with non-dimensionalizing numbers $a, b$ that specify the dry-sand and fluid-fluid phases transitions, $u = u_1$ denotes the height of the water level, and $f = f_1$ denotes the scalar pressure drop due to gravity and other external forces. The term $\\sum_{i,j} H_{j,i} \\epsilon_j \\epsilon_i = H \\boldsymbol{\\epsilon}^\\top \\boldsymbol{\\epsilon}$ models a simple small-amplitude linear stability analysis over an affordable numerical grid. Here, $\\boldsymbol{\\epsilon} = (\\epsilon_1,\\ldots,\\epsilon_N)$ are the small imaginary numbers that appear due to numerical discretization, that is $ \\epsilon_j \\sim \\epsilon_0 j/N$, where $\\epsilon_0$ is the small number. The most commonly used model with non-dimensional numbers $a"
        }
      ]
    },
    {
      "train_loss": 0.0339052734375,
      "update_counter": 4500,
      "epoch": 145.1290322580645,
      "val_loss": 6.509765625,
      "val_entropy": 0.647216796875,
      "val_ttr": 0.6279296875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both configurations, the evolution of the optimal shapes with respect to time in the case of finite response function weight $w_{\\text{reac}}=0.20$ (left) and time limited to 50 seconds in the case of a constant response function weight $w_{\\text{reac}}=1.00$ (right). As we can see, shapes obtained for $1000$ reaction partners are almost identical for both configurations of the response function until around $t=30$ seconds, with $\\mathbf{c}(\\tau)$ almost constant. This difference in the initial conditions is due to the fact that the chosen partners are permanent, i.e., once selected, they are not recomputed by the controller. After $t=30$ seconds, the configurations of the response function lead to distinct shapes. This difference in the shape is only observable for $10$ seconds, because after that time, more reaction partners have been recruited into the cluster. As a result, their average reaction rate, represented by $\\mathbf{r}$, changes, resulting in a"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy example of the conservation law $\\dot{u} = 0$ on $[0,1]$ with $u \\mapsto u^3$ on the boundary. In this example we use equal time meshes. This example is far far from the quasi-uniform meshes considered in all other tables. This example is chosen to demonstrate several features of the a posteriori error estimator. First, in view of the generative model and the basis solution solution, the posteriori estimator should be universal. Its minimum should only depend on $V$ as well as its time step $\\tau$ and the norms of the time-evolution operators $I - C (\\cdot \\tau)$ at each time step. Indeed, the numerical data generating process generates time meshes $\\{ T_j \\}_{j=1}^n \\subset [0,1]$, where $T_j = \\frac{j-\\frac{1}{2}}{\\speed}$ for a speed $\\speed \\in (0,1)$. There are $n = \\left\\lceil \\frac{T_{\\rm final} + \\tau}{ \\tau}"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is time sensitive because of the point scattering procedure which is performed by the solid algorithm. Therefore, time steps for the solid are adapted through a delay term $\\lambda_T$ which is the solution of the optimization problem~\\eqref{eq:act_congr}. In this application, it is obvious that effort spent in predicting the motion of single atoms is irrelevant. The algorithm is therefore designed to fix this problem by not computing the subproblems involving the fluid $\\lambda_S = 1$. In case the CPU power is insufficient to solve for all the points which are supposed to be moved due to fluid inflow, the solution can be restricted to a subset of the points which can be considered to be mobile and which are most important from the optimization point of view. This can be done by modifying the output of step 1.2 of algorithm \\ref{algorithm:reg-fluid-solid} which predicts motion of single atoms into a matrix $\\mathbf{R}_{ij}(t_{k+1})$ which ranks solid nodes $i$ in their order of importance for the prediction of fluid inflow"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ \\cite{Tab.Proposed}. Table \\ref{tab:2} shows the performance of the algorithm when including the scanning frames in the map, i.e., the PRNU subwindow is filled automatically due to the randomization of the frames. In this case, the GPS location is extracted from the power signal and the error category of the received samples is classified via a deep neural network (DNN) as proposed in \\cite{DeeA17a,DeeA17b,DeeA17c}. There we also proposed how DNN and the GPS location can be used in order to clean the errors in the power signal maps. The number of samples per map is reduced from $100$ to $25$, and, we obtained tabulated values as shown in Table \\ref{tab:1}. Here, as well, the values of the distance between two GPS locations (of two different frames) are used to check the consistency of the PRNU model for the six frequency bands, and the variation of the Bayes error is shown"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that the bound on the total energy of a binary system cannot be fulfilled for masses that fall below the mass of the Sun. This argument leads to the conclusion that the initial mass fraction of the Sun in the progenitor mass is $\\geq 1$ and the initial mass fraction of the progenitor in the Sun is $\\leq 0.002$. The conclusion here is that at least one of the bodies must rotate counter-clockwise in orbit. This, however, already kills the equilibrium of the system, since the observed alignment of the orbits of the components in the CME is not compatible with a rotation of the orbit of the Sun in a counter-clockwise direction. In the case of the total energy bound, the authors therefore propose in \\cite{Valgushev:2015 Accepted} an amendment to the conservation law for the total energy of the system, which allows for a total energy lower bound. However, this amendment is justified on physical grounds, namely that about half of the system's total energy must be supplied by friction. But since the time of the first perihel"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozashvili:2012,KotozashviliBimini:2013,BanfieldMhlongo:2014,FigueroaGimenezKotozashvili:2015} and in Figure~\\ref{fig: tangential cone condition}),  there are other more flexible solutions of the Maxwell equation which allow to obtain non-unique solutions. The most prominent example is a so-called  electric or magnetic  dipole in scalar electromagnetism, which is a pole for a charge or a magnetic source. The dipole solution can be obtained by a reflective solution in the boundary, which is a solution reflection across a wall. The reflective solution has applications for the solution of a a scattering problem (see e.g.~\\cite{Bromberg1996,KaltenbacherKotozashvili:2012,Scharf:2011ar,Scharf:2012a,Scharf:2012pp,F"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient. For Landweber iteration, we recall that they use an rough estimate $\\lambda_{k+1} = \\lambda_\\mathrm{max} \\cdot \\tau^{k+1}$ where $\\tau$ is the termination threshold and $\\lambda_\\mathrm{max}$ is the maximum eigenvalue of the function's gradient. This estimate implies that the initial approximation for the next iteration is indeed $\\lambda_{k+1} \\cdot \\tau^{k+1}$. In the running of the simulation, we suppressed such an informal estimation and set $\\lambda_{k+1} \\equiv \\lambda_\\mathrm{max} \\cdot \\tau^{k+1}$. This does not change the convergence of the algorithm (we still have the same number of iterations to reach the maximum), but it could allow the simulation to compare different values of $\\lambda_\\mathrm{max}$ directly. We also observed that the Landweber iteration a sudden change in the value of the function at the first iteration. This corresponds to the removal of an spurious local minimum corresponding to the initial rough initial approximation of the function"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each place with respect to its neighborhood structure, and one can assume the existence of connection between each two neighboring element in the collection of solutions. To supplement the theoretical linear analysis, we can prove only linear local convergence in the mixed case. The local convergence arguments this time are also given in terms of the main or derivative terminal conditions. This is in contrast with the \\emph{global linear convergence} argument given in the linear case, which is given in terms of the original terminal conditions. This local convergence argument is also constructed by using the \\emph{perturbation method}, and we still get the local convergence of global this time. Similar to the linear case, the robustness of the collected solutions in this local convergence procedure is also discussed. In conclusion, we can conclude that there is no new nor extra convergence or robustness issues in the nonlinear case, and the numerical experimentation results in Section~\\ref{sec:nelementexperiment} can explain the performance of the PFLOR in the nonlinear case as well as in the linear case. Note that the robustness issue presented in this local convergence analysis is similar to that in the"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noisy evaluation of some numerical scheme approximates the true solution $u$ to some elliptic problem $\\triangle u = F$ in $\\Omega$. Often, $F=0$ indicates that the non-living parts of the system are in equilibrium. The term $g$ models external drive from external agents such as social networks, or external time-varying environments. Here we assume that they are simple multiplicative drives $g_j = \\ell \\cdot \\beta$, where $\\beta$ is the drive intensity, and $\\ell\\colon \\Omega \\times \\mathcal{T} \\to \\R$ is a function of time and users such that the number of agents who are affected by the drive is dynamic. This model generalizes the popular (imperfect) competition model where the drive is linear and proportional to the output. It also permits systems with constant drive intensities such as linear discounting. The number of users that is driven by a given intensity $\\beta$ is given by $\\beta positive \\overset{\\mathsf{def}} = \\sup_{s \\in \\mathcal{T}} \\beta(s)$ with"
        }
      ]
    },
    {
      "train_loss": 0.0330048828125,
      "update_counter": 4750,
      "epoch": 153.19354838709677,
      "val_loss": 6.73046875,
      "val_entropy": 0.6259765625,
      "val_ttr": 0.5888671875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both configurations, the evolution of the optimal shapes with the increasing of the relaxation time. $\\gamma=10^{-4}$ in Configuration~\\ref{comparison_whole_timeline_configuration_1} and $\\gamma=10^{-5}$ in Configuration~\\ref{comparison_whole_timeline_configuration_2}, in a graph that shows the time of the controller. We can see that in both cases the shape of the optimizer does not change much during the relaxation. This d(iffers from the behavior analyzed in Figure~\\ref{fig: fig00001} where we compare the shape obtained by the optimizer when $\\gamma=10^{-4}$ with the evolution of the shape for $\\gamma=0$ plus the evolution of the shape for $\\gamma=10^{-5}$, with $\\gamma$ allowed an range of action to modify the shape of the optimizer. In the first case the modification is small and the optimizer found with $\\gamma=10^{-4}$ is optimal for a long time. In both cases the rapid changes in the position and shape of the optimal"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy example of Problem~\\ref{problem-secular} with $N_t=8$ uniform time meshes. The grids are generated on the interval $[0,1]$ such that $t_k=k-2$ for the $k$-th mesh. Since the solutions $u$ and $\\bar{u}$, although modeled on the same space, differ even at the final time $t_{8}$, the a posteriori estimators are not perfectly accurate. However, we observe that the estimators speculated about the \"true solution\" in the correct direction. For $k\\geq4.0$ such estimates are sufficiently close to the true value for the discussed estimators to work well. We also observe that the estimators are decreasing with time. In particular, between $t_{4}$ and $t_{5}$ the estimators are closer to the true value than at $t_{4}$. However, for $k\\in\\{1,2,3\\}$ the estimators are almost constant such that they would provide incorrect estimators of the a priori error for these time meshes. This is due to"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is time sensitive because of the frequency dependent term in $\\bm{H}_s$ even if the other problems are solved using time-independent energies. It might be obvious to use the same step size for fluid and air with some assumption of a similar time scale, perhaps the same. However, air is a complex system with fast motions in the gas phase as well as in the liquid phase, indeed in the solid phase, which are non-negligible on the simulation timestep. If the solution behavior cannot be observed easily due to a large number of variables, a simple method to establish a suitable step size is presented. By testing the solution in direction of the fast variables, suitable step sizes can be obtained for each problem separately, which can then be adapted further to specific problems. The time-dependent solution of the multirate system is presented in the following. As shown in the prototype code presented in Section~\\ref{sec:implementation}, the multirate framework can be easily extended to additional levels of resolution. If, for example, a further resolution of 1 $EDT$"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ \\cite{Tab.{}\\ref{tab:image2}, tab:images7}. The number of error floors is $3$ for the correlation and the method of $2.4$ GHz+bandwidth training patch size and $4$ for $2.4$ GHz+bandwidth training patch size and MATLAB training patch size. In all the cases, after training the neural network, the algorithm is able to work in real-time so that it can handle real scenes as shown in Fig.{\\ref{fig:real}} A, B. It is worth emphasizing that in all for the first three floors, the algorithm is able to achieve low error rates, although the training patch is small. However, for the fourth floor, which is the frequency floor, it is better to train the algorithm using frequencies in the entire world, so that we can achieve zero floors. However, delivering a large number of frequencies to a real scene is difficult. Also, a real receiver has limitations on frequency drift fixation. Thus, it is logical that the frequency drift should be fixed by a reference frame"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "April22} it is shown (for the CME properties based on the observations from LASCO-C1 and COR2) that the structure of a CME changes greatly during the first two Encs. Thus, instead of a smooth, spherical structure with a velocity dispersion along the radial direction and a decrease of the velocity in the transverse directions, the observed CMEs have a cylindrical structure dually embedded in the solar wind \\citep{Strous:2000Phenomena,Kawai:2007ApJ,Valev:2011Aug25}. In this case the solar wind velocity parallel to the radial (axial) and transverse (radial) to the CME direction remains constant, which is contrary to the classical velocity distribution along the radial axis. Moreover, the hydrodynamic model of the planetary-scale CMEs successfully matched the radial decline of the magnetic field toward higher radii \\citep{Shioshima:2014April15}. Moreover, the modeling of \\citet{Kawai:2007ApJ} showed that the cylindrical-like structure"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozashvili:2013tgr}, \\cite{BolitskiGorin:2011 AWA}, \\cite{Voronov:2013dma,Voronov:2014dma,Voronov:2015nfa,Voronov:2016nfaa}) we have noticed that the condition of a ``weakly trivial cosmic brane'' \\eqref{weaktrivbranecondition} is essential for a number of different problems we have studied in recent years (see \\cite{Malkin:2013tma,Malkin:2013gma,Kuznetsov:2014GMA,Kuznetsov:2015GMAa,Kuznetsov:2015gma,Kuznetsov:2016KP). For example, this condition is essential for a successful introduction of a critical density point in two dimensional quantum field theory with a global symmetry \\cite{"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the costs. For the labellings produced by the initial Landweber iteration, we also has efficient approximations of the costs (i.e. the APD). In fact, for the initial labellings generated by the starting algorithm, these estimates are randomly distributed in $[1/k, 1]$. This way, in the first Landweber iteration, the algorithm incorrectly produces a first local minimum of the utility which is in fact a spurious non-optimal solution. This phenomenon happens when the starting labellings contain two neighbouring cycles of length two. However, this initial Landweber solution can easily be improved to become an optimal solution by performing a single iteration of the \\emph{Relabeling step} of the National Shearer algorithm in Figure~\\ref{fig:algorithm}. This relabeling step is able to detect the incorrect estimates of the costs produced by the first Landweber iteration and to output a new optimal solution in terms of utility only after verifying that the estimated costs are correctly produced. The time cost of this additional iteration in the National Shearer algorithm is guaranteed to be $"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each place, and the derivatives are compared to those in the linear case. The direction of the convergence can be determined by the direction in the linear case. See Section \\ref{TheNewSkills} for the detailed explanations. Besides, we can obtain a much more robust result than either of the linear or hybrid case. Firstly, on the condition side, the convergence condition in the linear case is three conditions for each direction, which means one condition for each place. The conditions in each place are collections of derivative conditions and linear-linear correlation conditions. In the linear case, one can not guarantee a non-zero derivative at any direction at any place. Secondly, on the parameter-update side, the number of parameter updates in the nonlinear case is $32{N}({N}+\\sqrt{N}{\\tilde{r}}){R}({\\tilde{r}}), where R \\tilde{r}$ is the number of residual blocks, $N}$ the problem size, and $\\tilde{r}$ the ratio of $i$-th residual block to $i'$-th residual block. This amount of parameter updates"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noise term, is called a linear noise-affected boundary problem. The concept of a $C(\\Omega)$-module means a module $M$ equipped with a continuous map $\\tau \\in C(\\Omega) \\times M \\rightarrow M$. The letter $C$ denotes continuity of the module operation $\\tau$, and it is in harmony with the continuity of the module structure and the algebra structure. The module operation and the module algebra operation are governed by a boundary problem.  In 1970, a general theory of $C(\\Omega)$-modules was introduced in \\cite{Jacobs:1970ui} for continuous modules that are equipped with an algebra structure ${\\rm End}(M)$ that is independent of the module space $M$. This lead to the development of the quantum groups via the construction of $C(\\Omega)$-modules. In \\cite{Jacobs:1970ui}, the module operation is defined directly in the algebra ${\\rm End}(M)$ defined as the direct product over $\\Omega$ where $x \\in \\Omega$ contributes a copy of"
        }
      ]
    },
    {
      "train_loss": 0.03257958984375,
      "update_counter": 5000,
      "epoch": 161.25806451612902,
      "val_loss": 6.849609375,
      "val_entropy": 0.6260986328125,
      "val_ttr": 0.5963541666666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both configurations, the evolution of the optimal shapes at three different stages of optimization: (a) when the controllers are trying to reach the target shape (b) when the shape has been reached but the shapes are slightly off (c) when the shapes are almost but not exactly reached. The panel (a) shows the shape reached at the end of the control when the optimal shapes are reached (b) and the panel (c) shows the shapes that are off from the target. We can see that in configuration~\\ref{comparison_whole_timeline_configuration_1} when the shapes are off from the target and not close to it in shape, it is due to noise that appears in the evolution of the optimal shape. This happens mostly in the shapes with low value of $N$ (3.0 and 5.0). In contrast, in configuration~\\ref{comparison_whole_timeline_configuration_2} the main reason that the optimal shapes are not reaching the target and are off from it is due to the lack of stability of the optimal controls. This is mainly shown in"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple toy example of Problem~\\ref{problem_egg} with $N_t=8$ uniform time meshes. The strong resolution preference of the uniform mesh does not affect the a posteriori error estimator. In particular, the production meshes produced by the PMV-L/LV preconditioner are larger than the one chosen on the uniform mesh. Moreover, the production meshes do not vary on the grid level. More grids grown with the production method give production meshes that are still larger than the corresponding grids of the previous example. The production meshes produced by the PMV-L/LV preconditioner show a dissimilarity index from zero on $85\\%$ of the production meshes. The only exceptions are production mesh $9$ and $10$ generating dissimilarity percentages of $95\\%$ and $96\\%$, respectively. We conclude that the produced production meshes are large enough to contain solution fluctuations of the problem. Furthermore, the error estimates computed on the production meshes are relatively accurate. The estimator works well for solution values as well as differential equations. In particular, the estimator works"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is solved at a faster rate than the solid problem, i.e., it is passed `information' every $p$ time steps by a multirate method such as the split-block method proposed in \\cite{Arnold2001}. Every $p$ time steps, some of the solid state is updated to be up to $t_{next}$, where $t_{next} = p...|I_1|..$. However, we observe that a direct update of solid does not slow down fluid in the wrong way because otherwise, every time fluid is updated at a $p$ time step, solid has to be solved from time $t$ to $t+T$, which is too long for $p$ to be a multiple of $1$. Therefore, we propose a modified solution strategy, which is shown in Figure~\\ref{fig:multirate-solve}. Because of the explicit boundary conditions, every time fluid is solved, solid is solved from time $t$ to $t+T/p$, thus making the exchange of information between fluid"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ on average (seen in Figure \\ref{fig:f3} grid). In Figure \\ref{fig:F2} and Figure \\ref{fig:F3}, the 2 and 4 th floors are over-localized as seen by the figure (a) and (c) of those blocks. However, the error on the 2 nd and 4 th floor in $2.4$ GHz is under estimated as shown in (d) and (e) of the blocks, respectively. It is worth noting that, not all floors are divided into blocks equal in size; e.g., the $9^{th}$ and $2^{nd}$ floors only have two blocks as shown in (k) and (i) of Figure \\ref{fig:f2} and the $10^{th}$ and $1^{st}$ floors only have one block (seen in (o) and (i) of Figure \\ref{fig:f3}). In addition, the difference between the two blocks for the same floor varies. For example,"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted:ISAS} we have shown that the ejecta of a CME must be connected to the host phenomenon (the eruption) by an interaction between the S/C and the companion before the eruption occurred, such as a pulsation, a explosion, a jet, a magmatic blob expulsion. The chance for a random CME to find itself interacted by an S/C and companion in the solar system is very low whereas in the planetary system the chance to have this interaction is very high. Therefore no CME can simultaneously belong simultaneously both to a solar system and a planetary system. Therefore, we conclude that the chance that there is a relation between the CME that crossed {\\emph{PSP}} right before Enc.~1 and the Enc.~1 CME which was observed by {\\emph{STEREO}}-A is very small and therefore this does not justify the acceleration mechanism proposed in \\cite{Valgushev:2015 Accepted:ISAS}. Moreover, the  likelihood of a long-range acceleration means that the effect of the source CME would have to extend through at least two S/C interactions with the"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozashvili:2013tgr}, \\cite{BolgianoMeyardy:2013gtc}, \\cite{Koshelev:2014ggr,_articles_2021_Yuri}), in many new problems this condition turns out to be too restrictive and it is necessary to consider much more general asymptotic behaviors of the fields. In order to fix the ideas, we will consider fields that come out from the straight line solution by an arbitrary smooth deformation. Moreover, in order to make this deformation target oriented, we shall refer to use as $\\mathbf{s}$ to fields whose behavior on the target is given by the simple linear line solution $\\mathbf{s} = \\mathbf{s} (s)$ \\eqref{straightlinesolution} with $\\mathbf{s} \\in \\mathbb{R}^3$. Thus, the asymptotic behavior of the fields $\\Phi_i$ is  given by $\\Phi_i(\\mathbf{s} \\cdot \\bm{\\gamma}) \\plus$ some terms that do"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the R-functions, which is obtained by the Landweber iteration. The spurious local minimum for Landweber iteration reported in this section is not due to the valueless objective function, but due to the fact that the kernel matrix changes greatly by removing relatively small neighborhoods from it with each update. As a result, even though the algorithm converges to a local minimum, the target function does not decrease significantly. This is while the Brackx-Hansen algorithm does not notice such a phenomenon. Below we show  the effective performance of the Landweber iteration by comparing the target function with the Brackx-Hansen iteration. In the figure below, the effective performance of Brackx-Hansen and Landweber iteration is shown for $k=10$ initial conditions. The initial conditions are generated by random values, their average norm is $2.60$, and they are sampled from the regular part of the kernel (see figure \\ref{fig:performance2}). Both methods find their global minimum (the triangle-shaped region). However, the effective performance of Landweber"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each place, and the derivatives are compared to those at the neighboring points from a finite selection set. To explain our convenience on comparing derivatives, we stop at the derivative level for convenience. Since the underlying function $f$, $i.e.,~$$f^{(k)}$$,  converges locally means that its derivatives $f^{(0)}, f^{(1)},\\dots$ converge to the corresponding derivatives of the underlying function in a neighborhood of the $K$-th order power function $\\sqrt[K]{f}$. However, if the selection set $\\mathcal{S}$ contains points from a uniform distribution, we have a convergence in $L_{\\infty}$ (see Theorem \\ref{thm: NL only local convergence}). We highlight that the convergence theory in the linear case is a global one, meaning that the derivatives of the residual and the derivatives of the underlying function, all converge to the corresponding derivatives of the underlying function----the $L_{\\infty}$ convergence, at ANY given point. As a consequence, we may turn to the linear case when the selection set $\\mathcal{S}$ from a uniform distribution is"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noisy evaluation of some estimate $x$ by some limited set of true values $\\mathcal{L}, ~ e_j \\in \\mathcal{L} \\setminus \\{x\\}$ with $|\\langle g_j-e_j\\rangle| \\gg 0$. In all we have $x \\neq g_j$'s and the corresponding estimate of the unknown true value $\\mathcal{L} \\sim p(x)$. The above stochastic optimization corresponds to the estimation of true values $\\mathcal{L}$ from the noisy observations $g_j$'s. Such a reversal of problems happen quite often in science and engineering. If the prior distribution $p(x)$ is known, one can define the noise term $e_j$ directly at the level of $x$, so that the corresponding $g_j$'s are not noisy. Then the optimal solution to the optimization $(i)$ can be obtained by an optimized solution to the forward problem $x \\rightarrow g_j$. This is, however, not always feasible. The noise term $e_j$'s depend on complex"
        }
      ]
    },
    {
      "train_loss": 0.040052734375,
      "update_counter": 5250,
      "epoch": 169.32258064516128,
      "val_loss": 5.458984375,
      "val_entropy": 0.750732421875,
      "val_ttr": 0.5696614583333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": ", for both models and both genders, the evolution of the peak height as a function of age for \\textit{Only Shared Information}, and \\textit{Both Models} respectively. In both cases we can see that while there is evolution in the peak height, there is no significant change in the height level between both models, which means that there is no advantage of using information from multiple senses to model the object, in predicting the height of the object. We can also see that there is no Reaction-Diffusion dynamics, since the peak always appears in the center of the population, in both models, and the level of height is almost constant across age, except for the last peaks, in which it slightly shifts to the right. This behavior results from not having any advantage of using more senses to predict the peak height, therefore, the models use the same strategy. We can also observe that, for both models, the peak height starts to increase when the peak becomes the dominant one, that is when all individuals in the population, have the same height. This behavior results from the inhibition mechanism, which imposes"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for the simple model of Section~\\ref{sec:fluid} with $\\kappa=\\lambda=1$, $n_t=100$, $n_x=n_t/100=1000$, and an initial condition $\\Omega_i=1, i=1,...,25$. In this case, $S_-\u6d77\u7ef5$ and $S_+\u6c14\u7403$ with scale $\\frac{1}{10}$ are floating in the domain. In Table~\\ref{fluid_residuals_uniform_scale}, we provide the a posteriori estimator for the same solution with a scaled domain size $\\Omega=\\left(10,10,10\\right)$. For the $L^2$-norm estimator in both Tables~\\ref{fluid_residuals_uniform_equal} and~\\ref{fluid_residuals_uniform_scale}, the results for the PMP-DDP method are provided. We note that the computed solution is recovered at the end of the PMP-DDP procedure by a step size of $0.01$ and"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The overall solution method is thus composed by a multirate time-stepping scheme for each subproblem, address-based communication and a lower-level data exchange between the fluid and solid iterations. The time-stepping schemes are detailed in \\cite{fedele_cm_2011}. The overall solution method is further endowed with a ``moving'' mechanism for the interaction between fluid and solid, so that the previous solution is ``thrown away'' when the latter is ``full''. This mechanism is required because the solid problem is much slower than the fluid one, and a large amount of fluid solutions become obsolete before the solid iterator runs up to it. This is due to the multirate time-stepping used for the fluid-solid interaction; indeed, exploiting the adjusted step sizes, given by Eq. (\\ref{adjusted_ts}), the overall solution method can process \\textit{$p$} solutions of fluid per one from solid with \\textit{$p$}=2\\footnote{In practice, depending on the user-defined stop condition, the actual ratio can be even larger"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "0.098\\% in $3.7$ GHz. The results are presented in Fig. \\ref{fig:powerplot}, where floor and frame biases are small $0.007$ and \\(0.001\\) e-folding, respectively. There is a sharp drop in accuracy on the second to last level, from 12th to 13th floors, from 0.048\\% to 0.36\\%, corresponding to the unoccupied mezzanine. Similar errors for other frequencies are \\(2.4\\) GHz: 0.045\\%, \\(2.9\\) GHz: 0.04, and \\(3.7\\) GHz: 0.06. Figure \\ref{fig:powerplot}, right, shows the relative change in the biases for different frequencies. The bias on the last level, considering all the beams, shows an increase by more than 1 e-folding, in other words, the relative change in this e-folding is about 100. Results for single floors are"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "nfa}, the authors argue that because of the non-newtonian character of the gravitational phenomena in the AVA, the notion of an orbit and an equilibrium point is not well-defined. Indeed, the potential in the AVA does not hold a fixed point, and approximations of a cusp-like point always belong to a whole family of points  moving inside the $r$--$\\theta$ plane w.r.t. the source body's gravitational field. Moreover, as shown in \\cite{Valgushev:2016gru}, even an instantaneous CME is unable to find an equilibrium. This is so, because the motion in the AVA must be \\emph{observably}, which means that the coronal source is always observed at some spatial resolution. Since the instantaneous value of the field strength is function of the resolution, the in-situ motion always follows a piecewise continuous curve instead of an $\\mathrm{d}x/\\mathrm{d}t = 0$ trajectory. As a result, both the concept of an equilibrium and an orbit in the AVA are, in certain aspects"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,Os,Herrmann,Zou,Press,Stoilow}, \\cite{Maurelainteau,Ginoux} and the review in \\cite{GeU}), reality of this condition is not yet entirely understood.s Some qualitative arguments in order to understand where the viscous core should be located are given in the seminal paper by \\cite{Press}. A non-trivial numerical method for the determination of the viscous core depth $H$ is presented in \\cite{Maurelainteau}. The method is based on solving  the Stokes problem in an aperture (a channel) with a sharp angular perturbation. The solution of the Stokes problem is obtained by a numerical realization of the PIV method (see \\cite{Press}). The solution of the PIV method is fit to a fit of the non-linear second-order polynomial $p(\\theta)$ (see \\cite{Maurelainteau}). The angular perturbation is computed from $p(\\theta)$ using the angular cone condition of \\eqref{tangentialconecondition}. The resulting relationship between viscosity depletion and aperture is $H"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of functions. For Landweber iteration, two easy to compute efficient estimates are provided by the heuristic rule and by the gradient method. In this paper, we focus on the heuristic rule generated by our starting point (an approximate initial value problem). This heuristic rule is characterized by three free parameters: the initialization parameter $\\tau_c$, the $\\epsilon_g$ parameter and the initial value of the $\\epsilon_g$ parameter. For this starting point, the algorithm produces a sequence whose first approximation level is a single flat level at $r=r_c(f)$. This corresponds to the situation of the starting value problem, as it is characterized by the effective rule $h(z)=z/m_c(f)$ for $z$ in an interval containing $z_0$. This effective rule has a single critical point at $z=z_c(f)$. In contrast to the previous scenario, and in agreement with \\cite{frougny}, we have seen that for $k>0$, and for $z_0$ contained in the interval connecting $0$ to"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each hidden layer on a neighborhood of the target mapping. The convergence status of the entire network in the nonlinear case and in the linear case are different. In the linear case, we can prove that the evolution direction is always toward $\\bm{x}\\rightarrow\\Pi_{\\mathbb{R}^{d}} (\\bm{y}.*\\bm{w}^+)$. While in the nonlinear case, the evolution direction is taken on the sub-manifold $\\{ \\bm{y}.*\\bm{w} \\in \\mathbb{R}^{d} \\subset \\mathbb{R}^{2d} \\} $, which only contains the line in $\\mathbb{R}^{2d}$ where $\\bm{x}$ is bounded. Moreover, it can be proved that the function $\\bm{y}.*\\bm{w}$ always decreases on the closure of this sub-manifold, which follows from the S-curve property of $\\frac{d}{dt} \\bm{g}(t)$ (see \\cite{c4}). Together with the twice-infinitely-differentiable rule in the nonlinear case (Theorem \\ref{"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a noise term, is called the Gray-Scott (G-S) model \\cite{Akiyama2006}, which expresses the reaction-diffusion mechanism for motility in microorganisms\\cite{Esfandiari2014}. The G-S model is one of the most popular model for describing the bacterial dynamics in dense communities, introduced by Akiyama and Saito in 2006 \\cite{Akiyama2006}. It has been shown that this model captures the most important features such as \\textit{i-}) constrictive behavior \\cite{Layton2007}, \\textit{ii-}) inhibitory behavior \\cite{Layton2007}, \\textit{iii-}) stimulatory behavior \\cite{Layton2007}, \\textit{iv-}) competitive/non-competitive behaviors \\cite{Layton2007}, \\textit{v-}) passive advection \\cite{Layton2007}, \\textit{vi-}) active"
        }
      ]
    },
    {
      "train_loss": 0.061861328125,
      "update_counter": 5500,
      "epoch": 177.38709677419354,
      "val_loss": 6.234375,
      "val_entropy": 0.658935546875,
      "val_ttr": 0.6103515625,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the efficiency for both configurations as a function of the iteration count. The mean efficiency and its fluctuations are shown in Figure~\\ref{fi_est_mean_1} and \\ref{fi_est_mean_2}. In both cases we observe that version 1 is always more efficient than version 2 and its fluctuations are smaller. Figure~\\ref{comparison_iteration_dependent_configuration_1} shows the efficiency of the \\textit{while while while while while} strategy for both configurations. In this case for iteration count between 16 and 22 both versions are equally efficient. Figure~\\ref{comparison_iteration_dependent_configuration_2} shows the efficiency of the \\textit{for while while while while while} strategy for both versions. This strategy is less efficient than \\textit{while while while while while} for all the iteration count that we considered. This strategy also has a high fluctuation rate, except for iteration count 20 and 24 for version 1. In this cases the strategy is more efficient than \\textit{while while while while"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} Navier-Stokes problem in the small solution case. The bottom half of the table compares the estimators performance for the external and internal boundaries. The top half compares the estimators performance for the case $u_1=0$ and the case $u_1=1$. Note that the performance of the fluid subproblem is very good and almost optimal and almost never uses the outer threshold. Similarly, the performance of the potential problem is very good and almost optimal and almost never uses the outer threshold. However, the fluid-potential coupling problem has an almost $9$-fold variability in performance in the top half of the table. This happens because the function $\\psi_{\\tau_k}(z)$ in~\\eqref{fluid_residuals_uniform_fluid} depends on the solution of the fluid problem, that is $u$, thus, for different choices of $u$ the level set functions $\\psi_{\\tau_k}(Z_h)$ start to differ and hence the matching problem in~\\eqref{fluid_residuals_uniform_bound} varies. Note that if we"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For the fluid problem, we use the scheme offered by the High Performance Computing (HPC) Package of Numerical Algorithms Group (NAG), an extension of the previous partitioned-view in time algorithm, presented in \\cite{C2010}. For the solid problem, we use our own algorithm, adapted in time for heterogeneous subproblems, similar to the strategy used in \\cite{H2012}. The time steps in fluid and solid are not coordinated between each other. The convergence of the HRTDOC algorithm towards the limit problem can be observed in figure \\ref{fig:convergence}. In both problems, the convergence of the HRTDOC algorithm is guaranteed on the basis of the a priori given preconditioners and a finited-difference basis for the solution. The main computational cost of the HRTDOC algorithm in fluid is given by the solution of the PDE in contrast to the strong multirate time-stepping in solid. The main cost of solid is derived from the updated multirate time-stepping scheme and the step size adaptation. We compare the computational cost"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the second floor, $2.4+2.4+2.6\\%$, for the 250 kHz\u9891\u5bbd and on the third floor, $2.4+2.4+2.2\\%$, for the 500 kHz\u9891\u5bbd. This is best achieved with a proper downconversion lens, as shown in Fig. \\ref{fig:fig10}. The magnetic field due to the main receiver can interfere with the measurement and therefore, we also need to shield the receiver. For that, we created a double flap door, and only one of the flap can be opened to access the receiver. The floor on the receiver side is made of magnetic concrete, and all the walls are made of non-magnetic concrete. We also put a steel target in the wall between the first and the second floor, and, surprisingly, the magnetic floor on the first floor does not reach the edge of the target. In other words, the surface magnetic field is well combsuted and we can just add the readings from different sensors on the second floor to reduce the errors."
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted:Aps} we have shown that the dynamical analysis reveals a much more complex picture than the equilibrium one. The dynamical analysis reveals the processes that lead to the dippers' roll-about as the long magnetic tether gets stretched and twist is imbalanced. All of this results in the rotation of the dipole axis. The analysis also shows that the average magnetic field components, responsible for the dippers' resistance against the rotation, change in the solar wind stream. The stream is always there, what means that the magnetic connectivity is always present and the outcome is the rotation of the magnetic tether. The equilibrium presented in \\citet{Valgushev:2013ApJ...767..100V} is, in fact, far away from the reality. In the equilibrium version, the S/C cuts the magnetic tether and suddenly jumps out of the magnetic field structure and proceeds with its observing mode in the solar wind. Dynamical analysis shows the reality much closer to the reality: the S/C continuously interacts with the magnetic field and participates in the maintenance of the magnetic field structure. In the"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozskRef}), there is also a more abstract reason, why one want to study the mathematical regime of weak and strong friction.  The description of the small and large friction case are different in nature, and different analytical and numerical methods are more appropriate for each case. The analysis of small friction involves studying the system of ordinary differential equations \\eqref{oDCS} for initial conditions of small-size perturbations of the critical solution. The numerical methods should be chosen such that they can be applied to very small initial conditions as well, which in turn means that the numerical precision should be much higher than in the case of large friction. On the other hand, the numerical implementation for the large friction case is different, because the original system \\eqref{pde0} is not itself suitable for analysis of the strong friction limit. The reason is that the system itself does not capture the additional motion of the critical torus, caused by the non-trivial dynamics of the tangential cone \\eqref{tangentialcone}. In the case of large friction, the critical torus moves in the sublevel set of"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradients. For Landweber iteration, we refer to the efficiency of the gradient estimates as the \\emph{irregularity} of the function \\emph{which may be very inefficient!}; we will discuss this issue when presenting the results of our algorithm in Section~\\ref{sec:results}. In this section, we describe a surprising observation on the initial Landweber iteration. For the current setting, the optimal function has a large number of small bumps as shown in Fig.~\\ref{fig:optimal_landweber_performance_unfixed}, and accordingly, the initial heuristic rule $(f_0(), g_0()) = \\mathcal{H}_I$( annulus $R<r<a$, $p<W$, $q<L$) performs a surprising behavior for Landweber iteration as shown in Fig.~\\ref{fig:optimal_landweber_performance_unfixed}. There is a spurious local minimum for $t=0$ as shown by the blue dots; specifically, the performance value starts from any value higher than 0 and decreases at first, before eventually reaching"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, no matter where it is in the entire space, but the convergence is limited to some neighborhood of the current given solution. This is in contrast with the linear case where one can prove global convergence. In practice, this local convergence is still sufficient in many cases because the original function values of the nonlinear terms do not change much from the initial values; see Example~\\ref{ProvenByPlotting} for details. \\emph{Only local convergence in nonlinear case} is also due to the way the optimality conditions are expressed. In the linear case, one has standard optimality conditions for the function values, whereas in the nonlinear case, the optimality conditions are all derived using (I-)GP values. Since we are using standard optimality conditions, we need to mimic the strategy used in the linear case and also impose constraints on the model parameters; see \\autoref{sec:commentApplic} for the application examples. \\emph{Only local convergence in nonlinear case} is also due to the way the optimality conditions are expressed. Due to this impingement, it would be difficult to prove a certain"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a biological or mathematical process involved in the dynamics. Such models are usually termed as universal models. To understand the neural network dynamically, we need to impose an arbitrary condition on the output. In this paper, we use an auxiliary constraint method which is one of the cutting edge method in neural network theory \\cite{Parvi2019}. For convenience, the formal neural $F_{\\nu}(x \\in \\Omega; \\theta^{\\nu})$ is represented as $F_{\\nu}(x;\\theta^{\\nu})$ in this paper, where $\\theta^{\\nu}$ denotes the parameter of the neural $F_{\\nu}(x;\\theta^{\\nu})$. The $\\theta^{\\nu}$'s are to be determined, which are called the time evolution parameters. When $\\Omega=\\R^{N}$, we use $\\nu$ to denote the space dimension. In this paper, we only consider the case that the domain $\\Omega$ is bounded, i.e., $\\Omega \\subset \\R^{N} \\subset \\mathbb{R}^{D}$ with $D>N$ with smooth boundary condition $b"
        }
      ]
    },
    {
      "train_loss": 0.033322265625,
      "update_counter": 5750,
      "epoch": 185.4516129032258,
      "val_loss": 6.50390625,
      "val_entropy": 0.643798828125,
      "val_ttr": 0.5999348958333333,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the system's time-correlation function along the two paths of our experiment. For each curve we denote by different colors the limits where we stopped to plot the curve. In this way we can highlight the points where we stopped and performed analysis of the system in configurations corresponding to different equilibrium states: a liquid-vapour transition (red curves), a liquid-liquid transition (green curves) and a first-order (blue curves). In each curve we identify four points: the highest height of the shoulder (upper right), the intermediate peak (upper left), a low peak (lower right) and a minimum (lower left). The position of these points is labeled with numerical values. From this data we extract the critical parameters at which the system goes from a phase with a shoulder to a phase with a peak. In this way, we confirm our finding using the characteristic of the time correlation function known as anomalous scaling near the phase transition. In Figure~\\ref{aoft_vs_t} we study the scaling behavior near the critical point using the anomalous orientational order parameter $Q(\\"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} Dirichlet problem. The top half shows the estimated error for the finite element solution of~(Problem~\\ref{eq:fp_akrov_grammalp}) found by solving Problems~\\ref{eq:fa_bkrov_grammalp} and~\\ref{eq:fa2_bkrov_grammalp} using GFAL, while the bottom half gives the error estimated on the discrete solutions of Problems~\\ref{eq:fd_gt_akrov_grammalp} and~\\ref{eq:fd2_gt_akrov_grammalp} obtained by GFAL using the same finite element model but with a compact domain. All problems have $n=50$ time steps, $N_T=100$ and $P_T=200$, and $u_0$ given by~\\eqref{eq:initial_value_uniform} with $\\Omega=(0,1) \\times S_0$ for some simple shape $S_0\\subset\\mathbb{S}^d$ (e."
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid step size is determined by the time scale of the valve opening and closure, while the solid step size is required by the time scales of the plastic deformation that takes place in the body and elements. Furthermore, a complex multirate scheme for the coupling is also proposed to overcome the problems of synchronization and time-reversing of waves that are commonly encountered with the traditional synchronous schemes. In addition to these, regular output updates are done at each stage, so that sufficient descriptive information is always available to do future time-steps in a reliable manner without wave guessing. All these advanced features make the multirate embedded dislocation motion simulator to be unique in comparison to other multirate finite element or finite difference schemes. It should be noted that the adaptive multirate schemes used in problems featuring mixed discretization, such as mechanical fining of thermal problems, are currently the best in the world. \\cite{cct,dtn} In contrast to these schemes which adapt among multiple time-stepping schemes within a single subroutine, our multirate scheme adapts among the entire dislocation propagation"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the second floor, $2.4+2.4^{(2)}$ (Figure \\ref{fig:fig2}), and on the third floor, $2.4+2.4^{(3)}$ (Figure \\ref{fig:fig6}), to about 1\\%. This reduction in error for the higher floors is taken from the last two frames of the series in Table \\ref{tab:tab4} since, in those frames, the error on the first two floors is below 10\\%. Also, seeing the evolution of the error on the first two floors in the frames $972$ and $1002$ in $2.4+2.4^{(1)}$ (Figure \\ref{fig:fig2}), shows that, although the layout of the shelves has not changed, the error on the first two floors gets higher. That is due to the fact that, the rows and their locations on the floors are forgotten after a while, on account of the floors being well separated in the RF range. That effect would cause large errors if we"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors analyze manuscript reports and draft documents for CME VCME and show that the orientation of the CME transiente by longitude and time was $\\theta \\sim 90^\\circ$ in both the crossing and the science ships. Using this constraint, only a zero-crossing version of the acceleration can be possible. However, in \\cite{Hoyer:2013ApJ...763L..33H} the authors argue that the velocity dispersion observed among solar wind particles as wide as in CME eruptions cannot be caused by transport effects only and thus they present a model where sub-CME structures interact with one another to be accelerated to approximately the same velocity, thus producing a single acceleration profile. This version of the acceleration also allows the CME velocity to be consistent with the velocity of the rushing plasma in the corona and thus this version of the VCME CME can also be called the universal acceleration version of the CME. Which version of the acceleration is correct is impossible to say without knowledge of the nature of the interaction that produces the velocity dispersion, however both versions of the acceleration"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozskRef}), there is also a more abstract point of view that $p$-abelian vector bundles are relevant in categorical quantum mechanics. This perspective was introduced in \\cite{MaR2} (see also \\cite{KitaevContactSphericalProducts,Henningsen:2011vM} for a more classical perspective): $p$-abelian vector bundles are defined for general (non-cyclic) base curves $C$, and correspond to the inclusion $U_m\\subset End(E)$ as a $C$-morphism, where $C^{h}=U_m$. For a fixed base curve $C$, the category of $p$-abelian vector bundles on $N$ with stable rank $m$ restricts to the category of finite-dimensional vector bundles, and thus can be used as a categorical model for physical vector bundles. This perspective was used to construct some explicit examples of physical vector bundles, \\cite{MaR2,BaraudPublic} in particular in various fields outside of quantum topology. The use of this model was also important"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradients. For Landweber iteration, we refer to the efficiency of the gradient estimates as the \\emph{irregularity} of the function \\emph{which may be very inefficient!}; in our experience, an unefficient regularity can even render the global search ineffective. In Section \\ref{sec:regularity}, we will describe this problem in detail and provide solutions for it. For now, we briefly describe the problem so that it is part of our discussion. For the boundary conditions we used in Section \\ref{sec:stability}--\\ref{sec:regularisation}, the Landweber iteration starts returning a spurious local minimum. Since the function value at this minimum is strictly larger than that of the optimal solution, the response function returns a wrong evaluation. This failure was not expected.  As a temporary solution, we were forced to include a step size reduction step within Landweber iteration at the spurious minimum. This dramatically improved the quality of the output of Landweber iteration but at the expense of complicated iteration. We had to carefully control the negative step sizes used in the iteration to avoid descending"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, no matter where it is in the entire space, but the convergence is guaranteed to be true in a neighborhood of the sequence. The convergence of the reconstructed function and the norms will be always uniform in the whole space. The convergence of the parameters will be given in a neighborhood of the parameter space. Moreover, we also obtain some explicit update directions for each parameter. See Section~\\ref{TheTheoremN} for the detailed statement of the theorem. Consider the following example to demonstrate the local character of the establishment convergence theory in the nonlinear case.  We set the value of the parameter $\\ b = 0.1$ in the independent parameter case in Section~\\ref{TheExampleP}. Since the function $\\ u_i^{\\rm NL}$ is constructed by a finite sum of nonlinear functions based on second-order polynomial, we know that a 2-dimensional nonlinear vector field $ \\mathbf{x} \\ := (x_{1}, x_{2}) \\ := (\\cos2\\cdot p_1, \\sin2\\cdot p_2)$ can be generated from this model"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a biological or mathematical phenomenon in $\\Omega$ (e.g., a disease propagation model, a neural network dynamics model with simple biological interpretations or a model for ionic diffusion). Letters $s, n$ denote simple (possibly discrete) measurements obtained at a given number of discrete time steps $t_k=k\\Delta t$ where time step $\\Delta t$ is assumed to be sufficiently small and $i \\in \\set{1, \\ldots, D}$ denotes the channel or sensor through which the measurement is obtained. The measurements $y_j^i$ are further assumed to have zero mean and variance $\\sigma_j^2 \\in (0,\\infty)$ only for $i \\in \\set{1, \\ldots, D}$. We assume that the values of $g_j$ and therefore also of the measurements $y_j^i$ depend on time $t$ from a static underlying environment; this time dependency is modeled with a small positive positive number $\\Delta$, which is called the coarseness of the environment, and which is further described with three small positive"
        }
      ]
    },
    {
      "train_loss": 0.032439453125,
      "update_counter": 6000,
      "epoch": 193.51612903225808,
      "val_loss": 6.669921875,
      "val_entropy": 0.6282958984375,
      "val_ttr": 0.5983072916666666,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the relevance value in the timelines obtained with the configurations C1 and C2, respectively. The results were obtained by repeatedly pulling on the cluster during the relaxation (100 cycles) of the simulations of 10 runs for each configuration. In both figures we highlight the different values of the relevance value in the nodes at the center of the cluster, at the ends of the branches of the cluster and in the nodes that are the connection away from the cluster center (nodes that are on the outer edges of the clusters). In both cases, the figure with the configuration C1 shows that the time-evolution of the relevance value is quite stable except for cycles number 2 and 3 in which a node that is away from the cluster center gets close to the nodes in the cluster center and in the outer edges of the cluster. In addition, that node gets less relevant, getting closer to the new generated nodes. In addition, notice that the relevance value in the cluster center gets higher in cycles number 6 and 7 and lower in the outer edges of the cluster in later cycles. All those"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} Dirichlet problem. The top half shows the estimated error for the finite element solution of~( \\ref{eq:fluidproblem}) obtained as the solution of \\eqref{eq:GKBprob} by a direct numerical derivation. In this case the estimated error estimated by~\\eqref{eq:FluidTacit} works almost identically to~\\eqref{eq:FluidEstim}, since the solution is relatively simple. The bottom half shows the estimated error for the solution of \\eqref{eq:fluidproblem_n} with the homogeneous Neumann boundary conditions for the simple test cases $p=1$ or $d=2$. In this case the estimated errors are significantly different, since the solutions are very different. Thus, the a posteriori estimator works very well in detecting solutions with different errors. For the case of homogeneous Dirichlet problems, the estimator estimates the true error well for all values of $\\mu$, but it underestimates the error for the most complicated problem $p in \\{2, 3, 4\\}$. This underestimation is very unusual, since the solution $F"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid step size is determined by the time scale of the valve opening and closure, while the solid step size is required by the time scales of the plastic deformation that involves the edges, the cladding and the embedded track. To this purpose, we employ Generalized Minimum Residual (GMR) scheme of order $q$ on grid $Q$ (see \\cite{MR05}), where the residual is defined as sum of residual in each of the subproblems. The choice $q=10$ and $Q=10$ is made even though higher orders and grids could be considered for larger problem sizes. The stoping criteria is set to $q\\at=10$ and $\\tau_s\\at=0.0001$. When the fiber fiber is in contact with the cladding, which is assumed to be coated with epoxy, there is no reaction between the two materials and we assume that the cladding is rigid and embedded in the substrate. Therefore, the plasticity in this case is purely deformation related and caused by fiber motion and insertion. The plastic"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the second floor, which reduces to 1\\% when removing the noise in the estimates of the front-end parameters. The discussions in this section show that the proposed model shows steady and reliable results even in complicated scenarios. The parameters obtained by the VFI-DL method are shown in Appendix. The blue solid curve in Fig. \\ref{fig:dating} is the baseline system used for comparison and the orange dashed curve is the dream system with more than 3000 measurements. The more accurate floor selection and higher measured frequency, half-sides of the readings, and the vibration noise suppression in the parameter estimation stage of the algorithm, all help in the quality of the measurement system. However, the vibration noise always results in inaccurate measurements, even with the advanced measurement system, and in addition, the limited number of measurements restricts the potential of the measurement system. In Appendix, we also compare the method with other methods. The sensitivity of the estimation floor into the accuracy of the measurement frames is interesting, showing that the vibration noise has a strong effect on the measurements, which can be used as"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that the bound on the velocities from stationarity of the Gaussian velocity distribution is no longer valid for such structures as CMEs. The analysis of the velocities of CMEs in the Solar Wind Observation (SO) suite of instruments on {\\emph{Helios}} \\citep{Zapatta:1986ssq}, {\\emph{Helios}} Co-Aligned Portions of Electrons (HANE) \\citep{Koll:1981ssq}, and {\\emph{STEREO}} \\citep{2017SSRv..204...73B}, showed that CMEs have consistent lower velocity tail with a Gaussian distribution, but with a mean velocity significantly more negative than the mean of the distribution. Moreover, the tails of the distributions at large acceleration velocities are much more flat than the distributions with near-ballistic velocities. Thus, the problem of an unstable equilibrium may arise in two cases. The first is the occurrence of constant acceleration of the CME in a circular orbit, which is the case stated above. The second case is when the"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozskRef}), there is also a more abstract point of view that $\\cal N$ should contain only objects that are invariant under translational and rotational transformations (see e.g.~\\cite{JockersBook,JockersPietsch}). According to this point of view $\\cal N$ is not a proper subspace of $\\cal K$. In order to get a proper subspace of $\\cal K$, we had to define it with the help of the measure $L$. However, if we choose another measure $U$ on $\\cal K$, say the standard Lebesgue measure on the unit sphere $\\mathbb{S}^2$, the\u59aey-$\\cal N U $\\\u5c31\u662f\u76f4\u5750grape\u7f6e$\\cal N$\u7684\u7ebf\u6027\u7ebf\u5750grape\u7f6e$\\cal R$. Since the affine cone of a Euclidean cone is independent of the linear subspace where it is defined (see e.g.~\\cite{BersteinBook}), the\u59aey-$\\cal C ::=\\ affine cone of $N$ is the same when defined in $\\cal N$ and any other linear subspace $\\cal R$"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradients. For Landweber iteration, we refer to the efficiency of the gradient estimates as the \\emph{irregularity} of the function $\\widetilde{\\phi}(x):=\\phi(x)+W(x)$ which gradually gets worse as the function evolves towards its stationary point. For the effective performance of global search methods, we need \\emph{a gradient method that is stable to the irregularity of $\\widetilde{\\phi}(x)$} (quoted from \\cite{JER}). And by stable, they mean stable w.r.t. the speed of convergence of the method. The stable gradient method that meets this property is given by the normal Moore-Penrose solution of the inverse of the continuous-discrete gradient matrix $JCorv$ \\cite{Normal}. (This solution is also the parametric version of the Steffen-Almen method \\cite{Stoffen}.). This normal solution is better than normal in the case of irregularity due to the fact that it is optimal among gradient methods under the condition of weak consistency. The Moore-Penrose solution is calculated by $"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " in each parameter space, and one can always find a tiny parameter space in which the estimate does not converge. In contrast, the linear case is a global one and one can prove convergence in any space where the function is defined. This local convergence is also observed in several other GPC methods, such as Polak-Rackwitz \\cite{PR17}, GPC-IS and we observe that the semi-silent sets (i.e., those corresponding to nonzero optimal parameters) are in much larger scales than those in the silent sets, which are corresponding to zero optimal parameters. \\cite{LPR19} proposed a method to deal with this problem generated by the local characterization of the function value with respect to the parameter space. Moreover, in contrast to \\cite{LPR19}, who proposes a confirmation procedure to deal with\u65e0\u58f0\u96c6, GPC(\\ref{eq:GPC_eq}) converges silently to the optimal solution in any scale. For example, the parameter $\\bk{\\Theta}_i^{(0)}$ is the optimal solution for the given data $L$. However,"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a biological or mathematical process involved in the dynamics. A boundary condition is defined as $\\partial_\\nu \\varphi = 0$ on the boundary $\\bd \\Omega$, and the terminal time is defined as the time at which the system reaches a stationary state. By $k \\in \\mathbb{N}$ being the maximum time when the system is in motion, the user may provide a predictive control strategy of the system before the time $k$, which is used to control the system within time $k$. After $k + 1$, the system is updated via $A \\colon \\varphi_{k + 1} = \\varphi_k$, $B \\colon \\sb{\\Omega (k) } \\varphi_{k + 1} = g_j \\circ \\varphi_k$, and $\\partial_\\nu \\varphi_{k +1} = 0$ on the boundary $\\bd \\Omega (k)$, where $\\Omega (k)$ is a smaller domain containing $\\Omega$ within time $k$. The above process is called the time-evolving domain (TVD) method"
        }
      ]
    },
    {
      "train_loss": 0.03210205078125,
      "update_counter": 6250,
      "epoch": 201.58064516129033,
      "val_loss": 6.783203125,
      "val_entropy": 0.6036376953125,
      "val_ttr": 0.572265625,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the maximal negative peak value (corresponding to the magnitude and the sign of the QSO emission line profile, as described in Section~\\ref{peak_definition}) for the three configurations for each grid case for both cases. A quick glance at these figures reveals that our additional configurations provide constraints that can help in the selection of the best fitting QSO. For the case of case A (Figure~\\ref{comparison_whole_timeline_configuration_1}), we can see that for all the grids the acceleration first increases and then develops a plateau around 1~keV and 2~keV. The peak luminosity (see Section~\\ref{peak_definition}) of all lines for a given energy bin increases first and then stabilizes around these energies. This is a very useful information for the selection of the best fitting QSO for a particular absorption. For example, for the \\ifmmode\\mathrm{Ly}\\alpha\\else{}Ly$\\alpha$\\fi{} line at 1220\\AA we can see that the peak luminosity for all cases is maximum at time 500 in the"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} Dirichlet problem. The top half shows the estimated error for the finite element solution of~( \\ref{eq:fluidproblem}) obtained as the solution of \\eqref{eq:GKBprob} by a standard PG-GC method, while the bottom half corresponds to the estimated error of the original fluid problem~\\ref{eq:fluidproblem}. Note that the solution is computed on the same meshes for which we checked the a posteriori error estimator. For the fluid problem we observe that the PG-GC solution has a smaller error, which becomes noticeable for large values of $N$. We also consider the case of finite volume solutions of \\eqref{eq:GKBprob} obtained by a standard PG method. In Table~\\ref{fluid_residuals_uniform_pg} we report the a posteriori error of the finite volume solution for the Dirichlet problem~\\ref{eq:fluidproblem} corresponding to the same meshes. Since the solution of \\eqref{eq:GKBprob} based on finite volumes is much more delicate, the error in the solution is generally much larger"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid step size is determined by the time scale of the valve opening and closure, while the solid step size is required by the time scales of weathering micro-reactions and the plastic deformation. Before the next time step, the multirate model is solved by using the adapted step sizes. This scheme is summarized in the panel (iii) of Figure \\/3\\. Note that the fluid step size is adapted based on the time scale of valve opening and closure, while the solid step size is adapted based on the time scales of weathering micro-reactions and the plastic deformation. Note that we allow the multirate model to solve subproblems, i.e., some rates contain different time scales, and the time step sizes for each subproblem are adapted separately (see panels (i)--(iii) of Figure \\/3). This approach combines the approach of Early and Plagemann (1990) with the multirate time-stepping methods of Stanimirovic and Phillips (2001) and Stanimirovic and Phillips (2002)."
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "3\\% on the second floor, which reduces to 1\\% when removing the noise in the estimates of the front-end parameters. The results are summarized in Fig.~\\ref{fig:frequency_integration}A. The biases on the first and second floor are, respectively, $-8.0\\pm 6.7$ dBm/km and $-7.7\\pm 5.0$ dBm/km. To obtain this reduction in floor errors, the EE framework needs to be applied on the results of the PLS. This is shown in Fig.~\\ref{fig:frequency_integration}B. As can be seen the error between any two floors is below 3\\%. Note that the ground level has not been used to estimate the errors as the noises in the measurements tend to, generally, cancel instead of reducing. Nevertheless, the errors between the ground level and the first or second floor are respectively 5.2\\% and 4.5\\%. The error between any two floors and the error between the ground level and the first or second floor have a global median of 2.5"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that the bound on the mass transfer rate, which is still assuming an equilibrium, is much too large to allow direct violation of the WD equilibrium. However, in the recent study by \\citet{Cheng2018}, the authors conclude a discussion stating that the merger of a closer companion, which is not necessarily visible to heliography, does not allow for a well-defined orbit in the traditional sense. Thus, the\u4f5c\u8005 conclude the merger of a closer companion, which is not necessarily visible to heliography does not allow for a well-defined orbit in the traditional sense. Thus, the authors conclude the merger of a closer companion, which is not necessarily visible to heliography does not allow for a well-defined orbit in the traditional sense. This is true, as neither the traditional radial orbit or the EOB orbit is able to constrict its function of the distance from the companion, which grows without bound with the merger of the secondary. However, what the authors fail to consider is the fact, that an non-equilibrium mass transfer changes the orbit significantly in a short period of"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKotozskRef}), there is also a more abstract point of view that $p$-abelian VIRGAS result should follow immediately from the existence of a commutative conformal metric (see \\cite{CarlsonCurelyRes} and the related literature). This is because a conformal metric is characterized by its trace and the inverse, and the $p$-abelian structure of the trace and the inverse is identical to the $p$-abelian structure of the cone. From this point of view, the existence of a commutative conformal metric is the same as the existence of a tangential $p$-abelian cone. The conformal metric is used to define the norm in the final form where it meets the point of view of the analytic graph theory, but this way of interpreting $p$-abelian VIRAS theorem was started by \\cite{CarlsonCurelyRes} to avoid finding a particular symmetric product in certain non-trivial cases. The result itself was first argued immediately from the abstract $p$-abelian structure by \\cite{Stanley}. The reason for this argument is that the $p$-abelian"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the J-functions derivatives, which in the case of the Landweber iteration are provided by the initial value of the gradient gradient matrix. The initial value of the gradient matrix is automatically produced by the procedure of finding the first local minimum by the Fletcher-Reeves method, even though this minimum is not actually a minimum of the actual J-function. In fact, this first minimum is produced by letting the gradient matrix evolve for a very short time, and therefore the matrix is not contaminated by derivative noise. However in it the diagonal of the gradient matrix is not updated, since it is known to be accurate. This behaviour seems to be compatible with the suggestion in \\cite{Ge2014} not to trust the diagonal of the gradient matrix, even if it is known accurately. However we believe that this action would result in a suboptimal gradient matrix, since the effective time for which the evolution is simulated in this approach is indeed the actual iteration number, which grows with the propagation of false gradients in the matrix. In our view this propagation is motivated by the effort to not to contaminate the non"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, no matter where it is taken, but one can not prove convergence of all sequences. In fact, the main proof technique of the nonlinear convergence theory is based on the linear convergence theory and the stability analysis of the sub-spaces, which can only control the convergence of a given sequence. Compared with the linear convergence theory, the main difference of the nonlinear convergence theory is that one should deal with a direction distribution in the direction space, and the convergence rate is also dependent on the distance between the two corresponding initial states. As this study is the first version of the theory of the nonlinear case, the convergence rate is only presented in a vague manner for the direction space convergence. In the future version, the objective is to further refine the function space and the direction space and improve the linear and nonlinear convergence rates. In addition, the nonlinear analysis is developed based on the linear and nonlinear cases together, so one may consider more explicit comparisons between the linear and nonlinear cases. One important issue to improve the rate of the nonlinear case is to establish a richer stability matrix for the nonlinear case. In the linear case"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a biological or mathematical phenomenon in $\\Omega$ (e.g., a disease propagation model, a neural network dynamics model with simple biological interpretations, or a model for molecular kinetics). The aim of a numerical simulation is to obtain the evolution of $g_j$ when it is perturbed by small noisy values $\\varepsilon_j$ at a frequent, yet sufficiently short, time interval. A number of numerical simulations, where the values of the perturbations $\\varepsilon_j$ are sampled randomly, and the numerical simulations are run in independent trials, are called experiments. The collection of all the performed experiments is referred to as the collection \\textit{${{\\mathcal E}}$}, with ${{\\mathcal E}} = \\{ (c_1, \\dots, c_m) \\subset \\Lambda \\subseteq \\Omega $, $m \\geq 1 \\}$, where $c_j$ are the values of the perturbations ($\\varepsilon_j$ by the discrete version, e.g., $c_j = \\left. \\varepsilon_j \\right|_{t=0.25\\tau}$ with the sampling"
        }
      ]
    },
    {
      "train_loss": 0.03297265625,
      "update_counter": 6500,
      "epoch": 209.6451612903226,
      "val_loss": 6.701171875,
      "val_entropy": 0.599609375,
      "val_ttr": 0.59375,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the system's time-correlation function along the two paths of the same molecular chain for two different values of the microscopic parameters. For $\\lambda = 10$ and $\\gamma = 1$ (Figure~\\ref{comparison_whole_timeline_configuration_1}) the resulting curve resembles the corresponding result for a homogeneous chain of atoms (not shown here), as it has a similar pattern: the function max(S(t, d)) = x for 0 \u2264 t \u2264 25 time units and after that the correlation time extends until about $25 + \\gamma = 28$ time units. In this case the function max(S(t, d)) = x is satisfied for about 20 time units, which means that the model under consideration is indeed able to capture the main feature of the original function. However, for a chain of atoms for which $\\lambda =10$ and $\\gamma = 1$ the peak in the middle becomes much less pronounced and the correlation time extends almost up to $25 + \\gamma = "
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{linear} FVM solutions. We choose $\\Omega=([-2,-1],[0,1])$ with $n_{x}=n_{y}=2$ and $c_{0}=1$, which leads to an absolute norm of the solution matrix $A$ around $10^{-2}$. The values of the a posteriori probability are very close to the corresponding values of the residual norm. We observe this particular behavior for the private cloud and the hybrid estimator. On the one hand, the private cloud only depends on the residual norm of the solution vector. This allows for a quick computation even for large values of $n_{b}$. On the other hand, the hybrid estimator complements the residual estimator with a probability based on the history of the ghost node averages. We prove the consistency of the private cloud to estimate the a posteriori error for the normal flow problem in Section~\\ref{simulationresults}, Table~\\ref{fluid_residuals_uniform_normal}\n. In particular, the values of the private cloud used for estimating the a posteriori error are close to the values"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is solved using a lower rate with a step size which is adapted to the solid rate using the proposed algorithm in Section \\ref{sec:adapt_step_size}. Since, adaptation to a fixed rate is not possible in the solid problem, a higher rate is used to solve the solid problem. This, however, may result in high solve times for the solid problem. Therefore, we propose a scheme to generate task workloads that divide the solve time of the solid problem into several parts, where it is solved as a multirate problem using the higher rate. The work done during other times is stored in an auxiliary buffer and is combined with the output of the solid problem when the next period starts. By this method, not only can the solve time of the solid problem can be divided, but also the solve time of the fluid problem can be reduced. We should mention that, since a multirate operator is used to solve both problems, the corresponding solution vectors are generated in multirate fashion. Therefore, it is necessary to apply an error correction process to correct the rates of solution"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": "2\\% on the second floor for the $2.4$ GHz mockboard. This is caused by the similar symbol power of weak weak links on the lower floors of the model. In addition, the average channel power on the lower floors is higher than that on the first floor. This result further justifies the assumption that the signal power on each transmission line follows a normal distribution. We also study the performance of the algorithm in $3.6$ GHz in Fig. \\ref{fig:evaluation_3_6}. The number of measurements is varied from $5$ to $20$ and the average coverage distance on the first floor is $3.57\\text{-}0.23$ m. The average error rate of the algorithm in terms of JPD is shown in Fig. \\ref{fig:evaluation_3_6_JPD} and the average number of updates shown in Fig. \\ref{fig:evaluation_3_6_sizeof_array}. The average error rate of the algorithm mainly increases when the number of measurements varies from $5$ to $20"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that the CAD should be used to determine the initial state of a CME event, rather than an empirical analysis. Using the same process as in \\citet{2012ApJ...754..127V}, they find that encounters with greater alfv\\'enicity are more likely to be counter-clockwise rotating events. However, this is not a true indication of a favorite rotation, as alfv\\'enity is a function of the surface magnetic field, and therefore is not consistent among events. Moreover, the MHD theory of CMEs and coronal wind origin is complex and the initiation location and initial rotation of a CME is likely to be a result of an intricate process involving interaction with the host system, evaluation of mechanical energy, and conversion it to kinetic energy of the ejecta. A complete theory of CME initiation and evolution will likely require understanding of all these processes and may require future separate studies. The rapid rotation of CMEs is most likely a consequence of propagation and rotation of the ejecta, and the HCS does not play any significant role in the process. Throughout the motion"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacherKippenhahn,Kippenhahn1990,Gazzola2015} and the references therein), it turns out to be in some cases too simplistic. In the context of quantum field theories, the interaction between the fields and the particles is mediated by a class of interacting many-body systems (often called a ``physical\" layer), such as quantum fluids (e.g. DOPs or fluids of bosons and fermions), or classical fluids (e.g. fluids of point particles or fluids of balls). The interaction between the fields and the particles is realised through a procedure (usually called an \\emph{abstraction}), which creates a secondary unstable particle (e.g. a DOP) from the field and subsequently collapses it in the physical state. This interaction is realised via the interaction between the fields and the secondary unstable particle, such that the many-body system of the physical layer becomes $H^{\\text{phys}} = H^{\\text{physical}} + H^{\\text{inter}}$. The interaction $H^{\\text{inter}}$ typically involves"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the Jq-functions. For the Landweber iteration, it estimates the current function value by the average of the node values on the current iteration's path, which includes both function values and gradient values. It turns out that this heuristic can produce a spurious first local minimum as the initial path includes a node which contains a large function value. However, by studying the graph with large function value as in Figure \\ref{Gfp}, we can see that the direction of preference from such a node to following nodes has a wide range. This means that the average of the node values will be the function value with large absolute value. In fact, we confirm that the effective heuristic this node produces is of Landweber iteration type. For the GKB algorithm, the effective heuristic also include rule of double gradient check. We extend the GKB algorithm by including the edge where the node contains a node with large function value and edge weight is the negative of the average of the node values. However, from the analysis of generating this graph, we can see that the effective direction of preference between two nodes with large"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, no matter where in the total space $\\mathbb{R}^p\\times\\mathbb{R}^q$ the iterations of the algorithm converge, is guaranteed to be attracted in some neighborhood of an arbitrary initial condition by the function space $H$. \\cite{S2} presents the proof of local convergence in the nonlinear case. The existence of global convergence is also discussed in \\cite{S3}. It is worth mentioning that the linear convergence with a constant factor does not depend on the initialization choice. However, the local convergence in the nonlinear case does depend on the initialization choice, however this dependence is canceled in a average sense as shown in \\cite{S4}. In conclusion, the local convergence in the nonlinear case holds with a constant factor that is only dependent on the initial condition and the parameter choices, and moreover, this dependence is eliminated by taking an average over all initial conditions. The convergence rates in the linear and nonlinear cases are comparable. The convergence rates due to parameter suboptimality and initialization noise are also comparable in the linear and nonlinear cases, as we will show in Section"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a biological interaction in the environment. Such interactions are usually structured dependent and depending on the environment. Assuming that the biological interaction does not change quickly, the dynamic model of the form $( \\ref{eq:PSEIM}) $ is permanent for the whole period. As a result, the dynamic model $( \\ref{eq:PSEIM})$ is also called a steady-state model. The steady-state model is the abstract framework to describe the evolution of multi-population, single-individual organisms in an adaptive environment. To make the model account for specific behavior observed in the environment, we introduce a shape $S : \\Omega_{t}  \\rightarrow  \\R^{N}$ that models the (possibly) adaptive behavior of the individuals, which is called the function field or vector field in this paper. It indicates the direction of evolution of the organisms. When the domain $\\Omega_{t}$ stays unchanged, the function $S$ depends on the time. Specifically, $S$ is modeled as a smooth function on the domain $\\Omega_{t}$ with smooth derivative. Following the notation of This"
        }
      ]
    },
    {
      "train_loss": 0.0567724609375,
      "update_counter": 6750,
      "epoch": 217.70967741935485,
      "val_loss": 5.7734375,
      "val_entropy": 0.70654296875,
      "val_ttr": 0.5885416666666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the degree sequence for the optimal and for the greedy configuration for two different values of the intervention time $t_{\\mathrm{max}}$: $t_{\\mathrm{max}}=2$ (Figure~\\ref{comparison_whole_timeline_configuration_1}) and $t_{\\mathrm{max}}=3$ (Figure~\\ref{comparison_whole_timeline_configuration_2}). Notice that for $t_{\\mathrm{max}}=3$ the two sequences are quite different, thus in the remaining parts of this section we focus on the case $t_{\\mathrm{max}}=2$. Also in this case there are some differences for the optimal and greedy strategy. The first observation is that the first intervention was crucial for both the optimal and greedy sequence. This is a consequence of the fact that in both sequences the nodes with highest degrees are the ones that should be intervened first (except for the first intervention). After the first intervention, the only important information left for the subsequent interventions is the node right across from the intervened one. In other words, the latter should be connected to nodes of low degree (see"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\textbf{Equilateral} problem with $\\Omega=([-2,-1],[-1,0])$ which corresponds to a triangular prism with equal edge lengths and equal vertex prices. We set $r_i=100i$ (see step 1a in the procedure for a motivation), $T=5$ and $N=100$ (see step 2 in the procedure). We set the maximum number of iterations for the a posteriori error estimator to be 100. As a consequence, we set the percentage of iterations to compute the estimator at each time step as high as 1\\%. We set the subproblem steps as $s=100$ and $\\tau=0.01$. We also set the minimum length of the time steps to be equal to the step size $\\tau$ and the minimum residual to be equal to $10^{-3}$. We also explore other choices of the parameter list $(\\tau,\\sigma,\\gamma)$, where $\\sigma=10$ and $\\gamma=0.01$ on the"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The upper left panel of Fig.~\\ref{fig:stability2} clearly shows representative examples of stability (blue triangles) and instability (red squares) regions in the ($\\Delta t_{\\mathrm{fluid}}\\times\\Delta t_{\\mathrm{solid}}$) space for $m=1$ and $m=3$ on the left and $n=1$ and $n=3$ on the right. The horizontal lines are the boundary lines where a time step of 100 ns is used. Which region is appropriate for a given choice of step size is determined by the expected ratios $ \\frac{h}{h_n}\\frac{h}{h_\\mathrm{sn}} $. As illustrated in the lower panels of Fig.~\\ref{fig:stability2}, a constant material time scale $\\tau_\\mathrm{sn} = 1$ ns is considered. A stability analysis for ratios $ \\frac{h}{h_n}\\frac{h}{ \\tau_\\mathrm{sn}} $ is shown on the left, and for $ \\frac{h}{h_n}\\frac"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and $0.7\\%$ in $5.2$ $GHz$ at average separation between the floors on the FLX system of $1000$ m. Moreover, on the baseline model, the average error for all floors in $2.4$ $GHz$ and $5.2$ $GHz$ is $2.7\\%$ and $2.4\\%$, respectively. In contrast, the best model obtained by integrating the CCF presents average floor-wise errors of $1.7\\%$ and $1.7\\%$ in $2.4$ $GHz$ and $5.2$ $GHz$, respectively. The CCF obtains up to $30\\%$ less error on average compared to the baseline model in $2.4$ $GHz$ and $5.7\\%$ in $5.2$ $GHz$. Besides, it presents less errors on all frequencies and towers compared to the best model obtained by integrating the FCF. Moreover, the top two best models in $2.4$ $GHz"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted, Valgushev:2015Thesis} it is shown (using the analysis of metric properties and the evaluation of the multivariate Rayleigh--Rayleigh distribution) that the direction of the propagation changes for the ICME-CME system during the evolution. The ICME gradually becomes a CME followed by a new, simultaneously formed ICME. Such a mechanism is consistent with the realistic modeling of propagation described in \\cite{Valgushev:2013ApJ...767..183A}. This is because the realistic modeling of the atmosphere models Sunward flux tubes as well as particles along the flux tube. While the flux tube representation ensures fluid nature of the propagation, preserving the hydrodynamic analogy, the exact copying of values from one modeling component (interplanetary shock) to another (interplanetary medium + heliosphere) keeps the physics and ensures that streams particles along the flux tube. However, this is not consistent with the original equilibrium version of the CME. In this version, the same CME (with the same direction) propagates from the inner (acceleration +"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,Press1996,Daivd-Manin,Stanley1986,Almkvist,Almkvist1998,Forster},) there has been a recent demand for geometric applications of ACM in which one starts with a codimension one locus $Z$ of a given symplectic variety $\\mathcal{X}$ (e.g. the solution space of a set of ACM constraints), and tries to study the symplectic variety $\\mathcal{X}$ on the basis of the geometric properties of $Z$. Such applications include new proofs of classical results on symplectic topology (e.g.~\\cite{Forster:JPoly201,Forster:JPoly2011,ForsterJacobs}), as well as more recent results (e.g.~\\cite{ForsterJS2012,ForsterJS2013,ForsterJS2016a,ForsterJS2016b,ForsterJS2018}). In order to obtain information about the underlying symplectic variety $\\mathcal{X}$,"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient and the hessian, and on the choice of the learning rate $\\eta$. For example, the Landweber iteration \\cite{L07} is an effective variant of GLL for complex $f(w)$, when the subordinate function $g(w)=-\\frac{1}{\\eta^2} \\sum_{i=1}^n (\\eta \\frac{\\partial g}{\\partial w_i}(w))^2$ is relatively smooth due to the Cholesky decomposition. But this decomposition is not valid for the smallest eigenvalue $\\lambda_{t,i}$ of the hessian $H$, thus the corresponding derivative $\\frac{1}{\\sqrt{g(w)}} \\sum_{i=1}^n \\lambda_{t,i} \\sqrt{g_{i}(w)}$ is not reliable. This leads to the appearance of a spurious local minimum for Landweber iteration, see Figure~\\ref{fig:local_min_Landweber}, which is far away from the real minimum. However, we argue that this does not really matter: first, the"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a sequence, which is derived from a initial function that is perturbed by a small noise. The linear case and the linear-nonlinear case are all convergent, local or global, while the hybrid linear-nonlinear case can be convergence, local or global, see Theorem \\ref{ltm}. However, we do not consider the comparison between the convergence norms in different cases in this paper. See Conclusions for more discussions. One may also notice that the convergence rates in the nonlinear case are larger than those in the linear case, since a perturbation noise will add additional steps during the iteration. For example, if the initial function goes to a minimum value, it may pass through several valleys on the way to a local minimum, which takes $O(h^{10})$ steps in this model and needs $2^{r}$ steps for each single valley for $r\\in[1,9]$. Another important issue is that the nonlinear case also has convergence rates that are faster than usual since it is a higher order method. For example, if the initial function goes to a flat area in Figure \\ref{fig"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbation such that the original system $\\Delta_N^{\\Omega} f_c$ corresponds to $\\Delta_N f = g$ with $g=f$ on the boundary. If the perturbation $g_j$ is mild enough, the system $\\Delta_N^{\\Omega} f_c$ and $\\Delta_N f$ have similar solvers. The finite difference method described in \\cite{Zeeman1995} works in all dimensions $N=2,3$, and on a grid that is compatible with the boundary condition. We apply this solver to solve $\\Delta_N f = g$ on a grid $grid(N) = \\{j=1,\\ldots,M\\}$, yielding an output vector $\\widehat{Y}_{N,j}$ for $j=1,\\ldots,M$. In each grid point $j$, the value $\\widehat{Y}_{N,j}$ models the solution $g_j$ at that grid point. We define the filtered grid $\\{G(N,k)\\}_k$ as the subset of grid points $j$"
        }
      ]
    },
    {
      "train_loss": 0.03525537109375,
      "update_counter": 7000,
      "epoch": 225.7741935483871,
      "val_loss": 6.34375,
      "val_entropy": 0.6314697265625,
      "val_ttr": 0.6139322916666666,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the reconstructed mass with the exposure for the two configurations for a sequence that was taken in the third orbit, but where the exposure was low (second row of Figure~\\ref{comparison_whole_timeline_configuration_1}) and high (second row of Figure~\\ref{comparison_whole_timeline_configuration_2}). The changes in the reconstructed mass as a function of the exposure are shown in the top-left panels. These show that for the two configurations the reconstructed mass is sensitive to the exposure, with the reconstruction method of the dense sequence (configuration $\\ell = 2$) performing better than the reconstruction method of the sparse sequence (configuration $\\ell = 5$). In the bottom-right panels we show the reconstructed mass measured from the dense sequence (configuration $\\ell = 2$) as a function of the measured VLA brightnesses and for different exposures. This shows the reconstruction time as a function of the exposure, and the time change is visible in the top-right panel, where the arrow points to the time when the mass changed. The change in time is shorter than the exposure"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} convergence of $\\Omega$ and $\\mathbf{v}$ in the uniform norm for a set of HLL alternatives that are mutually compatible with a chosen set of HLL alternatives for \\textbf{python} and \\textbf{ MATLAB}. Note that the a posteriori error estimator corresponds to that discussed in Section~\\ref{sec:uniform_error_estimate_table}. According to these results, we can conclude that the error on the approximate solution computed using the uniform time meshes can be estimated by an algebraic error, where the algebraic error depends on the tolerance and the number of iterations. We can also observe that the error estimator works uniformly for all values of the relative initial volume $\\mathcal{D}/C$ and the relative velocity coefficients $\\alpha$ and $\\beta$. For example, for $\\mathcal{D}/C=0.1$ and $\\alpha=0.5$ and $\\beta=0.5$, we have uniform convergence of order $10$ for $N\\in\\{1,2,\\ldots,250\\}$. This sequence satisfies Condition~C-Q"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For the fluid problem, the choice of a multirate partitioning is done by considering the frequency content of the discrete analog of the continuous analog of the partial differential equations, i.e., the derivatives. A graphical overview of the overall computational load induced by each of the problems under consideration is shown in Fig.~\\ref{fig:multirateVisualization}. The overall numerical overhead compared to a single-rate scheme is negligible. We note that a similar multirate partitioning has been used for solutions of simulations of uncertain nonlinear dynamical systems \\cite{LiLiu2011}, uncertain nonlinear dynamical system experiments \\cite{Chamola2014}, and mixed multirate problems \\cite{HuangShelly2011}. To perform parallel multirate time-stepping schemes for fluid and solid among multiple processors, we use the multiprocessor communication-centric architecture (CNA) \\cite{Urlich2011} to build a concurrent multi-processor grid scheme which shares the time-stepping operation across processors. For demonstration, the fluid and solid multirate"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and $0.6\\%$ in $5.2$ $GHz$ at average separation between the floors on the RF map of $200$~m. This result seems to be promising, but the actual ultimate accuracy is limited by the fact that the motion estimation is computed by finding the consensus on the submaps found in each floor. This consensus step not only increases the number of, but also the complexity of, the image comparisons. Moreover, it requires extracting similar patches or regions from the images having different submaps as well as synchronization of the images. Although the patch or region matched images do not necessarily need to be the same image, as seen in the $3.75$ GHz floor, in our system the different patches or regions from the different images, extracted from the localizations on the different floors using the RF map, are synchronized by the timestamps on the images. Thus, when the snapshots are not scanned, the problem of floor changes on the RF map is not easy to solve, even if the patches or regions from the different"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that a single synchronized equilibrium exists for a corps ring system to be stable, and thus, there is no process capable of outdating the corps and ejection of the CME. However, as pointed out by \\citet{Shi:2017isa}, concurrent evolution of orbits means that synchronized orbits will not occur in most cases. Moreover, with the right aspect ratio, coronagraph images of corps can show that many have tangential components as well as radial components near the Sun. Tangential corps are less likely to synchronize with the rotating surface \\citep{Horne:2011xj,Shi:2014JGRA..119.1572S}, and thus, the synchronized equilibrium hypothesis is invalid. Furthermore, data from {\\emph{STEREO}}-A show that most CMEs are followed by subsequent CMEs rather than one leading to the ejection of the corps at the CME's point of launch. \\citet{Zhang2016 ejectedCME} used archival {\\emph{STEREO}}-A observations to show that"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,PrincipalComponentTheory,Benaglia:2013p407}, \\cite{Donati:2011p410}),  the ability to extract this information is not automatic.  In order to overcome this problem, we have developed a novel procedure to interpret the CT image.  We apply a recent method to reconstruct the motion of the rapid rotation point \\cite{Donati:2011p410} from the tangential cone information. The method was originally proposed for the reconstruction of the heartbeat in $3$D by visualizing the rapid motion of the blood in the left ventricle through a CT image of the heart. The method is based on computing the tangential cone while at the same time assuming a rigid body rotation of the object (see below). The advantage of this approach is that the tangential cone information is extracted easily and clearly, without the concern of the $3$-dimensional location of the rapid rotation point. After the reconstruction of the motion, the correct orientation of the elastic body can be determined and then the magnitude of the stretch ("
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient and the hessian, and on a choice of a parameter in the power iteration. Therefore it does not hurt to compare the Landweber iteration to its spurious first local minimum. For the current initialization, it does produce a different result than the standard version. For every $k$ the effective norm of the gradient at $k$ is plotted against time $k/L$ in Figure \\ref{fig:landw_spur_first}. As expected, it suddenly drops to nearly zero at the first stationary point. However, that is not the case for the spurious first local minimum, which results from using the initial optimization solution for the effective iteration. Effective norm of the gradient corresponding to $k=10$ is shown in the left panel of Figure \\ref{fig:landw_spur_first}. Obviously, it does not reach a low value until the last Landweber iteration, which in this case is $k=32$. However, as can be seen from the right panel of the same figure, when the effective norm of the gradient is small after"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a sequence, which is derived from a relative iteration space starting from a original given basis, not the standard basis. In this case, we can prove that the convergence rate is same as the linear case, in terms of convergence subspaces and rates, but with an additional nonlinear correction term. \\cite{c3} and \\cite{c2} confirm this theorem with numerical experiments. \\cite{c3} also proposed some strategies to further decrease the convergence rate in the nonlinear case, by using outside iterations and spectral methods.  Moreover, it is interesting to notice that in the nonlinear case, the convergence rate is dependent on the convergence rate of the standard basis starting from a generic initial basis, and only in the linear case can it be universal and independent of the initial basis.  It is also interesting to note that, since we can remove all the using the working subspace of the standard basis from the relative space, the relative space does not need to borrow from the standard space in our method. For these aspects, the standard method is similar but not exactly the same as the nonlinear method so we will"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbation of the domain $\\Omega$, e.g. a perturbation of the roof function. Moreover, we consider a boundary condition $S$ describing the boundary situation of the problem at $x \\rightarrow \\partial \\Omega$, which can be modeled by a boundary perturbation $(\\delta \\Omega)_{\\partial}$ applied to $\\Omega$ together with a perturbation of the boundary condition, e.g. by introducing small errors $\\epsilon_i$ in the boundary parameter $\\lambda_i$. In this model we assume that the perturbation $g_j$ and the perturbation $S$ are linearly related to the small perturbation $\\sum_{i} \\epsilon_i$, i.e., $g_j = \\alpha_j \\sum_i \\epsilon_i$ and $S = \\beta_j \\sum_i \\epsilon_i$ with certain $\\alpha_j$'s and $\\beta_j$'s (which could be different for different $j$'s). The assumption about the structure of $g_j$'s and $S$'s is motivated by our iterative"
        }
      ]
    },
    {
      "train_loss": 0.03224560546875,
      "update_counter": 7250,
      "epoch": 233.83870967741936,
      "val_loss": 6.552734375,
      "val_entropy": 0.6124267578125,
      "val_ttr": 0.60546875,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the reconstructed mass with the exposure for the two configurations for a sequence that was taken in a very busy time, i.e.\\ the sunrise time in Solar Observatory, and another in a less busy time, which we configured in the pre-summum maximum in the Mount Lemmon SkyCenter observatory in Arizona (see Figure~\\ref{fig:title_fig}). In the first plot (Figure~\\ref{comparison_whole_timeline_configuration_1}) we show the results for the sequence with 1000 ms exposure (configuration \\ref{configuration-1}). We can see that the  \\textsc{EsoVoyager} timeline is able to mark all the activities of the PLS in a ordered way. Furthermore, both the detector yield and the flux cut (in Section~\\ref{sec:detector_yield_and_flux_cut}) remain stationary for the whole  activity, and this in turn allows us to compare more accurately the reconstruction results for different exposures. As a result, the  steady yield and flux cut values seem to have an effect in the"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} convergence of $\\Omega$ in size and roundness for a homogeneous, incompressible, Newtonian fluid. We set $\\nu=0.05$ and $p=2$ in~\\eqref{eq:fluid_conditions}. The source terms in~\\eqref{eq:inital_conditions} are given by $f(x,t)=x_e$ for $x_e=(1,0)\\in\\mathbb{R}^2$ and \\(t\\in[0,T_{\\rm max}]\\), where $T_{\\rm max}$ is the time maxinum as given in~\\eqref{eq:time_max}. In this case, we set $T_{\\rm max}=0.25$ to make the data of similar size as the volumes $Volume_i$ in the example in the first column of Table~\\ref{fluid_residuals_uniform_equal}. We can see that the estimator converges at the \\emph{geometric} rate $c_r=1/2$ both for the volume error $\\|\\Omega\\|_{Volume_"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For the fluid problem, the adaptive stepping scheme has been thoroughly discussed in \\cite{Hansen2003a}, and adapted stepping schemes (in particular iterative scheme) for other problems have been discussed in \\cite{Hansen2003b}. The upper layer in the multirate framework responsible for the fluid boundary conditions is changed from a slow subproblem to a fast subproblem in such a way that the time scales for solving the fluid problems are matched with those for solving the rest of the problem. This is done by introducing an additional layer below the fluid layer, which solves a fixed linear PEC boundary value problem at the fluid-solid boundary. In the current system, the value of the velocity is continuous, and only the matrix A changes, while the sources q and b are constant. Therefore, we change the format of the boundary value problem, and solve a linear Fredholm integral equation of the first kind for the transition data. In this way, we keep the time scale for the fluid boundary value problem identical to that for the solvability matrix. Since the iterative stepping scheme"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and  $3.7\\%$ in $5.2$ GHz when the clustering method is applied. On the other hand, system error rates in the $5.2$ GHz band are somewhat higher than those in $2.4$ GHz, which is shown in Figure \\ref{fig:graph3} in the Appendix. The highest system error for the $2.4$ GHz band is $2.2\\%$ for both the tracking frame and clustering method. Moreover, the clustering method has a much stable performance in terms of system error rate, and the minimum system error between the planning frame types is less than $1\\%$ for all frequencies, see Table \\ref{tab:table3} in the Appendix. The results are in good agreement with the results of the mast-top location in the previous section, and show the importance of using Inertial SVP along with GPS to reduce the error rates. In addition, the error rates presented here are achieved using a planning frame with  $15$ seconds average motion duration to"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors propose an improved method for determining the equilibrium condition of a CME. This method gives values of the radial coordinate of the center of the CME at the time of the maximum coronal expansion, as well as its maximum radius. Using this new method, the authors re-calculated the parameters of the CME from \\cite{2013ApJ...776..164J} for a check. However, re-carrying out the calculation, the authors obtain different results: $r_{2}(t_{max}) = 16.25$\u2032,$z_{2}(t_{max}) = 12.25$\u2032,$M_{2}(t_{max}) = 3.5\\times 10^{10}$ g compared to $r_{1}(t_{cl}) = 12.75$\u2032,$z_{1}(t_{cl}) = 10.25$\u2032,$M_{1}(t_{cl}) = 1.9\\times 10^{10}$ g from \\"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,PrincipalComponentTheory,Benaglia:2013iii,Fazi:2013y,Faraon:2015}, and references therein), in this work we will focus on the case in which the tangential condition holds uniformly for all rotations, and we will consider the subset $\\mathcal{U} \\subset \\mathbb{R}^d$ where all vectors belong to the same vector subspace $\\mathcal{U} \\subset \\mathbb{R}^p$. We denote by $\\boldsymbol{0}_{p,d} \\in \\mathbb{R}^p \\times \\mathbb{R}^d$ the zero vector in the dimension vector (i.e., the vector of length $0$ with $p$ ones and $d$ ones) and we set $\\mathcal{I} \\subset \\mathbb{R}$ to be a regular indexing set such that $\\mathcal{I}=\\{i_1,i_2,\\ldots,i_l\\}$ for some $l \\in [\\mathcal{L}]$ with $i_1<i_2<\\dots"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient and the hessian, and on a lower threshold $0 < \\tau < 1$. For the Landweber iteration, we assume that the performance test is satisfied for the local minimum at $(s_{m},i_{m})$, obtained after obtaining effective statistics from site $s_{m}$ using the rule $r_{m}$. Now let us consider the second site $s_{m+1}$ which means that $s_{m+1} = p_{m,1} = p_{m,2}$. If there is a path of steps of type $k(s_{m+1}) > k(s_{m})$, then we obtain a spurious local minimum for Landweber for updating rules $r_{m+1}$ using the performance test trivially satisfied at site $s_{m+1}$. This is related to the p-limitation: if $k(s_{m+1}) \\leq k(s_{m}$, then the performance test is trivially satisfied for all steps, and otherwise it does not. This leads to the p-limitation"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a sequence, which is derived from a given small initial data in a local sense. Moreover, the convergence rates are also local in nature and are independent of the nonlinear and linear subproblems solved from different initial data. On the other hand, the convergence rates in the linear case are same for all initial data and can be estimated from the solution outside the boundary as well as from the solution itself. Besides, the linear subproblem is solved in a global manner and one can only guess the accuracy of the solution which converges to the true solution. In this sense, the convergence rates of the nonlinear case are more special and only valid to for initial data which are derived from a given initial data by the Newton algorithm. However, we do prove in Theorem \\ref{thm:regress} that the optimal solution returned by the G-Newton algorithm is better than the one returned by the Newton algorithm. On the other hand, local convergence rates are easier to prove than the global ones so we keep them as such. We will show in Section \\ref{sec:experiment_nonlinearLocal} that the global convergence rates are"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbation of the domain $\\Omega$. We consider a viscous fluid in the domain $\\Omega$ driven by a self-dual Aurora phenomenon. The velocity of the fluid is $\\mathbf{v}(\\mathbf{x},t)=\\mathbf{E}(\\mathbf{x},t) + \\frac{1}{m_ \\Omega}\\mathbf{v}_{\\rm avg}(\\Omega)$, where $\\mathbf{E}(\\mathbf{x},t)$ is the electric field where the Aurora phenomenon is occurring. By self-dual Aurora phenomenon we mean a field of electric field $\\mathbf{E}$ with a frequency range from low- to ultra-high frequencies. This field corresponds to a homogeneous field oriented within the plane of polarization of the field, which is a requirement of the Maxwell equations, and by definition this field is of second order in the frequency. In self-dual Aurora phenomena the velocity of the fluid follows the electric field, not necessarily corresponding to a flow of the second order in the frequency, but including a high-frequency term. By a high-frequency term we mean a term of the order of the frequency of"
        }
      ]
    },
    {
      "train_loss": 0.0318369140625,
      "update_counter": 7500,
      "epoch": 241.90322580645162,
      "val_loss": 6.666015625,
      "val_entropy": 0.6007080078125,
      "val_ttr": 0.6028645833333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the reconstructed mass with the implementation of the different techniques we suggested. The configuration we call \\textit{configuration $\\alpha$ out of configuration $\\gamma$} in these figures means that out of the $\\alpha$ parameters in the configuration $\\gamma$ only one of them is turned on. For example, in the first figure, we have \\textit{configuration $3$ out of $\\gamma$}, which means that the first three parameters in the configuration $\\gamma$ are used. In this case, only the parameters $\\alpha_1$ are turned on. The mass estimation implementation we suggested in this case is based on the fact that in one synchronization event (i.e. in one ball input at the ACR) the estimation works in the same way as in the full configuration. In this way, we can estimate the expected modified cost given the modified parameters. This way of implementing the constrained estimation is only valid for a small number of balls in the buffer at any time. In this way, out of $\\alpha$ parameters only $\\alpha_1$ is turned on and we use the full $\\"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} convergence estimate $N_h\\leq c\\cdot h$ where $c\\in(0,1)$ is a constant. To assess the reliability of this estimate we proceed as in Section~\\ref{sec:numerical_applications}. That is, we extract squares from each of the time meshes and convert them into reference squares by using the same parameter $\\tau$ as in Section~\\ref{sec:numerical_applications}. For the method of Section~\\ref{sec:numerical_applications} we obtain reference fluxes $\\widetilde{\\mathbf{F}}^i_\\cdot(\\mathbf{x})$ for the squares $E_i$ by solving the same problem as for the current solution procedure. For the method based on the fluid problem of Section~\\ref{sec:problem} we solve the same problem as well. For that method we extract squares $E_i$ from each time mesh and compute the reference fluxes $\\widetilde{\\mathbf{F}}^i_\\cdot(\\mathbf{x})$ according to the system $( \\ref{eq:fluid_system}).$ Accordingly, for the"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For the fluid problem, the choice of a multirate partitioning is done by considering the frequency content of the matrix A of the Darcy flux, measured by the spectral norm of A$,i$, , for all rates i$,=1\u30fb\u30fb\u30fb\u30fbN$, at a given time step size $T$. See the discussion in \\cite{MR19871001}. As concerning the solid problem, the stepsize selection is done according to Lemma 5.1 in \\cite{MR19871001}. In both problems, it is mandatory to use accelerated convergence algorithms of nested time-stepping, that is, the step size at each rate is divided into several smaller ones, recursively processed by  QR-algorithms, using previous time step values, the accelerated linear-linear algorithm SLEC or its generalized version, using partial snapshots, \\cite{MR20679883}. For the fluid problem, we use the algorithm of \\cite{MR20679883}, which relies on the generalized eigenvalue decomposition ("
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and  $3.9\\%$ in $5.2$ GHz when the clustering method is applied. On the other hand, the error rate for the \\textit{on-the-fly} method is $0.6\\%$ in $2.4$ GHz and $6.5\\%$ in $5.2$ GHz. This can be explained by the fact that, in the \\textit{on-the-fly} method, the database is updated after the simulations are completed, while the clustering method dynamically updates the database during the scanning. Thus, the database generated by the clustering method is more close to the real database than the \\textit{on-the-fly} method. Furthermore, the clustering method has a much more consistent performance in $2.4$ GHz and $5.2$ GHz. Part of this consistency is due to the frequencies in the received scattered echoes there are similar. For example, in $2.4$ GHz, the highest frequency in $1\\mathrm{x00}$ and $1"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors argue that a single synchronized equilibrium exists for a corps ring system, based on the fact that numerical simulations have shown that corps rings always transition into a harmonically paired configuration. However, as discussed in \\cite{Sharkey:2012ApJ...754L..33S}, that configuration is not stable, and in fact, exactly the same argument can be made against existence of the two-time scale equilibrium proposed in \\cite{2013Sci...342..838M}. Moreover, corotating orbits for the companion bodies do not necessarily imply that the CME is corotating; a non-corotating CME can be maintained around the Moon for hundreds of days \\citep[see][]{2009SoPh..256..337L,2018JGRA..123.9101C}, and there is no dynamical reason why it should change direction as a result. Half time of a CME's orbit about the Moon can be defined as the point where it has returned to"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,PrincipalComponentTheory,Benaglia:2013p407}, \\cite[Review on p.7 of]{Kaltenbacher} and the review on Theorum 1.6 in \\cite{Jurgens:2000:A1}), in light of the discussion in \\cite{Kelner:2000:fpl} and in the work of P. Kaltenbacher (see e.g.~\\cite{Kaltenbacher,Kaltenbacher2:a}), there are various other natural geometric constraints that must be satisfied by the solution of a\u661f\u5e73\u53f0\u7f16\u53f7 $k$ \u7684\u6cf0\u52d2\u9525 $\\SS^1_\\bullet(k)$. Two such constraints are the axial constraint and the coaxially-invariant constraint. The axial constraint states that the axial angle $\\gamma^k$ of the solution $\\SS^1_\\bullet(k)$ must satisfy  Axial Constraint (\\ref{axialconstraint}) (see \\cite{Kelner:2000:fpl,Kaltenbacher"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradients. The noisy gradient estimation allows the state to move away from the true minimum, which may cause it to visit other local minima. The first local minimum selected by the landweber iteration is the least favorable case where the noisy gradient is directed towards the highest slope direction. This local minimum allows the algorithm to move away not only from the current solution only due to the noise but also due to the fact that the underlying true solution does not have any boundary term. Furthermore, this first local minimum has no performance index (Fig. \\ref{fig:landweber_performance}a) and does not improve over the initial value (Fig \\ref{fig:landweber_performance}b). Therefore, the first local minimum found by the landweber iteration is indeed a spurious minimum. In contrast, since the second local minimum selected by landweber iteration (Fig \\ref{fig:landweber_performance}c) has a performance index (Fig \\ref{fig:landweber_performance}a) and improves over the initial value (Fig \\ref{fig:"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, but one does not have a way to prove convergence of any other sequence. The established convergence theories in the linear and the nonlinear cases are, therefore, in general not transferable. However, as we will argue below, this local character does not limit one from obtaining a full control of the convergence. In fact, the local character of the theory does not prevent one from giving a full control of the convergence in any given case, by carefully shifting the sequences within the class and by using the established convergence theory. For example, let us assume that a sequence $\\{r^{(k)}\\}$ converges to a function $r$. Then any sequence $\\{p^{(k)}\\}$ with appropriate properties will converge to the same function $r$, by applying the shifting technique used in Lemma \\ref{shift lemma}. Moreover, if the function $r$ is arbitrary, we can safely assume that it is given in the polynomial basis and we can repeat the shifting technique to any polynomial basis function. The shifting technique can be applied infinitely many times so that we can apply the local convergence theory from the nonlinear case to any"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbations of the domain $\\Omega$. We consider a wave $f_0 \\in H^1(\\Omega)$ such that $\\int_X f_0 \\mu = 0$, where $\\mu = d_N^{-1/2} |_K$ is the positive measure of area of an isolated rough perturbation $d_N^{-1/2} $ of the measure $d_X$ on the rough domain $K \\subset \\Omega$. We consider a so-called \\textit{classical} rough perturbation, i.e. $K \\subset \\Omega$ is a rectangle in the attic with $d_K(A) \\geq d_N$ for all compact subsets $A$ of $K$. We assume that $f_0$ satisfies either condition $S_N$ or condition $C_N$ (see \\cite[Conditions S and C]{citeP}) with a rough parameter $N < \\infty$. We define the noisy copy $f := f_0 + h$, where $h$ is a noisy term in $H^"
        }
      ]
    },
    {
      "train_loss": 0.03162451171875,
      "update_counter": 7750,
      "epoch": 249.96774193548387,
      "val_loss": 6.767578125,
      "val_entropy": 0.5880126953125,
      "val_ttr": 0.5992838541666667,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the reconstructed mass with the implementation of the different techniques. We compare in Figure~\\ref{comparison_whole_timeline_configuration_1} the reconstruction results for the optimized configuration, where the specific parameters are given in Table~\\ref{sec:params} (configuration \\textit{configuration 1}), and in Figure~\\ref{comparison_whole_timeline_configuration_2} the reconstruction results for the optimized configuration where the fields are normalized by $n0$ given in Eq.~\\eqref{eq:norm_jt_field_max} (configuration \\textit{configuration 2}). In both figures, the obtained reconstruction results are compared with the reconstruction results obtained from using the unit vector \\eqref{eq:a_1} for determining the value of $m_1$ (green lines), and the unit vector given by Eq.~\\eqref{eq:a_2} (orange lines). In both cases, for the optimized configurations (configurations \\textit{configuration 1} and \\textit{configuration 2}) the reconstruction results using \\textit{configuration 2} are better than"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} convergence of $\\Omega$ and $\\mathbf{v}$ in the uniform norm on these meshes. We compare the estimator with the a priori estimates $E_O+\\varepsilon$ and $E_O+2\\varepsilon$ from \\cref{E-order}. Note that $\\varepsilon<1$ as already mentioned. Since we do not assume alignment it should be clear that the estimator is indeed geometric mean of $E_O_{\\mathbf{v}}$ and $E_O_{\\mathbf{v},\\mathbf{u}}$. If we assume alignment, i.e.~$\\mathbf{u}\\in\\mathcal{R}(\\Omega,\\mathbf{v})$, the estimator is then a high frequency estimate of the order $E_O_{\\mathbf{v},1+\\varepsilon}$, as can be seen from the top row of the table. In all cases the estimator is better than the a priori estimates by a geometric factor of at least $2$. We have also checked that the estimator has geometric convergence of order $1+10\\varepsilon$ on the same time meshes. This confirms the"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". The fluid problem is solved using a fixed number of time steps, whereas the solid problem is solved by a multirate time-stepping scheme. The overall time step is adapted from step to step according to some \\textit{a priori} defined strategy. Examples of such schemes include the mixed method of \\cite{DLW,DLW2}, the third-order split method of \\cite{ESM} and the third-order hybrid method of \\cite{VL}. The use of such methods avoids the expensive convergence check for fixed-step schemes and significantly speeds up the simulation. Moreover, the hybrid methods considered here do not guarantee absolute error reduction for each of the individual subproblems. Instead, they only provide a number of time steps such that the difference between fluid and solid updates does not matter in the overall error, i.e., the difference of fluid and solid updates is noticed well in advance of the desired number of time steps. The advantage of the hybrid methods compared to the fixed-step methods is also application specific. The methods considered here provide faster solution of the simulation problem in a range"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and  $3.9\\%$ in $3.6$ to $3.8$ GHz. This could be because, in the last few scans the receiver side estimation error starts to be sufficiently large to detect the transition between frequencies, so that even if the transmitter uses perfect data, it estions to the highest estimation error level on the toob between $2.4$ GHz and $3.6$ to $3.8$ GHz. In $2.4$ GHz, the error on the same frequency is \\(1.4\\%\\), and it is seen that the system health is about \\(7.9\\%\\) in $2.4$ GHz and $14.6\\%$ in $3.6$ to $3.8$ GHz. In both frequencies, the system health is high, just before the transition level on the toob (see Figure \\ref{fig:fig2} as a comparison). This could be because of the peak power restriction (see \\cite{reg3}) so the"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": " Accepted}, the authors propose a heuristic explanation for this appearance, stating that the field in the proposed position might be too close to an outer-plane source point. However, this explanation does not change the fact that the solution is non-unique. We will now show that there indeed are different choices of the source point such that the CME is in mechanical equilibrium, and some of them even have similar properties as the original version shown in figures.~\\ref{fig:theory_grid:inter_mag_stat,theory_grid:inter_fg_stat}. We choose a source point within the CME, such that it is shifted by $r = 1$~AU compared to the above example, which is shown in figure.~\\ref{fig:theory_grid:inter_between_points}. Furthermore, we imagine the left-hand side of the CME to be longer with a maximum radius of $r_{\\rm L} = 13.2$~AU instead of $r_{\\rm L} = 10$~AU in order to reduce the angle between the CME and the CME/Corona boundary"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,PrincipalComponentTheory,Benaglia:2013p407}, \\cite{DonatiNogari:2011p401}),  there is a different physical motivation  for our proposal.  We consider the space $\\mathbb{S}^{d-1}$ as the angle space: the vector $\\alpha \\in \\mathbb{S}^{d-1}$ denotes the angle of a vector $z \\in \\mathbb{R}^d$ from the normal direction. Clearly, this implies a restriction to angles smaller than $\\pi$. If the restriction to positive angles is considered, i.e., $z \\in \\mathbb{R}^d$ is allowed if it holds $z^\\top \\mathbf{e} > 0$ where $\\mathbf{e} \\in \\mathbb{R}^d$ is the standard basis vector, then the angle $\\alpha \\in \\mathbb{S}^{d-1}$ of $z$ is defined by $z = \\mathbf{e} \\alpha$.  This approach is suitable for vectors $z"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradient. For Landweber iteration, we rely on a formulae (\\%13.1) for the gradient of the energy, which is accurate for small perturbations. This yields a heuristic for Landweber iteration, which is effective only for models of small standard deviation $\\lambda$. However, in our search for the optimal $\\lambda$, we occasionally obtain a spurious local minimum for Landweber iteration with a short optimal iteration number of 2. This occurs because the effective cost function for Landweber iteration does not include the effective gradient, but only the effective iteration number. This yields an effective local cost per atom which increases with early iteration, and can occasionally yield a spurious minimum. This occurs because the true cost per atom increases with iteration, and the optimal iteration becomes larger as it evolves. This phenomenon does not occur for the optimal $\\lambda$ estimated by the Powell algorithm since this algorithm yields a cost per atom which increases with iteration. Moreover, since the Powell algorithm yields a optimal iteration that increases with evolution, this does not yield a spurious local minimum. To summarize this point, the first minimum"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, but one does not have a way to prove convergence of any other sequence. The established conditions are also local in sense that they depend only on the finite dimensional subsets of the linear subspace $\\mathbb{K} \\mathbb{E} \\charge{width=0.5pt}{\\textbf{s}}$ generated by the specific basis set in which the nonlinear function is written. We will argue that what is required for verification is to check the local conditions in a control space where the reference trajectory exists. In the linear case, the local convergence theory is proved in all dimensions and for all convergent sequences. However, in the nonlinear case, it is proved in a low dimensional case where the convergence speed is bounded by a fixed low degree of linear independence of the applied force, i.e. the convergence speed is identical to the linear case. Thus, the local convergence theory established in the nonlinear case is as if we ``switch and multiply'' the convergence speed in the linear case from $c_u=1$ to $c_u=\\charge{width=0.5pt}{\\textbf{"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a small perturbations of the domain $\\Omega$. We consider a wave $u_t = \\Delta u + g$ in $\\Omega$ at time $t=0$, where $g = \\sqrt{v}$ is a complex wave function. We model the introduction of chaos into the domain $\\Omega$ by considering the case where $g = \\sqrt{v} + \\i \\eta v$, with $\\eta>0$ small, and take $g$ to zero at $t>0$ so that it does not recur. By numerical simulation, we find that waves with small $\\eta$ cause little chaos; larger $\\eta$ disrupt the domain $\\Omega$ by creating small holes in it, which quickly drift and re-drift to merge; waves with $\\eta$ a little larger cause more chaos, but for a short time, so that the resulting chaos structure is not fully independent and spread out, more like a collection of large holes rather than smaller ones; waves with $\\eta$ even larger cause large holes which stay independent and widely spread, but for a short time so that they do"
        }
      ]
    },
    {
      "train_loss": 0.03218603515625,
      "update_counter": 8000,
      "epoch": 258.03225806451616,
      "val_loss": 6.64453125,
      "val_entropy": 0.6033935546875,
      "val_ttr": 0.6038411458333334,
      "val_gen_seqs": [
        {
          "context": "\t    (1, 11) (2, 11) (3, 11) (4, 11) (5",
          "generated_continuation": ", 11) (6, 11) (7, 11) (8, 11) (9, 11) (10, 11) (11, 11) (12, 11) (13, 11) (14, 11) (15, 11) (16, 11) (17, 11) (18, 11) (19, 11) (20, 11) (21, 11) (22, 11) (23, 11) (24, 11) (25, 11) (26, 11) (27, 11) (28, 11) (29, 11) (30, 11) (31, 11) (32, 11) (33, 11"
        },
        {
          "context": "In Figures~\\ref{comparison_whole_timeline_configuration_1} and \\ref{comparison_whole_timeline_configuration_2} we show",
          "generated_continuation": " the evolution of the reconstructed mass for the modeled signal event (blue) for the two configuration case and the corresponding performances corresponding to the trained models (green and red). In the case of configuration case 2 (Figure~\\ref{comparison_whole_timeline_configuration_2}), the fit works better for both peak widths values. However, the reconstructed mass is optimized for wide width case when the peak signal intensity is low (i.e. \\textit{x} value around 2.38mm and \\textit{y} value around 1.28mm) and for narrow width case for higher \\textit{x} and \\textit{y} values (i.e. for \\textit{x} around 2.62mm and \\textit{y} around 1.79mm). In the case of configuration case 1 (Figure~\\ref{comparison_whole_timeline_configuration_1}), the reconstructed mass for the modeled signal event lies close to the modeled peak for all the performed measurements, thus confirming the high accuracy of the proposed methodology. However, for"
        },
        {
          "context": "In Table~\\ref{fluid_residuals_uniform_equal} we show results of the a posteriori error estimator on a sequence of uniform time meshes",
          "generated_continuation": " for \\emph{geometric} convergence estimate $N_t\\geq 2\\max\\{1,\\sqrt[3]{2\\lambda_1/|\\mathcal{L}|}}\\ $ in~\\eqref{eq:gm_conv_est}. Note that $N_t\\geq2$ is the minimum number of time grids such that the solution is non-divergent on each of the grids. We observe that the estimator works best when the chosen time grid is large more than one unit. Indeed, assuming that $u$ is continuous on the interval $[0,1]$, then the accumulated norm $\\sum_{t=0}^N\\|u(t)\\|$ tends to $4$\\footnote{On the other hand, if we assume that $u$ is discrete on the grid $\\{\\frac{t}{N}: 0\\leq t\\leq N\\}$, then $\\sum_{t=0}^N\\|u(t)\\|\\leq 4$ with a optimal $N$ equal to $4\\lambda_1/|\\mathcal{L}|$. See the discussion after Theorem~\\ref{"
        },
        {
          "context": "Featuring distinct time scales in each of the problems, the use of multirate time-stepping schemes with adapted step sizes for fluid and solid is obvious",
          "generated_continuation": ". For the fluid problem, the chosen time-step size is given by \\eqref{eq:adaptedTSfluid}, while the time-step size for the solid problem is given by the generalized version of \\eqref{eq:adaptedTSsol} that accounts for a general time-stepping scheme of the form $\\frac{d}{dt} P(X) = J(X, \\theta(t)) P(X) + G(X, \\theta(t)$, $\\theta'(t)$, $P(X))$ where $P(X)$ is a positive definite correlation matrix of the solid particle positions at time $t$. The time-step size is then the solution of \\eqref{eq:adaptedTSsol} using a formulation as in \\cite{DReAM}. By numerical experiments, the number of fluid blocks can be set to $K_b = 1$ for the kinetic energy expression in \\eqref{eq:adaptedTSfluid}, and $K_b = 10$ for the remaining fluid terms, while grid size and multiple scaling effects have shown that grid"
        },
        {
          "context": "Scanning frames not only reduce the localization errors on the same floor in $2.4$ GHz but, even floor errors start at as high as ",
          "generated_continuation": " $1.7\\%$ in $2.4$ GHz and  $3.9\\%$ in $5.2$ GHz when the \\mbox{CSU} is located in the same floor as the receiver. In addition, when the gap between the two floors is larger than $20\\%$, the floor error in $2.4$ GHz is $0.7\\%$ and in $5.2$ GHz is $3.0\\%$. It should be noted that, in the case of $2.4$ GHz, the floor error for the proposed sampling and filtering method is even less than the gap between the two floors, which emphasizes the effect of floors in $2.4$ GHz compared to $5.2$ GHz. As shown in the table \\ref{tab:pic}, the \\mbox{CSU} location effect in $2.4$ GHz depends on the gap between the two floors. When the two floors are equal or the gap is larger than $40\\%$, the floor error for the proposed sampling and filtering method is smaller than $1\\%$ in both $2"
        },
        {
          "context": "Nevertheless, the more accurate consideration demonstrates that the original equilibrium version of the CME does not exist. In \\cite{Valgushev:2015",
          "generated_continuation": "ppa} it is shown that the fly-by CME is the result of a high-speed Alfv\\'enic wind interaction between two adjacent eruptions, which appears to be initialized by an interplanetary wave (IMW). In addition, the initialization process occurs much closer to the Sun than the second CME, which is shown by distinct channel features in {\\emph{STEREO}}-A and {\\emph{PSP}} observations \\citep{2019SoPh..294...92V}. Therefore, the {\\emph{in situ}} speed of the second CME should be close to the local Alfv\\'en speed but initially smaller than $\\sim$300~km~s$^{-1}$. We can therefore conclude that the second CME is not an equilibrium structure and the speed of which matches the orbital velocity of the S/C only under the assumption of a constant mass flow from the Sun into the CME. Based on the analysis of hydrodynamic simulations \\citep{2019MNRAS.489.1789H}, \\cite{2019SoPh"
        },
        {
          "context": "Even though the tangential cone condition \\eqref{tangentialconecondition} holds for a number of different applications (see e.g.~\\cite{Kalten",
          "generated_continuation": "bacher,PrincipalComponentTheory,BenagliaPietrini}), it is not the most natural condition when starting with a biological model. The motion of an animal is determined by the interaction between its sensory system and the action of its muscles. The interaction induces a generalization of the deterministic drift of the sensory system across the environment. Such a drift does not allow for the intuitive interpretation of a `wave' moving towards the corner of its starting point due to the intervention of the muscles (see e.g.~\\cite{Duffy06a}). From a mathematical point of view, the tangential cone condition makes immediate by defining a preference vector \\eqref{initialhobe} for the initial conditions. Using Bayen's function \\eqref{InitialHobe}, the preference can be written as $\\nu \\propto \\exp(-z^T \\sqrt{D} \\phi)(0,\\dots,0)$, where $z$ is a column vector of parameters. This yields a unique preference for every set of parameters. As it is linear, one can easily optimize the preference by varying the covariance matrix $D"
        },
        {
          "context": "    \\item \\emph{The spurious first local minimum for Landweber iteration.} Recall that the effective performance of heuristic rules depend also on efficent estimates",
          "generated_continuation": " of the gradients. The chaotic nature of the loss function means that efficiencies are only practical up to some optimality threshold. For the Landweber iteration, we use $1000$ iterations and a step size of $s=0.01$. These parameters are general settings which produce an optimality threshold of an average gap of $10$ between the first solution and all subsequent solutions. However, as the number of parameters $d$ grows, the size of the local minimum increases and the upper bound for the optimal solution weight becomes a heuristic. We analyze the resulting trajectory, see Fig.~\\ref{fig:landweber_first}, and compare it to the optimal trajectories for $d=30$ and $d=128\"\\), which have almost the same nominal loss (see Fig.~\\ref{fig:landweber_true_30} and Fig.~\\ref{fig:landweber_true_128}). Obviously the first solution found by the landweber iteration is not the optimal solution for $d=128\".} However, as can"
        },
        {
          "context": "    \\item \\emph{Only local convergence in nonlinear case.} The established convergence theory in the nonlinear case is a local one: one can only prove convergence",
          "generated_continuation": " of a given sequence, however directionally, whether it is convergent in the entire space. In most cases, the convergence of the original space is not important, and only the convergence of a restricted subspace is of interest. This is because the scaled nonlinear functions $\\widetilde{f}_i(\\mathbf{x})$ are not compatible with each other, in the sense that the composition of two different scaled nonlinear functions $\\widetilde{f}_i(\\mathbf{x})$ is not equal to $\\widetilde{f}_j(\\mathbf{x})$ for all $i,j$, thus a mixed solution cannot be constructed from a combination of different scaled nonlinear functions. From the convergence theory in Section~\\ref{sec:NL}, one can see that the scaled nonlinear functions $\\widetilde{f}_i(\\mathbf{x})$ tend to converge locally in the subspace $\\mathcal{R}_\\infty+\\mathcal{H}_0^*\\times\\mathcal{H}_1^*$, but already this local convergence in the subspace is enough to determine the direction of convergence of the original space $\\mathcal{C}_\\infty$. In the following, \\autoref"
        },
        {
          "context": "where $\\Omega \\subset \\R^N$, $N=2,3$ is a bounded and smooth domain, and $g_j$ models",
          "generated_continuation": " a non-smooth component. In this context, a level set $S:\\Omega\\rightarrow \\R$, $S=\\det\\left|\\mathbf{X}\\right.|$, where $\\mathbf{X}:\\Omega\\rightarrow \\R^{N-1}$ is the position vector, is used to denote the smooth component of the shape function $\\mathbf{X}$. When $N=2$, $S=x_2$ holds and we describe a slope free shape function. For $N=3$, we set $S=x_2x_3$ and $S=x_2x_3x_4$ for smooth regions with no slope along the corresponding coordinate direction. These shape functions are defined enough to include different types of obstacles and different material properties in the smooth component. In particular, we allow for thin and thick walls and different conductivities for the flow and the walls. The smooth component also interacts with the non-smooth component through a jump in the shape function. This is the component of the shape function where the change in the function induces a change in the direction of movement without"
        }
      ]
    }
  ]
}