Language,GPT-2,r50k_base,p50k_base,p50k_edit,cl100k_base,RoBERTa,GottBERT,CamemBERT,PhoBERT,RoCBert,XLM-RoBERTa,M2M100,MBart50,mT5,FlanT5,ByT5,CANINE,BLOOM,ArabicBERT,MuRIL,UTF-32,BERT Japanese
Acehnese (Arabic script),251497,251497,251497,251497,199956,251497,390347,–––,–––,–––,115461,119667,115462,117938,–––,391515,219474,140671,–––,–––,219474,–––
Acehnese (Latin script),113531,113531,113531,113531,104849,113531,122823,124070,114495,236887,93369,92887,93370,94854,147715,283335,278522,92532,138840,109149,278522,146081
Mesopotamian Arabic,224237,224237,224234,224234,158035,224237,402524,–––,–––,–––,69291,80345,69292,84300,–––,405062,223981,60960,53301,103943,223981,–––
Ta’izzi-Adeni Arabic,227986,227986,227986,227986,158955,227986,407200,–––,–––,–––,70079,81240,70080,86472,–––,409628,226400,60967,53198,104701,226400,–––
Tunisian Arabic,220845,220845,220841,220841,154582,220845,396918,–––,–––,–––,71776,81874,71777,84938,–––,399376,221017,63181,55062,102587,221017,–––
Afrikaans,101771,101771,101771,101771,89171,101771,98519,110438,105102,229264,71571,77060,71572,78558,124449,277732,274530,89808,127906,99171,274530,131262
South Levantine Arabic,211501,211501,211501,211501,149832,211501,382023,–––,–––,–––,67110,77076,67111,81281,–––,385654,214652,59291,53150,98143,214652,–––
Akan,147060,147060,147060,147060,141807,147060,149734,131229,120943,–––,117967,115776,117968,119492,171393,285241,259737,109025,–––,–––,259737,150069
Tosk Albanian,139180,139180,139180,139180,118970,139180,139968,145681,140863,241693,78642,86041,78643,97346,178604,312452,289969,115353,140644,135796,289969,–––
Amharic,409343,409343,409343,409343,405644,409343,409257,–––,–––,–––,79683,89970,79684,113583,–––,445855,174268,269501,–––,–––,174268,–––
North Levantine Arabic,212418,212418,212418,212418,149520,212418,381535,–––,–––,–––,68660,78633,68661,80726,–––,384850,213236,59993,52690,98757,213236,–––
Standard Arabic,231316,231316,231316,231316,160485,231316,411432,–––,–––,–––,70160,81754,70161,88560,–––,413881,228923,60841,52834,106186,228923,–––
Standard Arabic (Romanized),132163,132163,132126,132126,129672,132163,139173,138183,129567,258329,115661,115815,115662,113902,170119,302462,302341,114133,154071,122996,302341,169235
Najdi Arabic,231630,231630,231630,231630,160703,231630,412164,–––,–––,–––,70566,82195,70567,88765,–––,414423,229162,61136,53069,106404,229162,–––
Moroccan Arabic,221115,221115,221115,221115,156350,221115,400935,–––,–––,–––,74275,84409,74276,84624,–––,403707,223535,67216,60326,102895,223535,–––
Egyptian Arabic,222553,222553,222552,222552,156292,222553,402538,–––,–––,–––,69984,80468,69985,83875,–––,404608,223944,61836,55464,101822,223944,–––
Assamese,514374,514374,514368,514368,327657,514374,656585,–––,–––,–––,113113,141670,113114,127306,–––,658998,248983,74842,–––,66888,248983,–––
Asturian,99420,99420,99419,99419,83704,99420,105192,105422,103303,224060,75945,73168,75946,84180,119928,276557,267356,69668,120088,97579,267356,130355
Awadhi,378103,378103,378082,378082,252559,378103,646209,–––,–––,–––,81930,93158,81931,106457,–––,648620,252973,75880,–––,69624,252973,–––
Central Aymara,122124,122124,122124,122124,114730,122124,127777,130158,122409,236592,101269,104125,101270,103045,157095,278302,271932,103162,139013,106931,271932,149459
South Azerbaijani,271447,271447,271437,271437,176561,271447,419700,–––,–––,–––,85106,95216,85107,93095,–––,421538,230356,96441,107137,93021,230356,–––
North Azerbaijani,182348,182348,182348,182348,139670,182348,182216,–––,159049,–––,68418,79961,68419,88807,–––,327051,282580,122337,167502,–––,282580,–––
Bashkir,316164,316164,316164,316164,226124,316164,313601,–––,–––,–––,122871,77959,122872,105089,–––,480838,262555,189762,–––,–––,262555,–––
Bambara,140011,140011,139965,139965,135569,140011,145217,123509,116633,–––,108316,108853,108317,108746,156086,270935,249080,100402,–––,–––,249080,138189
Balinese,103682,103682,103666,103666,95203,103682,109604,114664,107056,247527,78612,81765,78613,84863,137278,288815,287688,77770,134647,98690,287688,139353
Belarusian,344748,344748,344748,344748,187319,344748,328950,–––,240969,–––,87245,98633,87246,104504,–––,535528,293469,172266,250560,–––,293469,–––
Bemba,129364,129364,129364,129364,117904,129364,133283,134862,128043,273099,104974,105613,104975,102918,173992,318923,318153,102359,159038,117182,318153,169134
Bengali,507509,507509,507507,507507,308325,507509,674272,–––,–––,–––,82135,98507,82136,103680,–––,676264,254327,62014,–––,54583,254327,–––
Bhojpuri,377309,377309,377267,377267,247793,377309,637964,–––,–––,–––,87860,97689,87861,107289,–––,641057,252371,81327,–––,75210,252371,–––
Banjar (Arabic script),264266,264266,264266,264266,200951,264266,436641,–––,–––,–––,114731,122324,114732,115648,–––,437480,240281,131401,100285,–––,240281,–––
Banjar (Latin script),103982,103982,103982,103982,90276,103982,109168,108556,101073,233919,72076,73211,72077,76538,127530,272644,272244,69307,127025,92101,272244,132944
Standard Tibetan,784601,784601,784570,784570,595659,784601,857781,–––,–––,–––,–––,–––,–––,242119,–––,859707,291590,354304,–––,–––,291590,–––
Bosnian,115005,115005,115004,115004,98989,115005,116335,117347,112807,220911,66919,73907,66920,87222,143283,267950,260762,97705,133927,–––,260762,134892
Buginese,115798,115798,115791,115791,104607,115798,117358,116092,113066,237814,89886,94280,89887,94630,145074,283841,275699,90959,139630,105783,275699,143530
Bulgarian,289393,289393,289393,289393,139501,289393,276630,–––,215050,–––,68968,77990,68969,84216,–––,489890,270630,132248,227106,–––,270630,–––
Catalan,100917,100917,100917,100917,90427,100917,110332,106665,109174,238062,75451,79876,75452,89612,124034,291806,285853,62983,124819,102473,285853,134705
Cebuano,117716,117716,117716,117716,101774,117716,124007,127770,117575,259722,90833,87220,90834,93284,165260,311013,310925,94620,145902,113208,310925,158499
Czech,137859,137859,137859,137859,111264,137859,136455,–––,123396,213411,69647,77680,69648,83670,157688,280405,251082,107754,126693,–––,251082,–––
Chokwe,113735,113735,113735,113735,104503,113735,119429,119231,110095,236893,92286,93123,92287,92683,154067,278117,277102,91317,141673,104766,277102,147047
Central Kurdish,341390,341390,341390,341390,253685,341390,459720,–––,–––,–––,136941,157264,136942,114761,–––,461199,251845,170668,158989,–––,251845,–––
Crimean Tatar,130893,130893,130659,130659,111855,130893,131923,134640,128877,–––,82206,86741,82207,86988,162045,291908,265769,110003,139670,–––,265769,–––
Welsh,123262,123262,123262,123262,111804,123262,131194,134885,127694,229811,85327,91291,85328,111445,180487,277959,276951,111312,149568,125348,276951,151952
Danish,100083,100083,99973,99973,85831,100083,99284,111753,107454,223959,65265,71154,65266,75027,130567,272408,266753,88852,123375,98506,266753,–––
German,112600,112600,112600,112600,83450,112600,58508,123942,116552,259689,69727,78466,69728,78142,79288,307304,302377,89320,139166,108976,302377,141390
Southwestern Dinka,130281,130281,130254,130254,118865,130281,125995,114303,109959,162625,100413,98186,100414,104155,–––,250279,221929,96684,–––,–––,221929,–––
Dyula,115705,115705,115705,115705,108433,115705,121866,114862,108895,212549,98157,96750,98158,101708,154918,278145,261275,95832,125478,111235,261275,144030
Dzongkha,860070,860070,860066,860066,651364,860070,942994,–––,–––,–––,–––,–––,–––,278950,–––,943132,323571,391306,–––,–––,323571,–––
Greek,343953,343953,343953,343953,271846,343953,393935,–––,259761,249809,86669,100218,86670,108200,–––,562986,310159,202450,260481,–––,310159,–––
English,52567,52567,52567,52567,52835,52567,78923,80199,83535,216302,59656,63374,59657,65729,57876,259396,259170,53174,96467,53946,259170,103383
Esperanto,106503,106503,106503,106503,98813,106503,108038,108276,105543,217770,71512,87589,71513,78270,126939,263370,258512,87615,119270,–––,258512,–––
Estonian,111089,111089,111089,111089,98826,111089,109399,113611,110867,221758,66864,76037,66865,73511,140518,262393,254538,94067,123324,91983,254538,–––
Basque,110251,110251,110250,110250,99592,110251,109524,115691,111466,240728,69305,77988,69306,80064,134955,276279,275448,60830,136206,102270,275448,139769
Ewe,152353,152353,152353,152353,145385,152353,155357,135264,121730,–––,120047,118048,120048,119854,165046,276783,252072,112330,–––,–––,252072,–––
Faroese,125367,125367,125367,125367,109406,125367,130964,131658,121814,–––,85972,89494,85973,91884,157775,284012,263787,103830,136254,–––,263787,–––
Fijian,121044,121044,121044,121044,113623,121044,132003,122256,116500,244685,102559,102897,102560,104403,174918,303678,303403,105667,159150,108251,303403,158535
Finnish,119621,119621,119621,119621,105276,119621,115400,125385,123141,244727,67990,77809,67991,76281,151302,287021,276102,100568,137245,110839,276102,149717
Fon,214346,214346,214346,214346,193724,214346,217332,–––,–––,–––,149608,146704,149609,155092,–––,327076,265357,117430,–––,–––,265357,–––
French,104971,104971,104971,104971,84407,104971,116382,67031,115302,258575,77352,84236,77353,91954,92423,320613,308489,63881,128023,105894,308489,140258
Friulian,108675,108675,108675,108675,97607,108675,115943,111015,110929,232073,92873,93343,92874,99837,133187,293730,284088,90630,123275,104739,284088,132954
Nigerian Fulfulde,104628,104628,104628,104628,97955,104628,108352,103243,96944,186121,87218,80379,87219,87007,123788,247982,242184,88035,111927,83039,242184,124587
West Central Oromo,132819,132819,132819,132819,122460,132819,135871,139003,134682,268913,106294,94455,106295,110976,182606,311988,307188,116457,157621,117080,307188,168782
Scottish Gaelic,141855,141855,141855,141855,127836,141855,146408,144558,134854,267356,104559,102196,104560,121647,187248,332878,320213,119665,151269,122313,320213,153752
Irish,134446,134446,134446,134446,123062,134446,138891,140529,129584,249323,89323,95296,89324,109777,181920,318859,299807,114550,139543,132936,299807,156000
Galician,100549,100549,100549,100549,82499,100549,109486,109337,108753,240324,67114,72186,67115,86400,126094,293863,286720,67516,125278,102970,286720,136270
Guarani,129285,129285,129268,129268,114837,129285,132854,124168,121173,226759,102405,103340,102406,106773,148620,281888,261956,99535,135241,107150,261956,–––
Gujarati,644828,644828,644817,644817,406224,644828,645186,–––,–––,–––,84540,99875,84541,113530,–––,647801,248721,71638,–––,64036,248721,–––
Haitian Creole,100092,100092,100092,100092,91870,100092,106609,105879,96290,193530,83038,73297,83039,79946,134250,245512,238876,83111,113484,90566,238876,122543
Hausa,112972,112972,112972,112972,105931,112972,117546,118065,105075,219549,83445,81634,83446,89867,151250,279824,276665,94598,129358,96035,276665,139593
Hebrew,231019,231019,231019,231019,193616,231019,356374,–––,–––,–––,67112,77338,67113,80177,–––,359555,201565,155280,165810,–––,201565,–––
Hindi,392092,392092,392092,392092,253270,392092,658313,–––,–––,–––,74861,86379,74862,104604,–––,661374,258472,68276,–––,62712,258472,–––
Chhattisgarhi,378849,378849,378849,378849,247937,378849,635526,–––,–––,–––,84368,95768,84369,105029,–––,638720,250204,76711,–––,72073,250204,–––
Croatian,113059,113059,113058,113058,97676,113059,114870,114787,110874,216201,65682,72894,65683,85612,140906,262354,255014,95947,131330,–––,255014,131810
Hungarian,139602,139602,139600,139600,113411,139602,141611,142790,131104,235951,70308,81277,70309,82939,172889,299608,272675,109899,135056,124421,272675,–––
Armenian,526451,526451,526451,526451,527113,526451,526464,–––,–––,–––,82580,94934,82581,103808,–––,529415,287555,229282,–––,–––,287555,–––
Igbo,179977,179977,179977,179977,129168,179977,183972,141791,123799,214673,126634,93374,126635,117780,183723,312799,263254,91237,144576,–––,263254,–––
Ilocano,118613,118613,118613,118613,108129,118613,125270,129022,117649,262670,96095,84539,96096,105633,163179,314085,313360,101123,149848,108277,313360,160015
Indonesian,104006,104006,103989,103989,82006,104006,108315,112174,104373,241697,56004,61915,56005,70942,129783,281022,280788,51241,130092,93990,280788,137026
Icelandic,127643,127643,127643,127643,113423,127643,135970,–––,125216,–––,73561,81848,73562,86678,162574,282751,255925,105890,129624,–––,255925,–––
Italian,105594,105594,105594,105594,86628,105594,112844,109149,111317,257935,70965,79123,70966,88132,125884,307457,305238,86218,136373,103826,305238,141489
Javanese,101610,101610,101610,101610,91337,101610,107079,111582,100716,230239,68844,69843,68845,79381,128021,269810,269310,74537,131089,94006,269310,132911
Japanese,157858,157858,157858,157858,121627,157858,254548,–––,–––,111466,66414,76356,66415,59289,–––,329167,113611,96085,97622,–––,113611,69209
Kabyle,131364,131364,131364,131364,130359,131364,137564,127175,119182,195433,109639,108225,109640,119434,163686,274466,256723,107463,124098,–––,256723,–––
Jingpho,139457,139457,139456,139456,124278,139457,149129,142637,128420,259923,115947,112627,115948,117829,197335,330717,330686,113746,164614,125372,330686,171095
Kamba,121739,121739,121739,121739,114796,121739,127481,118543,108374,211466,96785,96361,96786,100228,155742,263257,253015,93920,128103,–––,253015,–––
Kannada,719397,719397,719274,719274,470190,719397,731252,–––,–––,–––,81107,96883,81108,94741,–––,735146,272113,69560,–––,57383,272113,–––
Kashmiri (Arabic script),325291,325291,325145,325145,244307,325291,444451,–––,–––,–––,115090,122425,115091,131446,–––,447300,248311,123488,121604,94413,248311,–––
Kashmiri (Devanagari script),369802,369802,369778,369778,247927,369802,612620,–––,–––,–––,108675,117719,108676,117866,–––,621887,249147,98492,–––,94654,249147,–––
Georgian,727866,727866,727866,727866,520316,727866,727448,–––,–––,–––,80173,99022,80174,101748,–––,764251,283885,264919,–––,–––,283885,–––
Kazakh,311213,311213,311212,311212,200493,311213,308595,–––,222082,–––,68428,81093,68429,79112,–––,489005,266102,171823,–––,–––,266102,–––
Kabiyè,256080,256080,256066,256066,250629,256080,258642,–––,–––,–––,177790,171937,177791,185803,–––,356120,282100,177842,–––,–––,282100,–––
Kabuverdianu,101684,101684,101684,101684,91103,101684,104228,104206,100905,212659,80636,82272,80637,84027,127940,263624,257338,80174,120718,97797,257338,133103
Halh Mongolian,337351,337351,337351,337351,199035,337351,334336,–––,227151,–––,72046,84635,72047,97042,–––,495824,270527,179887,–––,–––,270527,–––
Khmer,805983,805983,805983,805983,469026,805983,806501,–––,–––,–––,96638,118485,96639,94217,–––,864382,305984,340448,–––,–––,305984,–––
Kikuyu,181066,181066,180960,180960,173742,181066,186000,–––,138417,255442,138024,137581,138025,143352,–––,336536,302637,131787,150928,–––,302637,–––
Kinyarwanda,124507,124507,124507,124507,113070,124507,127053,127430,122870,248670,102456,103304,102457,99277,159781,294412,288228,84146,148708,115812,288228,155270
Kyrgyz,301606,301606,301606,301606,185559,301606,299457,–––,222699,–––,69278,105463,69279,86514,–––,487931,265506,160837,–––,–––,265506,–––
Kimbundu,122622,122622,122622,122622,112524,122622,129691,126512,119666,241994,98046,97621,98047,97043,168245,288236,286800,96494,149610,107216,286800,156856
Northern Kurdish,128950,128950,128950,128950,116457,128950,130825,132522,117175,213428,82489,105243,82490,93428,158576,285648,257926,108099,124890,–––,257926,–––
Central Kanuri (Arabic script),249173,249173,249173,249173,191917,249173,410142,–––,–––,–––,155367,157984,155368,160018,–––,414556,228438,111740,–––,127913,228438,–––
Central Kanuri (Latin script),135330,135330,135330,135330,125283,135330,140581,128497,120540,–––,103625,104673,103626,104001,163460,287925,273117,106465,–––,–––,273117,–––
Kikongo,114227,114227,114227,114227,105361,114227,127161,115771,114676,241355,94290,93629,94291,95889,174143,295155,294834,92840,153421,106477,294834,158706
Korean,266663,266663,266660,266660,125737,266663,304725,–––,–––,214170,69124,76644,69125,83718,–––,310727,130986,148546,125267,–––,130986,–––
Lao,693551,693551,693550,693550,508348,693551,694035,–––,–––,–––,82850,101726,82851,83315,–––,707645,257362,462849,–––,–––,257362,–––
Ligurian,120391,120391,120391,120391,104558,120391,123687,119908,119632,235685,98515,100538,98516,111123,147062,302581,286080,96349,133177,110857,286080,–––
Limburgish,107733,107733,107733,107733,95157,107733,105731,111137,110596,225590,86570,87324,86571,90468,130066,278124,269296,93152,127464,103761,269296,131832
Lingala,106455,106455,106455,106455,98517,106455,116034,109537,105400,232882,90766,79977,90767,90700,158212,279939,279583,87885,141933,102285,279583,145278
Lithuanian,128701,128701,128701,128701,116635,128701,128824,122546,118645,223996,70001,78909,70002,80627,149057,275925,258253,103350,128643,–––,258253,–––
Lombard,124681,124681,124681,124681,107743,124681,124609,121716,118002,224668,101898,99104,101899,111495,149212,300684,278244,97705,124335,105633,278244,–––
Latgalian,125800,125800,125799,125799,116433,125800,131764,130139,123432,220726,93413,95927,93414,96113,156294,271741,256016,105855,131014,–––,256016,–––
Luxembourgish,118140,118140,118140,118140,104985,118140,102428,121814,119575,247827,97961,83888,97962,96291,129506,298710,290166,100571,135153,116866,290166,135901
Luba-Kasai,111796,111796,111796,111796,102583,111796,118429,115197,109428,234942,92166,90561,92167,90089,143377,280012,279832,89426,139093,102067,279832,145909
Ganda,114169,114169,114169,114169,103642,114169,117146,118218,113889,230752,92353,87471,92354,92028,153108,268270,265137,89042,140788,104487,265137,145269
Luo,107393,107393,107393,107393,96393,107393,110396,111461,105676,223587,90538,90396,90539,92902,147703,271845,271168,89122,130663,100968,271168,139626
Mizo,109968,109968,109936,109936,103801,109968,120526,121894,107582,229391,98499,97666,98500,103490,159888,285572,284965,97367,137722,103538,284965,141753
Standard Latvian,133664,133664,133664,133664,124411,133664,138821,134492,130020,228113,73546,81921,73547,85736,160833,289046,264388,110674,130600,–––,264388,–––
Magahi,379598,379598,379597,379597,248523,379598,636697,–––,–––,–––,84403,95355,84404,106136,–––,639032,249109,76977,–––,72201,249109,–––
Maithili,390772,390772,390772,390772,258768,390772,652899,–––,–––,–––,94310,104117,94311,114345,–––,655569,254221,82790,–––,80828,254221,–––
Malayalam,801380,801380,801332,801332,475535,801380,801618,–––,–––,–––,82039,100506,82040,88992,–––,805120,293448,73183,–––,63546,293448,–––
Marathi,413498,413498,413459,413459,267905,413498,691253,–––,–––,–––,72668,87538,72669,100064,–––,693619,261180,64398,–––,57010,261180,–––
Minangkabau (Arabic script),275937,275937,275937,275937,209767,275937,450840,–––,–––,–––,120767,125921,120768,120685,–––,451565,247588,136964,108557,–––,247588,–––
Minangkabau (Latin script),103515,103515,103515,103515,93314,103515,110793,111845,104242,236464,78163,79094,78164,82037,135960,276661,276447,76545,130784,95339,276447,136909
Macedonian,287256,287256,287255,287255,146208,287256,274701,–––,215520,–––,69908,78646,69909,84820,–––,490881,270378,132974,–––,–––,270378,–––
Maltese,141664,141664,141664,141664,127435,141664,141958,137935,131061,223294,116915,118299,116916,111373,169882,301178,286521,119539,138499,–––,286521,–––
Meitei (Bengali script),537035,537035,537031,537031,354727,537035,714848,–––,–––,–––,152589,163965,152590,144945,–––,717455,267963,125103,–––,126265,267963,–––
Mossi,133299,133299,133299,133299,122348,133299,137491,121347,115535,183555,106415,105404,106416,118443,167997,267823,247517,105627,115195,–––,247517,–––
Maori,128730,128730,128730,128730,124332,128730,139528,135219,122779,227957,111143,110110,111144,111337,190121,301068,287051,112586,143900,114516,287051,149583
Burmese,887930,887930,887927,887927,618268,887930,888482,–––,–––,–––,102718,139944,102719,102566,–––,911313,320882,534510,–––,–––,320882,–––
Dutch,103566,103566,103566,103566,84254,103566,101070,112431,110240,243532,68075,74602,68076,76684,126814,289068,288665,90731,133150,103224,288665,137350
Norwegian Nynorsk,101503,101503,101503,101503,86536,101503,99045,112500,107965,220190,70057,74146,70058,77387,132296,269067,263026,87533,123432,98322,263026,126356
Norwegian Bokmål,97720,97720,97720,97720,82508,97720,96684,109934,105860,218898,63780,69761,63781,73405,129726,266748,261039,85981,121070,96432,261039,122233
Nepali,398853,398853,398853,398853,252889,398853,660650,–––,–––,–––,67463,81179,67464,96395,–––,664662,249703,62148,–––,54422,249703,–––
Northern Sotho,121975,121975,121975,121975,115442,121975,128420,126908,123249,241631,104521,96361,104522,103446,162466,304765,297511,103338,142768,117845,297511,153466
Nuer,222487,222487,222294,222294,211318,222487,219996,–––,–––,–––,156169,154335,156170,159275,–––,343204,280059,148266,–––,–––,280059,–––
Nyanja,118972,118972,118972,118972,110115,118972,124300,124303,118341,252030,94859,98042,94860,88840,157062,291363,290698,94469,146982,108899,290698,149128
Occitan,108564,108564,108564,108564,96606,108564,116408,112576,115322,247081,89419,83283,89420,97576,130600,304250,294944,79296,127922,103994,294944,137862
Odia,703585,703585,703579,703579,659440,703585,705821,–––,–––,–––,86376,99019,86377,204745,–––,709275,265861,72384,–––,65490,265861,–––
Pangasinan,87191,87191,87191,87191,82719,87191,99993,99902,92749,216247,76766,77833,76767,80227,126028,258211,258142,76906,120015,83257,258142,124607
Eastern Panjabi,415296,415296,415296,415296,415623,415296,668463,–––,–––,–––,93737,106290,93738,138377,–––,672235,261839,76290,–––,72700,261839,–––
Papiamento,104236,104236,104236,104236,92425,104236,104744,110164,104066,223400,82016,83786,82017,89114,132095,280322,273279,82092,120711,96986,273279,133963
Southern Pashto,283451,283451,283451,283451,202207,283451,423740,–––,–––,–––,82222,88992,82223,107480,–––,430166,246722,135424,–––,–––,246722,–––
Western Persian,279481,279481,279481,279481,173333,279481,432074,–––,–––,–––,65918,73910,65919,88203,–––,440163,244860,94432,107484,87412,244860,–––
Plateau Malagasy,135619,135619,135619,135619,119189,135619,137421,135901,124463,271859,93654,94172,93655,104757,173758,325965,316908,110108,158636,125488,316908,164355
Polish,141182,141182,141182,141182,101102,141182,141254,137085,132050,216024,70891,80029,70892,86127,163197,292098,274970,113594,146715,–––,274970,–––
Portuguese,101774,101774,101765,101765,78313,101774,109158,108852,108466,235580,66406,72168,66407,84640,128190,289511,281590,59813,125447,101495,281590,128052
Dari,268840,268840,268840,268840,166945,268840,418739,–––,–––,–––,64923,72850,64924,86220,–––,422038,237961,87464,105149,84975,237961,–––
Ayacucho Quechua,115829,115829,115829,115829,109926,115829,127131,123437,116930,245549,95148,97898,95149,93642,149972,278975,276777,97496,141619,105279,276777,146464
Romanian,130433,130433,130433,130433,99201,130433,133544,123322,121856,245167,74034,81531,74035,89729,86617,307925,291717,101592,128736,–––,291717,–––
Rundi,122445,122445,122429,122429,112641,122445,128356,127656,122997,248188,101867,103202,101868,99604,160924,290494,289035,87453,148549,115067,289035,155208
Russian,301727,301727,301726,301726,131496,301727,290010,–––,226451,222122,70031,77512,70032,83361,–––,514650,282208,132062,241387,–––,282208,–––
Sango,117409,117409,117409,117409,109951,117409,121661,120462,110639,219737,98752,97236,98753,107411,181962,290129,281357,95574,139566,110502,281357,154203
Sanskrit,417136,417136,417133,417133,264225,417136,678863,–––,–––,–––,85075,106855,85076,108350,–––,682390,254233,86484,–––,65020,254233,–––
Santali,675930,675930,675929,675929,676349,675930,675937,–––,–––,–––,–––,–––,–––,–––,–––,723211,274886,675805,–––,–––,274886,–––
Sicilian,119419,119419,119419,119419,105965,119419,123844,114797,114164,229071,94093,97030,94094,100524,142627,287680,273185,95956,136096,99295,273185,–––
Shan,986091,986091,986085,986085,795074,986091,987434,–––,–––,–––,263999,293592,264000,215725,–––,1020993,368897,641171,–––,–––,368897,–––
Sinhala,675811,675811,675806,675806,466435,675811,678328,–––,–––,–––,80352,96745,80353,108866,–––,683740,260389,436463,–––,–––,260389,–––
Slovak,132406,132406,132406,132406,112877,132406,130351,128214,121835,221546,70420,78465,70421,85374,158343,283449,260044,106872,130514,–––,260044,–––
Slovenian,111123,111123,111123,111123,99575,111123,114890,115649,110608,218816,67649,75111,67650,78923,140116,264297,258309,96182,132122,–––,258309,134145
Samoan,135286,135286,135286,135286,121040,135286,133154,130690,124947,235821,114297,113938,114298,126188,179105,315363,301267,113270,151254,119932,301267,159873
Shona,120455,120455,120455,120455,112401,120455,124984,126390,120317,256147,97431,99977,97432,88729,161542,290272,290236,95881,149383,111107,290236,153133
Sindhi,262974,262974,262974,262974,211562,262974,412078,–––,–––,–––,76539,82308,76540,114413,–––,414811,234918,133480,–––,65560,234918,–––
Somali,123967,123967,123967,123967,115377,123967,130693,135538,123653,250484,82896,87135,82897,97328,177052,296373,295873,108016,146788,110739,295873,156858
Southern Sotho,123024,123024,123024,123024,116991,123024,129269,131043,123654,255719,106299,101495,106300,104203,168891,312773,312241,104296,154957,116440,312241,159136
Spanish,104549,104549,104548,104548,81735,104549,114094,115325,113958,257284,71459,76775,71460,86318,128905,314077,308284,64466,132786,106948,308284,145418
Sardinian,118821,118821,118821,118821,105258,118821,121127,118457,116701,251257,95752,95788,95753,103494,142414,309950,301401,92156,133157,106888,301401,140989
Serbian,280731,280731,280725,280725,154192,280731,269463,–––,204589,–––,70646,79791,70647,85556,–––,466463,257321,136830,–––,–––,257321,–––
Swati,121681,121681,121677,121677,114096,121681,125659,128417,121453,260765,96139,91104,96140,92890,162234,291685,291628,97063,149923,112753,291628,156659
Sundanese,106071,106071,106071,106071,96166,106071,110038,111251,103720,231104,73021,69744,73022,80459,134315,272644,269967,78474,128625,96990,269967,135056
Swedish,102557,102557,102489,102489,83368,102557,96202,113476,109778,220619,63927,69772,63928,73261,128558,271024,260666,87945,116462,102548,260666,123549
Swahili,112044,112044,112044,112044,103109,112044,117978,114281,110022,230132,69335,76352,69336,82050,154117,272166,271792,66080,139702,100565,271792,147678
Silesian,136571,136571,136571,136571,115365,136571,137368,136327,133069,214742,98456,100997,98457,103492,166242,285706,269821,114749,146209,–––,269821,–––
Tamil,819110,819110,819084,819084,404265,819110,819443,–––,–––,–––,80444,98132,80445,82947,–––,822452,302271,67792,–––,57000,302271,–––
Tamasheq (Latin script),125394,125394,125394,125394,117339,125394,127786,120090,107965,–––,101928,99757,101929,107584,147324,262500,245729,100840,–––,–––,245729,–––
Tamasheq (Tifinagh script),548473,548473,548473,548473,535378,548473,548700,–––,–––,–––,–––,–––,–––,236215,–––,593545,244280,411636,–––,–––,244280,–––
Tatar,305861,305861,305861,305861,198081,305861,303215,–––,–––,–––,107718,97435,107719,92640,–––,479926,262636,167238,–––,–––,262636,–––
Telugu,688151,688151,688126,688126,440831,688151,688897,–––,–––,–––,79091,–––,79092,93592,–––,694562,261716,70848,–––,65209,261716,–––
Tajik,320045,320045,320045,320045,192259,320045,315920,–––,235753,–––,127526,130315,127527,106398,–––,521168,287415,174935,230544,–––,287415,–––
Tagalog,119822,119822,119658,119658,108909,119822,128762,134171,121010,273629,85423,90331,85424,95719,165047,326793,326630,98578,150059,112334,326630,165309
Thai,475633,475633,475632,475632,231995,475633,520076,–––,236002,–––,64512,80663,64513,64898,–––,713841,248859,246116,–––,–––,248859,–––
Tigrinya,414484,414484,414484,414484,412308,414484,414536,–––,–––,–––,117506,121172,117507,133644,–––,454149,178958,274296,–––,–––,178958,–––
Tok Pisin,116032,116032,116032,116032,107799,116032,122346,133519,121212,269513,102949,104877,102950,108762,159801,331464,331192,102258,155553,113268,331192,162326
Tswana,125587,125587,125587,125587,120710,125587,132720,134201,129455,262020,110661,106750,110662,110234,173957,323250,322918,107602,155913,121139,322918,162717
Tsonga,128795,128795,128795,128795,119187,128795,134248,136016,122354,257578,106752,106802,106753,105816,181153,312392,310434,106783,158885,117943,310434,169911
Turkmen,148036,148036,148036,148036,127012,148036,138738,142456,135583,240203,106090,108524,106091,110577,166347,303011,275172,116601,138788,–––,275172,–––
Tumbuka,146210,146210,146210,146210,135756,146210,152556,148078,139306,290363,114679,118850,114680,105756,190205,342449,338139,116690,172660,–––,338139,–––
Turkish,127797,127797,127797,127797,101176,127797,126681,132058,126042,–––,61972,72580,61973,73434,154269,289924,266383,104088,139867,–––,266383,–––
Twi,137471,137471,137471,137471,132821,137471,141707,126182,115474,–––,112411,110223,112412,112189,164908,273291,253416,96323,–––,–––,253416,144273
Central Atlas Tamazight,546058,546058,546058,546058,530718,546058,545844,–––,–––,–––,–––,–––,–––,228864,–––,591767,231581,408891,–––,–––,231581,–––
Uyghur,376366,376366,376366,376366,274341,376366,507965,–––,–––,–––,83957,190015,83958,168749,–––,511363,276109,194931,–––,–––,276109,–––
Ukrainian,302058,302058,302053,302053,158296,302058,291269,–––,215613,–––,71931,81306,71932,87227,–––,483479,265408,146145,226342,–––,265408,–––
Umbundu,117580,117580,117580,117580,106384,117580,121024,118998,113677,227635,93947,94237,93948,96604,157321,271523,262904,92708,140549,104492,262904,137958
Urdu,331280,331280,331278,331278,231974,331280,452920,–––,–––,–––,73620,82182,73621,99921,–––,455529,256560,72171,140013,67978,256560,–––
Northern Uzbek,120831,120831,120831,120831,114577,120831,128551,127816,123290,257429,79559,87115,79560,90504,161946,293473,293146,105295,152589,114395,293146,158219
Venetian,105316,105316,105316,105316,89961,105316,108907,107443,102465,–––,81416,83014,81417,89580,128185,273797,261537,83714,119988,99461,261537,127037
Vietnamese,238544,238544,238544,238544,129258,238544,241285,–––,69628,212672,70314,72894,70315,128452,–––,360436,272596,67335,133327,–––,272596,–––
Waray,125136,125136,125136,125136,103003,125136,127028,133122,118966,269575,92332,92160,92333,95050,153866,323306,323168,95906,153943,116083,323168,156697
Wolof,112432,112432,112432,112432,101369,112432,117841,114816,107121,200594,95168,88570,95169,94966,151584,258805,249234,89305,123622,104207,249234,130362
Xhosa,119033,119033,119033,119033,108969,119033,123648,125740,116871,244331,89389,86598,89390,88735,157869,275162,274900,88866,146904,110840,274900,149963
Eastern Yiddish,348452,348452,348447,348447,294166,348452,500081,–––,–––,–––,94121,101844,94122,109303,–––,503003,278746,235209,232041,–––,278746,–––
Yoruba,204447,204447,204447,204447,156430,204447,207890,–––,138921,189973,135479,110007,135480,135311,–––,332203,251232,87077,119607,–––,251232,–––
Yue Chinese,162457,162457,162457,162457,111948,162457,219180,–––,–––,76927,55486,65409,55487,62583,–––,225376,80134,49618,–––,–––,80134,56593
Chinese (Simplified),168850,168850,168850,168850,101138,168850,231200,–––,–––,83317,57887,66386,57888,60193,–––,240075,87445,50330,–––,–––,87445,56652
Chinese (Traditional),165883,165883,165883,165883,114977,165883,223346,–––,–––,78103,57297,67258,57298,64529,–––,229715,81884,51471,–––,–––,81884,58453
Standard Malay,107633,107633,107633,107633,85585,107633,112462,116301,107182,249178,56764,63455,56765,73097,134140,289645,288891,56793,133623,96834,288891,140806
Zulu,126593,126593,126593,126593,116322,126593,130363,131628,123210,259878,92688,85533,92689,92182,164151,291196,290777,93438,156409,116080,290777,158734
