Language,GPT-2,r50k_base,p50k_base,p50k_edit,cl100k_base,RoBERTa,GottBERT,CamemBERT,PhoBERT,RoCBert,XLM-RoBERTa,M2M100,MBart50,mT5,FlanT5,ByT5,CANINE,BLOOM,ArabicBERT,MuRIL,UTF-32,BERT Japanese
Acehnese (Arabic script),251497,251497,251497,251497,199956,251497,390347,90832,176648,86495,115461,119667,115462,117938,90896,391515,219474,140671,91429,77105,219474,50731
Acehnese (Latin script),113531,113531,113531,113531,104849,113531,122823,124070,114495,236887,93369,92887,93370,94854,147715,283335,278522,92532,138840,109149,278522,146081
Mesopotamian Arabic,224237,224237,224234,224234,158035,224237,402524,76027,185388,83820,69291,80345,69292,84300,76067,405062,223981,60960,53301,103943,223981,43175
Ta’izzi-Adeni Arabic,227986,227986,227986,227986,158955,227986,407200,77834,187401,83097,70079,81240,70080,86472,77877,409628,226400,60967,53198,104701,226400,44174
Tunisian Arabic,220845,220845,220841,220841,154582,220845,396918,76376,182504,81242,71776,81874,71777,84938,76423,399376,221017,63181,55062,102587,221017,43432
Afrikaans,101771,101771,101771,101771,89171,101771,98519,110438,105102,229264,71571,77060,71572,78558,124449,277732,274530,89808,127906,99171,274530,131262
South Levantine Arabic,211501,211501,211501,211501,149832,211501,382023,76220,175385,81325,67110,77076,67111,81281,76300,385654,214652,59291,53150,98143,214652,44143
Akan,147060,147060,147060,147060,141807,147060,149734,131229,120943,169024,117967,115776,117968,119492,171393,285241,259737,109025,98738,80219,259737,150069
Tosk Albanian,139180,139180,139180,139180,118970,139180,139968,145681,140863,241693,78642,86041,78643,97346,178604,312452,289969,115353,140644,135796,289969,108437
Amharic,409343,409343,409343,409343,405644,409343,409257,69695,137978,40969,79683,89970,79684,113583,69705,445855,174268,269501,39717,39435,174268,36602
North Levantine Arabic,212418,212418,212418,212418,149520,212418,381535,72702,175929,82341,68660,78633,68661,80726,72770,384850,213236,59993,52690,98757,213236,40847
Standard Arabic,231316,231316,231316,231316,160485,231316,411432,79409,189134,83412,70160,81754,70161,88560,79450,413881,228923,60841,52834,106186,228923,43742
Standard Arabic (Romanized),132163,132163,132126,132126,129672,132163,139173,138183,129567,258329,115661,115815,115662,113902,170119,302462,302341,114133,154071,122996,302341,169235
Najdi Arabic,231630,231630,231630,231630,160703,231630,412164,79736,189486,83668,70566,82195,70567,88765,79780,414423,229162,61136,53069,106404,229162,44210
Moroccan Arabic,221115,221115,221115,221115,156350,221115,400935,77398,184382,83427,74275,84409,74276,84624,77441,403707,223535,67216,60326,102895,223535,45016
Egyptian Arabic,222553,222553,222552,222552,156292,222553,402538,78776,184358,85359,69984,80468,69985,83875,78787,404608,223944,61836,55464,101822,223944,44935
Assamese,514374,514374,514368,514368,327657,514374,656585,80572,209938,46202,113113,141670,113114,127306,80568,658998,248983,74842,45553,66888,248983,43421
Asturian,99420,99420,99419,99419,83704,99420,105192,105422,103303,224060,75945,73168,75946,84180,119928,276557,267356,69668,120088,97579,267356,130355
Awadhi,378103,378103,378082,378082,252559,378103,646209,102179,201278,56756,81930,93158,81931,106457,102216,648620,252973,75880,62972,69624,252973,53252
Central Aymara,122124,122124,122124,122124,114730,122124,127777,130158,122409,236592,101269,104125,101270,103045,157095,278302,271932,103162,139013,106931,271932,149459
South Azerbaijani,271447,271447,271437,271437,176561,271447,419700,71248,195084,59923,85106,95216,85107,93095,71269,421538,230356,96441,107137,93021,230356,58599
North Azerbaijani,182348,182348,182348,182348,139670,182348,182216,164849,159049,202067,68418,79961,68419,88807,198069,327051,282580,122337,167502,131737,282580,160390
Bashkir,316164,316164,316164,316164,226124,316164,313601,126302,218262,97842,122871,77959,122872,105089,197474,480838,262555,189762,98443,73521,262555,42632
Bambara,140011,140011,139965,139965,135569,140011,145217,123509,116633,167226,108316,108853,108317,108746,156086,270935,249080,100402,90958,70648,249080,138189
Balinese,103682,103682,103666,103666,95203,103682,109604,114664,107056,247527,78612,81765,78613,84863,137278,288815,287688,77770,134647,98690,287688,139353
Belarusian,344748,344748,344748,344748,187319,344748,328950,174057,240969,221292,87245,98633,87246,104504,222105,535528,293469,172266,250560,99776,293469,50884
Bemba,129364,129364,129364,129364,117904,129364,133283,134862,128043,273099,104974,105613,104975,102918,173992,318923,318153,102359,159038,117182,318153,169134
Bengali,507509,507509,507507,507507,308325,507509,674272,79472,214970,45227,82135,98507,82136,103680,79481,676264,254327,62014,44467,54583,254327,41936
Bhojpuri,377309,377309,377267,377267,247793,377309,637964,104652,200403,57831,87860,97689,87861,107289,104629,641057,252371,81327,61484,75210,252371,55920
Banjar (Arabic script),264266,264266,264266,264266,200951,264266,436641,82110,201154,92742,114731,122324,114732,115648,82251,437480,240281,131401,100285,80363,240281,45094
Banjar (Latin script),103982,103982,103982,103982,90276,103982,109168,108556,101073,233919,72076,73211,72077,76538,127530,272644,272244,69307,127025,92101,272244,132944
Standard Tibetan,784601,784601,784570,784570,595659,784601,857781,9929,286046,148576,41954,9870,41955,242119,9915,859707,291590,354304,147821,147672,291590,194970
Bosnian,115005,115005,115004,115004,98989,115005,116335,117347,112807,220911,66919,73907,66920,87222,143283,267950,260762,97705,133927,96857,260762,134892
Buginese,115798,115798,115791,115791,104607,115798,117358,116092,113066,237814,89886,94280,89887,94630,145074,283841,275699,90959,139630,105783,275699,143530
Bulgarian,289393,289393,289393,289393,139501,289393,276630,162063,215050,201725,68968,77990,68969,84216,193583,489890,270630,132248,227106,153501,270630,49896
Catalan,100917,100917,100917,100917,90427,100917,110332,106665,109174,238062,75451,79876,75452,89612,124034,291806,285853,62983,124819,102473,285853,134705
Cebuano,117716,117716,117716,117716,101774,117716,124007,127770,117575,259722,90833,87220,90834,93284,165260,311013,310925,94620,145902,113208,310925,158499
Czech,137859,137859,137859,137859,111264,137859,136455,129272,123396,213411,69647,77680,69648,83670,157688,280405,251082,107754,126693,86320,251082,107196
Chokwe,113735,113735,113735,113735,104503,113735,119429,119231,110095,236893,92286,93123,92287,92683,154067,278117,277102,91317,141673,104766,277102,147047
Central Kurdish,341390,341390,341390,341390,253685,341390,459720,78771,213031,46831,136941,157264,136942,114761,78770,461199,251845,170668,158989,57373,251845,41518
Crimean Tatar,130893,130893,130659,130659,111855,130893,131923,134640,128877,172511,82206,86741,82207,86988,162045,291908,265769,110003,139670,104210,265769,128153
Welsh,123262,123262,123262,123262,111804,123262,131194,134885,127694,229811,85327,91291,85328,111445,180487,277959,276951,111312,149568,125348,276951,151952
Danish,100083,100083,99973,99973,85831,100083,99284,111753,107454,223959,65265,71154,65266,75027,130567,272408,266753,88852,123375,98506,266753,119120
German,112600,112600,112600,112600,83450,112600,58508,123942,116552,259689,69727,78466,69728,78142,79288,307304,302377,89320,139166,108976,302377,141390
Southwestern Dinka,130281,130281,130254,130254,118865,130281,125995,114303,109959,162625,100413,98186,100414,104155,142648,250279,221929,96684,93382,81202,221929,112280
Dyula,115705,115705,115705,115705,108433,115705,121866,114862,108895,212549,98157,96750,98158,101708,154918,278145,261275,95832,125478,111235,261275,144030
Dzongkha,860070,860070,860066,860066,651364,860070,942994,26999,310026,162796,58041,26989,58042,278950,27005,943132,323571,391306,162773,162768,323571,67780
Greek,343953,343953,343953,343953,271846,343953,393935,98127,259761,249809,86669,100218,86670,108200,98202,562986,310159,202450,260481,65658,310159,75035
English,52567,52567,52567,52567,52835,52567,78923,80199,83535,216302,59656,63374,59657,65729,57876,259396,259170,53174,96467,53946,259170,103383
Esperanto,106503,106503,106503,106503,98813,106503,108038,108276,105543,217770,71512,87589,71513,78270,126939,263370,258512,87615,119270,89255,258512,116991
Estonian,111089,111089,111089,111089,98826,111089,109399,113611,110867,221758,66864,76037,66865,73511,140518,262393,254538,94067,123324,91983,254538,121532
Basque,110251,110251,110250,110250,99592,110251,109524,115691,111466,240728,69305,77988,69306,80064,134955,276279,275448,60830,136206,102270,275448,139769
Ewe,152353,152353,152353,152353,145385,152353,155357,135264,121730,168581,120047,118048,120048,119854,165046,276783,252072,112330,100448,76916,252072,135643
Faroese,125367,125367,125367,125367,109406,125367,130964,131658,121814,186548,85972,89494,85973,91884,157775,284012,263787,103830,136254,94405,263787,117253
Fijian,121044,121044,121044,121044,113623,121044,132003,122256,116500,244685,102559,102897,102560,104403,174918,303678,303403,105667,159150,108251,303403,158535
Finnish,119621,119621,119621,119621,105276,119621,115400,125385,123141,244727,67990,77809,67991,76281,151302,287021,276102,100568,137245,110839,276102,149717
Fon,214346,214346,214346,214346,193724,214346,217332,158537,150103,158506,149608,146704,149609,155092,203927,327076,265357,117430,100261,91583,265357,154975
French,104971,104971,104971,104971,84407,104971,116382,67031,115302,258575,77352,84236,77353,91954,92423,320613,308489,63881,128023,105894,308489,140258
Friulian,108675,108675,108675,108675,97607,108675,115943,111015,110929,232073,92873,93343,92874,99837,133187,293730,284088,90630,123275,104739,284088,132954
Nigerian Fulfulde,104628,104628,104628,104628,97955,104628,108352,103243,96944,186121,87218,80379,87219,87007,123788,247982,242184,88035,111927,83039,242184,124587
West Central Oromo,132819,132819,132819,132819,122460,132819,135871,139003,134682,268913,106294,94455,106295,110976,182606,311988,307188,116457,157621,117080,307188,168782
Scottish Gaelic,141855,141855,141855,141855,127836,141855,146408,144558,134854,267356,104559,102196,104560,121647,187248,332878,320213,119665,151269,122313,320213,153752
Irish,134446,134446,134446,134446,123062,134446,138891,140529,129584,249323,89323,95296,89324,109777,181920,318859,299807,114550,139543,132936,299807,156000
Galician,100549,100549,100549,100549,82499,100549,109486,109337,108753,240324,67114,72186,67115,86400,126094,293863,286720,67516,125278,102970,286720,136270
Guarani,129285,129285,129268,129268,114837,129285,132854,124168,121173,226759,102405,103340,102406,106773,148620,281888,261956,99535,135241,107150,261956,129575
Gujarati,644828,644828,644817,644817,406224,644828,645186,86997,206252,48164,84540,99875,84541,113530,86977,647801,248721,71638,47321,64036,248721,47009
Haitian Creole,100092,100092,100092,100092,91870,100092,106609,105879,96290,193530,83038,73297,83039,79946,134250,245512,238876,83111,113484,90566,238876,122543
Hausa,112972,112972,112972,112972,105931,112972,117546,118065,105075,219549,83445,81634,83446,89867,151250,279824,276665,94598,129358,96035,276665,139593
Hebrew,231019,231019,231019,231019,193616,231019,356374,74001,165362,43248,67112,77338,67113,80177,73998,359555,201565,155280,165810,41632,201565,41219
Hindi,392092,392092,392092,392092,253270,392092,658313,104653,206714,57485,74861,86379,74862,104604,104672,661374,258472,68276,61966,62712,258472,55596
Chhattisgarhi,378849,378849,378849,378849,247937,378849,635526,101998,198867,57685,84368,95768,84369,105029,102245,638720,250204,76711,60177,72073,250204,53987
Croatian,113059,113059,113058,113058,97676,113059,114870,114787,110874,216201,65682,72894,65683,85612,140906,262354,255014,95947,131330,94765,255014,131810
Hungarian,139602,139602,139600,139600,113411,139602,141611,142790,131104,235951,70308,81277,70309,82939,172889,299608,272675,109899,135056,124421,272675,148102
Armenian,526451,526451,526451,526451,527113,526451,526464,81045,248181,46996,82580,94934,82581,103808,81050,529415,287555,229282,45900,45630,287555,44419
Igbo,179977,179977,179977,179977,129168,179977,183972,141791,123799,214673,126634,93374,126635,117780,183723,312799,263254,91237,144576,92627,263254,117234
Ilocano,118613,118613,118613,118613,108129,118613,125270,129022,117649,262670,96095,84539,96096,105633,163179,314085,313360,101123,149848,108277,313360,160015
Indonesian,104006,104006,103989,103989,82006,104006,108315,112174,104373,241697,56004,61915,56005,70942,129783,281022,280788,51241,130092,93990,280788,137026
Icelandic,127643,127643,127643,127643,113423,127643,135970,135673,125216,164395,73561,81848,73562,86678,162574,282751,255925,105890,129624,88083,255925,104434
Italian,105594,105594,105594,105594,86628,105594,112844,109149,111317,257935,70965,79123,70966,88132,125884,307457,305238,86218,136373,103826,305238,141489
Javanese,101610,101610,101610,101610,91337,101610,107079,111582,100716,230239,68844,69843,68845,79381,128021,269810,269310,74537,131089,94006,269310,132911
Japanese,157858,157858,157858,157858,121627,157858,254548,7823,109819,111466,66414,76356,66415,59289,7788,329167,113611,96085,97622,62538,113611,69209
Kabyle,131364,131364,131364,131364,130359,131364,137564,127175,119182,195433,109639,108225,109640,119434,163686,274466,256723,107463,124098,91910,256723,128276
Jingpho,139457,139457,139456,139456,124278,139457,149129,142637,128420,259923,115947,112627,115948,117829,197335,330717,330686,113746,164614,125372,330686,171095
Kamba,121739,121739,121739,121739,114796,121739,127481,118543,108374,211466,96785,96361,96786,100228,155742,263257,253015,93920,128103,85132,253015,119634
Kannada,719397,719397,719274,719274,470190,719397,731252,69244,237693,38301,81107,96883,81108,94741,69237,735146,272113,69560,37225,57383,272113,39045
Kashmiri (Arabic script),325291,325291,325145,325145,244307,325291,444451,90825,200969,61554,115090,122425,115091,131446,90829,447300,248311,123488,121604,94413,248311,47025
Kashmiri (Devanagari script),369802,369802,369778,369778,247927,369802,612620,99203,194507,61900,108675,117719,108676,117866,99416,621887,249147,98492,62201,94654,249147,55490
Georgian,727866,727866,727866,727866,520316,727866,727448,74007,247918,43649,80173,99022,80174,101748,73969,764251,283885,264919,41942,41598,283885,41772
Kazakh,311213,311213,311212,311212,200493,311213,308595,131792,222082,125483,68428,81093,68429,79112,199145,489005,266102,171823,151041,78017,266102,40514
Kabiyè,256080,256080,256066,256066,250629,256080,258642,173564,176184,115963,177790,171937,177791,185803,212995,356120,282100,177842,79365,69936,282100,181488
Kabuverdianu,101684,101684,101684,101684,91103,101684,104228,104206,100905,212659,80636,82272,80637,84027,127940,263624,257338,80174,120718,97797,257338,133103
Halh Mongolian,337351,337351,337351,337351,199035,337351,334336,147114,227151,162538,72046,84635,72047,97042,213998,495824,270527,179887,193950,101965,270527,43784
Khmer,805983,805983,805983,805983,469026,805983,806501,59829,285730,26501,96638,118485,96639,94217,59993,864382,305984,340448,20091,17816,305984,96761
Kikuyu,181066,181066,180960,180960,173742,181066,186000,154133,138417,255442,138024,137581,138025,143352,196687,336536,302637,131787,150928,73081,302637,96233
Kinyarwanda,124507,124507,124507,124507,113070,124507,127053,127430,122870,248670,102456,103304,102457,99277,159781,294412,288228,84146,148708,115812,288228,155270
Kyrgyz,301606,301606,301606,301606,185559,301606,299457,138165,222699,192583,69278,105463,69279,86514,203777,487931,265506,160837,199078,134335,265506,41054
Kimbundu,122622,122622,122622,122622,112524,122622,129691,126512,119666,241994,98046,97621,98047,97043,168245,288236,286800,96494,149610,107216,286800,156856
Northern Kurdish,128950,128950,128950,128950,116457,128950,130825,132522,117175,213428,82489,105243,82490,93428,158576,285648,257926,108099,124890,82847,257926,84067
Central Kanuri (Arabic script),249173,249173,249173,249173,191917,249173,410142,73595,189321,80361,155367,157984,155368,160018,73550,414556,228438,111740,67235,127913,228438,40507
Central Kanuri (Latin script),135330,135330,135330,135330,125283,135330,140581,128497,120540,185972,103625,104673,103626,104001,163460,287925,273117,106465,118250,88169,273117,128294
Kikongo,114227,114227,114227,114227,105361,114227,127161,115771,114676,241355,94290,93629,94291,95889,174143,295155,294834,92840,153421,106477,294834,158706
Korean,266663,266663,266660,266660,125737,266663,304725,66637,97696,214170,69124,76644,69125,83718,66645,310727,130986,148546,125267,36801,130986,37961
Lao,693551,693551,693550,693550,508348,693551,694035,29037,235138,29653,82850,101726,82851,83315,28978,707645,257362,462849,22010,19116,257362,105290
Ligurian,120391,120391,120391,120391,104558,120391,123687,119908,119632,235685,98515,100538,98516,111123,147062,302581,286080,96349,133177,110857,286080,130522
Limburgish,107733,107733,107733,107733,95157,107733,105731,111137,110596,225590,86570,87324,86571,90468,130066,278124,269296,93152,127464,103761,269296,131832
Lingala,106455,106455,106455,106455,98517,106455,116034,109537,105400,232882,90766,79977,90767,90700,158212,279939,279583,87885,141933,102285,279583,145278
Lithuanian,128701,128701,128701,128701,116635,128701,128824,122546,118645,223996,70001,78909,70002,80627,149057,275925,258253,103350,128643,79995,258253,104388
Lombard,124681,124681,124681,124681,107743,124681,124609,121716,118002,224668,101898,99104,101899,111495,149212,300684,278244,97705,124335,105633,278244,128877
Latgalian,125800,125800,125799,125799,116433,125800,131764,130139,123432,220726,93413,95927,93414,96113,156294,271741,256016,105855,131014,106224,256016,133466
Luxembourgish,118140,118140,118140,118140,104985,118140,102428,121814,119575,247827,97961,83888,97962,96291,129506,298710,290166,100571,135153,116866,290166,135901
Luba-Kasai,111796,111796,111796,111796,102583,111796,118429,115197,109428,234942,92166,90561,92167,90089,143377,280012,279832,89426,139093,102067,279832,145909
Ganda,114169,114169,114169,114169,103642,114169,117146,118218,113889,230752,92353,87471,92354,92028,153108,268270,265137,89042,140788,104487,265137,145269
Luo,107393,107393,107393,107393,96393,107393,110396,111461,105676,223587,90538,90396,90539,92902,147703,271845,271168,89122,130663,100968,271168,139626
Mizo,109968,109968,109936,109936,103801,109968,120526,121894,107582,229391,98499,97666,98500,103490,159888,285572,284965,97367,137722,103538,284965,141753
Standard Latvian,133664,133664,133664,133664,124411,133664,138821,134492,130020,228113,73546,81921,73547,85736,160833,289046,264388,110674,130600,101888,264388,141844
Magahi,379598,379598,379597,379597,248523,379598,636697,100620,198869,55884,84403,95355,84404,106136,100654,639032,249109,76977,58308,72201,249109,52703
Maithili,390772,390772,390772,390772,258768,390772,652899,98814,204666,55282,94310,104117,94311,114345,98807,655569,254221,82790,57092,80828,254221,51906
Malayalam,801380,801380,801332,801332,475535,801380,801618,64905,262479,36916,82039,100506,82040,88992,64895,805120,293448,73183,35837,63546,293448,36910
Marathi,413498,413498,413459,413459,267905,413498,691253,78893,222435,43827,72668,87538,72669,100064,78903,693619,261180,64398,44964,57010,261180,42706
Minangkabau (Arabic script),275937,275937,275937,275937,209767,275937,450840,83898,207581,87249,120767,125921,120768,120685,83944,451565,247588,136964,108557,84650,247588,46179
Minangkabau (Latin script),103515,103515,103515,103515,93314,103515,110793,111845,104242,236464,78163,79094,78164,82037,135960,276661,276447,76545,130784,95339,276447,136909
Macedonian,287256,287256,287255,287255,146208,287256,274701,170656,215520,198011,69908,78646,69909,84820,193977,490881,270378,132974,196687,148149,270378,48590
Maltese,141664,141664,141664,141664,127435,141664,141958,137935,131061,223294,116915,118299,116916,111373,169882,301178,286521,119539,138499,100804,286521,132837
Meitei (Bengali script),537035,537035,537031,537031,354727,537035,714848,78284,228906,44749,152589,163965,152590,144945,78263,717455,267963,125103,44044,126265,267963,41968
Mossi,133299,133299,133299,133299,122348,133299,137491,121347,115535,183555,106415,105404,106416,118443,167997,267823,247517,105627,115195,90459,247517,117912
Maori,128730,128730,128730,128730,124332,128730,139528,135219,122779,227957,111143,110110,111144,111337,190121,301068,287051,112586,143900,114516,287051,149583
Burmese,887930,887930,887927,887927,618268,887930,888482,42762,297952,29883,102718,139944,102719,102566,42757,911313,320882,534510,28291,27729,320882,35510
Dutch,103566,103566,103566,103566,84254,103566,101070,112431,110240,243532,68075,74602,68076,76684,126814,289068,288665,90731,133150,103224,288665,137350
Norwegian Nynorsk,101503,101503,101503,101503,86536,101503,99045,112500,107965,220190,70057,74146,70058,77387,132296,269067,263026,87533,123432,98322,263026,126356
Norwegian Bokmål,97720,97720,97720,97720,82508,97720,96684,109934,105860,218898,63780,69761,63781,73405,129726,266748,261039,85981,121070,96432,261039,122233
Nepali,398853,398853,398853,398853,252889,398853,660650,75281,210954,43888,67463,81179,67464,96395,75275,664662,249703,62148,44014,54422,249703,40157
Northern Sotho,121975,121975,121975,121975,115442,121975,128420,126908,123249,241631,104521,96361,104522,103446,162466,304765,297511,103338,142768,117845,297511,153466
Nuer,222487,222487,222294,222294,211318,222487,219996,165630,163406,157472,156169,154335,156170,159275,203900,343204,280059,148266,94096,80096,280059,172631
Nyanja,118972,118972,118972,118972,110115,118972,124300,124303,118341,252030,94859,98042,94860,88840,157062,291363,290698,94469,146982,108899,290698,149128
Occitan,108564,108564,108564,108564,96606,108564,116408,112576,115322,247081,89419,83283,89420,97576,130600,304250,294944,79296,127922,103994,294944,137862
Odia,703585,703585,703579,703579,659440,703585,705821,80061,225800,45297,86376,99019,86377,204745,80082,709275,265861,72384,44249,65490,265861,43026
Pangasinan,87191,87191,87191,87191,82719,87191,99993,99902,92749,216247,76766,77833,76767,80227,126028,258211,258142,76906,120015,83257,258142,124607
Eastern Panjabi,415296,415296,415296,415296,415623,415296,668463,104343,209780,58822,93737,106290,93738,138377,104351,672235,261839,76290,57539,72700,261839,55302
Papiamento,104236,104236,104236,104236,92425,104236,104744,110164,104066,223400,82016,83786,82017,89114,132095,280322,273279,82092,120711,96986,273279,133963
Southern Pashto,283451,283451,283451,283451,202207,283451,423740,109009,188701,83837,82222,88992,82223,107480,109167,430166,246722,135424,92922,85832,246722,59735
Western Persian,279481,279481,279481,279481,173333,279481,432074,101304,196273,78678,65918,73910,65919,88203,101373,440163,244860,94432,107484,87412,244860,57779
Plateau Malagasy,135619,135619,135619,135619,119189,135619,137421,135901,124463,271859,93654,94172,93655,104757,173758,325965,316908,110108,158636,125488,316908,164355
Polish,141182,141182,141182,141182,101102,141182,141254,137085,132050,216024,70891,80029,70892,86127,163197,292098,274970,113594,146715,99390,274970,121041
Portuguese,101774,101774,101765,101765,78313,101774,109158,108852,108466,235580,66406,72168,66407,84640,128190,289511,281590,59813,125447,101495,281590,128052
Dari,268840,268840,268840,268840,166945,268840,418739,97655,188417,80625,64923,72850,64924,86220,97677,422038,237961,87464,105149,84975,237961,52265
Ayacucho Quechua,115829,115829,115829,115829,109926,115829,127131,123437,116930,245549,95148,97898,95149,93642,149972,278975,276777,97496,141619,105279,276777,146464
Romanian,130433,130433,130433,130433,99201,130433,133544,123322,121856,245167,74034,81531,74035,89729,86617,307925,291717,101592,128736,87690,291717,111136
Rundi,122445,122445,122429,122429,112641,122445,128356,127656,122997,248188,101867,103202,101868,99604,160924,290494,289035,87453,148549,115067,289035,155208
Russian,301727,301727,301726,301726,131496,301727,290010,158588,226451,222122,70031,77512,70032,83361,204346,514650,282208,132062,241387,157810,282208,47163
Sango,117409,117409,117409,117409,109951,117409,121661,120462,110639,219737,98752,97236,98753,107411,181962,290129,281357,95574,139566,110502,281357,154203
Sanskrit,417136,417136,417133,417133,264225,417136,678863,69453,218070,40535,85075,106855,85076,108350,69445,682390,254233,86484,39395,65020,254233,37569
Santali,675930,675930,675929,675929,676349,675930,675937,97648,227755,53011,97661,97642,97662,176524,97687,723211,274886,675805,52919,52894,274886,51174
Sicilian,119419,119419,119419,119419,105965,119419,123844,114797,114164,229071,94093,97030,94094,100524,142627,287680,273185,95956,136096,99295,273185,132809
Shan,986091,986091,986085,986085,795074,986091,987434,41902,335280,42254,263999,293592,264000,215725,41365,1020993,368897,641171,31344,27076,368897,90879
Sinhala,675811,675811,675806,675806,466435,675811,678328,91626,217749,47836,80352,96745,80353,108866,91632,683740,260389,436463,46635,46455,260389,51931
Slovak,132406,132406,132406,132406,112877,132406,130351,128214,121835,221546,70420,78465,70421,85374,158343,283449,260044,106872,130514,95243,260044,121069
Slovenian,111123,111123,111123,111123,99575,111123,114890,115649,110608,218816,67649,75111,67650,78923,140116,264297,258309,96182,132122,94182,258309,134145
Samoan,135286,135286,135286,135286,121040,135286,133154,130690,124947,235821,114297,113938,114298,126188,179105,315363,301267,113270,151254,119932,301267,159873
Shona,120455,120455,120455,120455,112401,120455,124984,126390,120317,256147,97431,99977,97432,88729,161542,290272,290236,95881,149383,111107,290236,153133
Sindhi,262974,262974,262974,262974,211562,262974,412078,101026,184033,87161,76539,82308,76540,114413,101046,414811,234918,133480,86905,65560,234918,53907
Somali,123967,123967,123967,123967,115377,123967,130693,135538,123653,250484,82896,87135,82897,97328,177052,296373,295873,108016,146788,110739,295873,156858
Southern Sotho,123024,123024,123024,123024,116991,123024,129269,131043,123654,255719,106299,101495,106300,104203,168891,312773,312241,104296,154957,116440,312241,159136
Spanish,104549,104549,104548,104548,81735,104549,114094,115325,113958,257284,71459,76775,71460,86318,128905,314077,308284,64466,132786,106948,308284,145418
Sardinian,118821,118821,118821,118821,105258,118821,121127,118457,116701,251257,95752,95788,95753,103494,142414,309950,301401,92156,133157,106888,301401,140989
Serbian,280731,280731,280725,280725,154192,280731,269463,156248,204589,163030,70646,79791,70647,85556,194005,466463,257321,136830,164311,124028,257321,46732
Swati,121681,121681,121677,121677,114096,121681,125659,128417,121453,260765,96139,91104,96140,92890,162234,291685,291628,97063,149923,112753,291628,156659
Sundanese,106071,106071,106071,106071,96166,106071,110038,111251,103720,231104,73021,69744,73022,80459,134315,272644,269967,78474,128625,96990,269967,135056
Swedish,102557,102557,102489,102489,83368,102557,96202,113476,109778,220619,63927,69772,63928,73261,128558,271024,260666,87945,116462,102548,260666,123549
Swahili,112044,112044,112044,112044,103109,112044,117978,114281,110022,230132,69335,76352,69336,82050,154117,272166,271792,66080,139702,100565,271792,147678
Silesian,136571,136571,136571,136571,115365,136571,137368,136327,133069,214742,98456,100997,98457,103492,166242,285706,269821,114749,146209,116074,269821,134349
Tamil,819110,819110,819084,819084,404265,819110,819443,71205,267457,41143,80444,98132,80445,82947,71190,822452,302271,67792,39935,57000,302271,39451
Tamasheq (Latin script),125394,125394,125394,125394,117339,125394,127786,120090,107965,178063,101928,99757,101929,107584,147324,262500,245729,100840,106531,76705,245729,107362
Tamasheq (Tifinagh script),548473,548473,548473,548473,535378,548473,548700,119369,199613,83443,119320,118946,119321,236215,90673,593545,244280,411636,125317,80737,244280,81361
Tatar,305861,305861,305861,305861,198081,305861,303215,126070,217763,146263,107718,97435,107719,92640,201698,479926,262636,167238,148212,110755,262636,42534
Telugu,688151,688151,688126,688126,440831,688151,688897,73242,224597,42936,79091,92140,79092,93592,73166,694562,261716,70848,40829,65209,261716,41538
Tajik,320045,320045,320045,320045,192259,320045,315920,196021,235753,178938,127526,130315,127527,106398,222852,521168,287415,174935,230544,120176,287415,50929
Tagalog,119822,119822,119658,119658,108909,119822,128762,134171,121010,273629,85423,90331,85424,95719,165047,326793,326630,98578,150059,112334,326630,165309
Thai,475633,475633,475632,475632,231995,475633,520076,17546,236002,14809,64512,80663,64513,64898,17552,713841,248859,246116,19953,10599,248859,119386
Tigrinya,414484,414484,414484,414484,412308,414484,414536,77416,139662,44467,117506,121172,117507,133644,77423,454149,178958,274296,43509,43384,178958,40420
Tok Pisin,116032,116032,116032,116032,107799,116032,122346,133519,121212,269513,102949,104877,102950,108762,159801,331464,331192,102258,155553,113268,331192,162326
Tswana,125587,125587,125587,125587,120710,125587,132720,134201,129455,262020,110661,106750,110662,110234,173957,323250,322918,107602,155913,121139,322918,162717
Tsonga,128795,128795,128795,128795,119187,128795,134248,136016,122354,257578,106752,106802,106753,105816,181153,312392,310434,106783,158885,117943,310434,169911
Turkmen,148036,148036,148036,148036,127012,148036,138738,142456,135583,240203,106090,108524,106091,110577,166347,303011,275172,116601,138788,82733,275172,98543
Tumbuka,146210,146210,146210,146210,135756,146210,152556,148078,139306,290363,114679,118850,114680,105756,190205,342449,338139,116690,172660,116804,338139,165582
Turkish,127797,127797,127797,127797,101176,127797,126681,132058,126042,180958,61972,72580,61973,73434,154269,289924,266383,104088,139867,99569,266383,123214
Twi,137471,137471,137471,137471,132821,137471,141707,126182,115474,173917,112411,110223,112412,112189,164908,273291,253416,96323,100888,80651,253416,144273
Central Atlas Tamazight,546058,546058,546058,546058,530718,546058,545844,92474,186053,50900,92349,92314,92350,228864,92483,591767,231581,408891,129471,49561,231581,49590
Uyghur,376366,376366,376366,376366,274341,376366,507965,72619,239537,46133,83957,190015,83958,168749,72598,511363,276109,194931,123299,79490,276109,39382
Ukrainian,302058,302058,302053,302053,158296,302058,291269,147881,215613,203693,71931,81306,71932,87227,200470,483479,265408,146145,226342,102189,265408,45862
Umbundu,117580,117580,117580,117580,106384,117580,121024,118998,113677,227635,93947,94237,93948,96604,157321,271523,262904,92708,140549,104492,262904,137958
Urdu,331280,331280,331278,331278,231974,331280,452920,109138,200807,71900,73620,82182,73621,99921,109199,455529,256560,72171,140013,67978,256560,58740
Northern Uzbek,120831,120831,120831,120831,114577,120831,128551,127816,123290,257429,79559,87115,79560,90504,161946,293473,293146,105295,152589,114395,293146,158219
Venetian,105316,105316,105316,105316,89961,105316,108907,107443,102465,195878,81416,83014,81417,89580,128185,273797,261537,83714,119988,99461,261537,127037
Vietnamese,238544,238544,238544,238544,129258,238544,241285,167208,69628,212672,70314,72894,70315,128452,210805,360436,272596,67335,133327,95986,272596,100976
Waray,125136,125136,125136,125136,103003,125136,127028,133122,118966,269575,92332,92160,92333,95050,153866,323306,323168,95906,153943,116083,323168,156697
Wolof,112432,112432,112432,112432,101369,112432,117841,114816,107121,200594,95168,88570,95169,94966,151584,258805,249234,89305,123622,104207,249234,130362
Xhosa,119033,119033,119033,119033,108969,119033,123648,125740,116871,244331,89389,86598,89390,88735,157869,275162,274900,88866,146904,110840,274900,149963
Eastern Yiddish,348452,348452,348447,348447,294166,348452,500081,96376,231663,54220,94121,101844,94122,109303,96308,503003,278746,235209,232041,53311,278746,52693
Yoruba,204447,204447,204447,204447,156430,204447,207890,153502,138921,189973,135479,110007,135480,135311,191106,332203,251232,87077,119607,90969,251232,118110
Yue Chinese,162457,162457,162457,162457,111948,162457,219180,13247,74583,76927,55486,65409,55487,62583,13056,225376,80134,49618,74913,74441,80134,56593
Chinese (Simplified),168850,168850,168850,168850,101138,168850,231200,16920,79501,83317,57887,66386,57888,60193,16694,240075,87445,50330,80224,79320,87445,56652
Chinese (Traditional),165883,165883,165883,165883,114977,165883,223346,13692,75813,78103,57297,67258,57298,64529,13564,229715,81884,51471,76225,75800,81884,58453
Standard Malay,107633,107633,107633,107633,85585,107633,112462,116301,107182,249178,56764,63455,56765,73097,134140,289645,288891,56793,133623,96834,288891,140806
Zulu,126593,126593,126593,126593,116322,126593,130363,131628,123210,259878,92688,85533,92689,92182,164151,291196,290777,93438,156409,116080,290777,158734
