Density,Vocab Diversity,Local vs Global (“contexty”),Tags,Necessary context,Necessary condition,Success Criterion,Feature ID,Layer,Human explanation,Feature Grade
low (<0.1%),low,local/isolated,,,,Contains “sales”,9824,22,“sales”,4
low (<0.1%),low,local/isolated,,,,Contains “\n”,15118,17,“\n”,3
low (<0.1%),low,local/isolated,,,,,13166,21,"file paths with dots, often in imports",3
low (<0.1%),low,local/isolated,,,,Contains “}”,5,19,closing “}”,5 (best quality)
low (<0.1%),low,local/isolated,,,,Contains “->”,14358,23,“->”,4
low (<0.1%),low,local/isolated,,,,,15959,21,“theory”,4
low (<0.1%),low,local/isolated,context-dependent,code,,Contains “return” or “return;”,7581,24,“Return” or “return;”,5 (best quality)
low (<0.1%),low,local/isolated,context-dependent,"French, Spanish or German text",,,2815,25,"“Un”, “ein”",3
low (<0.1%),low,regional,,,,,9186,16,"Programming syntax, white space",2
low (<0.1%),low,regional,,,,,18,25,White spaces,2
low (<0.1%),low,global/contexty,context-dependent,C# code,,,13624,21,"Coding syntax, esp. “=”",2
low (<0.1%),low,global/contexty,,,,,7514,20,“Private static”,3
low (<0.1%),medium,local/isolated,context-dependent,code,,,6832,15,syntax elements in code,5 (best quality)
low (<0.1%),medium,local/isolated,,,,,15068,19,articles and prepositions in different languages,5 (best quality)
low (<0.1%),medium,local/isolated,,,,,11119,16,mention of different areas in the US,4
low (<0.1%),medium,regional,,,,Contains “why”,7544,15,Questions starting with “why”,4
low (<0.1%),medium,regional,,,,Contains “spend” or contains “spent”,7550,15,"Spending sth, mostly spending time",4
low (<0.1%),medium,regional,,,,,8517,17,Phrases relating age,5 (best quality)
low (<0.1%),medium,global/contexty,,,,,6686,24,Performance in sports,2
low (<0.1%),medium,global/contexty,,,,,15694,21,CSS properties and values,4
low (<0.1%),high,local/isolated,,,,,1,25,"Numbers, random symbols",1 (low quality)
low (<0.1%),high,local/isolated,,,,,1305,23,Phrases for asking questions in different languages,4
low (<0.1%),high,regional,context-dependent,Context of gratitude,,,4007,19,“for” when used in the context of gratitude/thanking someone,3
low (<0.1%),high,regional,,,,,11000,19,programming keywords,4
low (<0.1%),high,global/contexty,,,,,9131,16,Html document structure elements,4
low (<0.1%),high,global/contexty,,,,,14245,17,Expressions of sympathy and condolence,5 (best quality)
low (<0.1%),high,global/contexty,,,,,38,38,Czech language,4
medium (0.1%-0.5%),low,local/isolated,,,,Contains “James”,8637,23,“James”,5 (best quality)
medium (0.1%-0.5%),low,local/isolated,condition-dependent,,In most cases preceded by “in”,Contains “addition” or contains “additionally”,47,16,"“addition” when referring to supplementary material (“in addition”, “additionally”",5 (best quality)
medium (0.1%-0.5%),low,local/isolated,,,,,5834,21,negation of “can”,5 (best quality)
medium (0.1%-0.5%),low,local/isolated,,,,Contains “=”,7000,15,“=” in coding context,5 (best quality)
medium (0.1%-0.5%),low,local/isolated,,,,Contains “%”,13588,23,“%”,4
medium (0.1%-0.5%),low,local/isolated,,,,Contains “should”,11388,25,“should”,4
medium (0.1%-0.5%),low,local/isolated,,,,Contains “Microsoft”,1238,16,“Microsoft”,5 (best quality)
medium (0.1%-0.5%),low,local/isolated,,,,"Contains “,”",4992,24,"“,”",5 (best quality)
medium (0.1%-0.5%),low,regional,,,,Contains “average”,9230,16,Phrases starting with “average”,5 (best quality)
medium (0.1%-0.5%),low,regional,,,,,5804,20,Several mentions of “i” in variable assignment,3
medium (0.1%-0.5%),low,global/contexty,,,,Contains several numbers,14238,17,numbers,5 (best quality)
medium (0.1%-0.5%),low,global/contexty,,,,,6721,24,Lots of “&”,3
medium (0.1%-0.5%),medium,local/isolated,,,,,9785,23,Emoji feature,3
medium (0.1%-0.5%),medium,local/isolated,condition-dependent,,There is a “but” right after,"Ends with “,”",1116,23,The last word and comma before “but” in a sentence,4
medium (0.1%-0.5%),medium,local/isolated,condition-dependent,,The word “level” comes right after,,13175,21,Phrases with “... level”,5 (best quality)
medium (0.1%-0.5%),medium,local/isolated,,,,,105,19,reference to materials,5 (best quality)
medium (0.1%-0.5%),medium,local/isolated,,,,,13993,22,"Celebrities, esp. From the 80s-2000s",5 (best quality)
medium (0.1%-0.5%),medium,local/isolated,,,,,9030,15,"Family relations (husband, wife, father, brother)",4
medium (0.1%-0.5%),medium,regional,,,,,10001,19,"References to pregnancy, reproductive health",4
medium (0.1%-0.5%),medium,regional,,Python code,,,2867,25,References to python variables in loops,3
medium (0.1%-0.5%),medium,regional,context-dependent,sports,,,109,22,Performance metrics in sports,4
medium (0.1%-0.5%),medium,regional,,,,,4566,18,Types of bread or similar foods,5 (best quality)
medium (0.1%-0.5%),medium,regional,,,,,1977,25,The “in” in python loops,2
medium (0.1%-0.5%),medium,regional,,,,,12664,23,expletives and derogatory terms,5 (best quality)
medium (0.1%-0.5%),medium,regional,context-dependent,programming,,,6012,16,“class” when used in the programming context,5 (best quality)
medium (0.1%-0.5%),medium,regional,condition-dependent,,"Preceeded by “,”",,12881,18,“Which” as start of a subclause after comma,5 (best quality)
medium (0.1%-0.5%),medium,global/contexty,,,,,14279,17,Programming in a GUI context,3
medium (0.1%-0.5%),medium,global/contexty,,,,,5004,24,References to Swiss locations,3
medium (0.1%-0.5%),high,local/isolated,context-dependent,Topic of writing,,,3452,25,Sentence endings and beginnings,3
medium (0.1%-0.5%),high,local/isolated,,,,,2814,25,"“Well”, after rhetorical questions",1 (low quality)
medium (0.1%-0.5%),high,regional,,,,,16330,18,Topic of food,5 (best quality)
medium (0.1%-0.5%),high,regional,,,,,11921,22,Topic of Isolation and exclusion in social contexts,5 (best quality)
medium (0.1%-0.5%),high,regional,,,,,2399,18,Topic of Social identity and belonging,4
medium (0.1%-0.5%),high,regional,,,,,14283,17,Physics and velocity,2
medium (0.1%-0.5%),high,global/contexty,,,,,8,25,Phrases related to software licensing,4
medium (0.1%-0.5%),high,global/contexty,,,,,3390,25,Topic of climate change,4
medium (0.1%-0.5%),high,global/contexty,,,,,13572,24,French,5 (best quality)
medium (0.1%-0.5%),high,global/contexty,,,,,11691,24,capitalisation,5 (best quality)
medium (0.1%-0.5%),high,global/contexty,,,,,10004,23,The topic of health,5 (best quality)
medium (0.1%-0.5%),high,global/contexty,,,,,38,20,German,5 (best quality)
high (>0.5%),low,local/isolated,condition-dependent,,Start of sentence,“The” when at the start of the sentence,3142,18,“The” when at the start of the sentence,5 (best quality)
high (>0.5%),low,local/isolated,,,,,6706,24,“1”,4
high (>0.5%),low,local/isolated,,,,Contains “.”,3444,18,“.”,4
high (>0.5%),low,local/isolated,condition-dependent,,Not start of sentence,“a” when not at the start of the sentence,483,22,the article “a” when it’s not at the start of the sentence,5 (best quality)
high (>0.5%),low,local/isolated,,,,Contains “ “,9382,25,the tab character,5 (best quality)
high (>0.5%),low,local/isolated,,,,,6001,18,"citizenship, governance",4
high (>0.5%),low,local/isolated,context-dependent,quantitative,,,14244,17,“On”,5 (best quality)
high (>0.5%),low,regional,,,,,9219,16,Numbers,5 (best quality)
high (>0.5%),low,regional,,,,,15714,21,"Results of significance tests, statistical characteristics",4
high (>0.5%),low,global/contexty,,,,,2566,25,"repetition, but for symbols, e.g. ////",5 (best quality)
high (>0.5%),low,global/contexty,,,,,12694,21,"Brackets, arithmetic symbols",4
high (>0.5%),medium,local/isolated,condition-dependent,,Followed by the word “know” or “known”,,15004,22,Phrases containing “let me know” or “little is known”,5 (best quality)
high (>0.5%),medium,local/isolated,,,,,1538,16,"Negation, prevention, hindering",4
high (>0.5%),medium,local/isolated,,,,,6538,16,"Dependence, influence, effect and synonyms",2
high (>0.5%),medium,local/isolated,,,,,1491,16,“Moon” or phrases related to it,3
high (>0.5%),medium,regional,,,,,4899,24,Simple arithmetic,5 (best quality)
high (>0.5%),medium,regional,,,,,14013,22,Development of new scientific things,4
high (>0.5%),medium,regional,,,,,14289,17,“Applies to ..” “limited to …” and synonymous phrases,4
high (>0.5%),medium,global/contexty,,,,,4990,24,Phrases related to chemical processes,2
high (>0.5%),medium,global/contexty,,,,,5810,20,The topic of grief and mourning,4
high (>0.5%),high,local/isolated,,,,,9225,16,Coding elements,3
high (>0.5%),high,local/isolated,,,,,4986,24,Medical terms,3
high (>0.5%),high,regional,context-dependent,Medicine or chemistry,,,1328,23,Actions that happen in time/are repeated over time,5 (best quality)
high (>0.5%),high,regional,,,,,6779,23,Phrases related to solving problems,4
high (>0.5%),high,regional,,,,,10616,22,biology and animals,5 (best quality)
high (>0.5%),high,regional,,,,,11,20,applications of products,3
high (>0.5%),high,regional,,,,,33,25,Descriptions of professions,5 (best quality)
high (>0.5%),high,global/contexty,,,,,15367,22,Biological topics related to immune system,4
high (>0.5%),high,global/contexty,,,,,14002,22,Data management and processing,5 (best quality)
high (>0.5%),high,global/contexty,,,,,1486,16,Topics of algebra and topology,3
medium (0.1%-0.5%),low,local/isolated,bimodal density,,,,7390,22,“cur” in various contexts,3
medium (0.1%-0.5%),low,local/isolated,bimodal density,,,,9317,21,"strongly bimodal distribution, not sure what it activates on aside from personal pronouns",4
medium (0.1%-0.5%),low,local/isolated,bimodal density,,,,14072,23,"activates on left “(“; strongly bimodal, but unclear what else it activates on",4
high (>0.5%),low,local/isolated,bimodal density,,,,5816,20,The beginning of a sentence or a conversational turn or instance of code,4
high (>0.5%),low,local/isolated,bimodal density,,,,9134,23,"“Is”, “be”",4
