This is the characteristic for plain BPE: it is based solely on distribution, meaning it does not have knowledge of which bytes can form a valid Unicode codepoint, character, or meaningful word.The byproduct is that text may be sub-tokenized differently in different contexts, even for words ...
It also has a "tri-state" option, meaning a node with some of its children checked will get a "square" icon.Keep in mind that if any sort of cascade is enabled, disabled nodes may be checked too (not by themselves, but for example when a parent of a disabled node is checked and ...