I don't think [this](https://github.com/huggingface/tokenizers/blob/main/tokenizers%2Fsrc%2Fnormalizers%2Fbert.rs#L33-L34) is true for Japanese? Kana is not separated both for Japanese words and latin words.