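from janome.tokenizer import Tokenizer

# Create the tokenizer once and reuse it; construction loads the bundled dictionary.
t = Tokenizer()

# Print each token: its surface form followed by its morphological features.
for token in t.tokenize('すもももももももものうち'):
    print(token)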