CharFilters: process text before the Tokenizer