Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.
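The limitation can be seen in a small stand-alone sketch. This is not Lucene's implementation: the class `LetterLowerCaseSketch` and its `tokenize` method are hypothetical names, and the code assumes only the general letter-tokenizer behavior of splitting at non-letter characters and lower-casing the rest. It uses nothing beyond the Java standard library.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
class LetterLowerCaseSketch {

    // Splits the text at every non-letter code point and lower-cases each letter.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (Character.isLetter(cp)) {
                current.appendCodePoint(Character.toLowerCase(cp));
            } else if (current.length() > 0) {
                // A non-letter ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Space-separated text splits into words.
        System.out.println(tokenize("Hello, World!"));
        // A sentence written without spaces comes back as a single token.
        System.out.println(tokenize("今日は良い天気です").size());
    }
}
```

Because every character in the second input counts as a letter, nothing marks a word boundary, and the whole sentence is returned as one token.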
You must specify the required {@link Version} compatibility when creating {@link LowerCaseTokenizer}: