The segments package provides Unicode Standard tokenization routines and
orthography segmentation, implementing the linear algorithm described in
the orthography profile specification from The Unicode Cookbook.
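
As a rough illustration of what orthography segmentation means, the sketch below splits a string into the units listed in a small profile, scanning left to right and preferring the longest matching unit. It uses only the Python standard library and is not the package's API: the function name `segment`, its parameters, and the choice of U+FFFD as the placeholder for unmatched characters are assumptions made for this example. It is also a naive matcher, not tuned to run in linear time.

```python
import unicodedata


def segment(text, graphemes, missing="\ufffd"):
    """Split text into profile units by greedy longest match (hypothetical helper)."""
    # Normalize both sides so that canonically equivalent sequences compare equal.
    text = unicodedata.normalize("NFD", text)
    units = sorted(
        {unicodedata.normalize("NFD", g) for g in graphemes if g},
        key=len,
        reverse=True,  # try longer units first
    )
    out = []
    i = 0
    while i < len(text):
        for unit in units:
            if text.startswith(unit, i):
                out.append(unit)
                i += len(unit)
                break
        else:
            # No unit in the profile matches here: emit a placeholder and move on.
            out.append(missing)
            i += 1
    return out


print(segment("tʃaa", ["t", "tʃ", "a", "aa"]))
```

With this profile, `tʃ` and `aa` are each kept together as one unit instead of being split into single characters, and a character missing from the profile is replaced by the placeholder.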