summaryrefslogtreecommitdiff
path: root/textproc/py-segments/pkg-descr
blob: 1dc8d1ad243d2df98e95b4cbdfb25fe5c05dfde0 (plain) (blame)
1
2
3
The segments package provides Unicode Standard tokenization routines and
orthography segmentation, implementing the linear algorithm described in
the orthography profile specification from The Unicode Cookbook.