Tokenizer: A tokenizer for Icelandic text

Tokenization is a necessary first step in many natural language processing
tasks, such as word counting, parsing, spell checking, corpus generation, and
statistical analysis of text.