Tokenizer: A tokenizer for Icelandic text
Tokenization is a necessary first step in many natural language processing
tasks, such as word counting, parsing, spell checking, corpus generation, and
statistical analysis of text.