Python bindings for the Rust unicode-segmentation and unicode-width crates,
providing Unicode text segmentation (Unicode Standard Annex #29) and display
width calculation (Unicode Standard Annex #11, East Asian Width).

Features:
- Grapheme Cluster Segmentation: Split text into user-perceived characters
- Word Segmentation: Split text into words according to Unicode rules
- Sentence Segmentation: Split text into sentences
- Display Width Calculation: Get the display width of text (for
  terminal/monospace display)
- Gettext PO Wrapping: Wrap text for gettext PO files with proper handling of
  escape sequences and CJK characters
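
To illustrate what display width calculation means, here is a minimal sketch
that uses only the Python standard library. It does not use this package's API;
the function name approx_display_width is hypothetical, and the rules are a
simplified approximation of UAX #11 (wide and fullwidth characters take two
columns, combining marks and format characters take none). The bindings
themselves should be preferred for real use.

```python
import unicodedata


def approx_display_width(text: str) -> int:
    """Approximate the number of terminal columns needed to show text.

    Hypothetical helper for illustration only; a simplification of UAX #11.
    """
    width = 0
    for ch in text:
        # Combining marks and format characters occupy no column of their own.
        if unicodedata.combining(ch) or unicodedata.category(ch) in ("Mn", "Me", "Cf"):
            continue
        # East Asian Wide (W) and Fullwidth (F) characters occupy two columns.
        if unicodedata.east_asian_width(ch) in ("W", "F"):
            width += 2
        else:
            width += 1
    return width


ascii_width = approx_display_width("abc")
cjk_width = approx_display_width("日本語")
combining_width = approx_display_width("e\u0301")
```

With these rules, plain ASCII counts one column per character, each CJK
ideograph counts two, and a base letter followed by a combining accent counts
one.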