Segments a string.
Ill-formed input is treated as if errors had been replaced with REPLACEMENT CHARACTERs according to the WHATWG Encoding Standard.
See the Rust documentation for segment_utf16 for more information.
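The ill-formed input rule above can be illustrated with a minimal sketch. It assumes "ill-formed" means unpaired surrogate code units in a UTF-16 string, each of which is treated as U+FFFD. The helper name `replaceLoneSurrogates` is hypothetical and is not part of ICU4X; the segmenter applies this handling internally, so callers do not need to pre-process their input.

```typescript
// Hypothetical helper, not part of ICU4X: returns a copy of `input` in which
// every unpaired surrogate code unit is replaced by U+FFFD REPLACEMENT CHARACTER.
function replaceLoneSurrogates(input: string): string {
  let out = "";
  for (let i = 0; i < input.length; i++) {
    const unit = input.charCodeAt(i);
    const isHigh = unit >= 0xd800 && unit <= 0xdbff;
    const isLow = unit >= 0xdc00 && unit <= 0xdfff;
    if (isHigh && i + 1 < input.length) {
      const next = input.charCodeAt(i + 1);
      if (next >= 0xdc00 && next <= 0xdfff) {
        // Well-formed surrogate pair: copy both code units unchanged.
        out += input.charAt(i) + input.charAt(i + 1);
        i++;
        continue;
      }
    }
    // Any surrogate reaching this point is unpaired and gets replaced.
    out += isHigh || isLow ? "\uFFFD" : input.charAt(i);
  }
  return out;
}
```

Under this assumption, a string with a stray surrogate is segmented as if the stray code unit were U+FFFD, while well-formed surrogate pairs such as emoji are left untouched.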
Static create: Construct a [SentenceSegmenter] for content known to be of a given locale, using compiled data.
Static create: Construct a [SentenceSegmenter] for content known to be of a given locale, using a particular data source.
An ICU4X sentence-break segmenter, capable of finding sentence breakpoints in strings.
See the Rust documentation for SentenceSegmenter for more information.