ICU4X
International Components for Unicode
Loading...
Searching...
No Matches
ICU4XSentenceSegmenter Class Reference

#include <ICU4XSentenceSegmenter.hpp>

Public Member Functions

ICU4XSentenceBreakIteratorUtf8 segment_utf8 (const std::string_view input) const
 
ICU4XSentenceBreakIteratorUtf16 segment_utf16 (const std::u16string_view input) const
 
ICU4XSentenceBreakIteratorLatin1 segment_latin1 (const diplomat::span< const uint8_t > input) const
 
 ICU4XSentenceSegmenter (capi::ICU4XSentenceSegmenter *i)
 
 ICU4XSentenceSegmenter ()=default
 
 ICU4XSentenceSegmenter (ICU4XSentenceSegmenter &&) noexcept=default
 
ICU4XSentenceSegmenteroperator= (ICU4XSentenceSegmenter &&other) noexcept=default
 

Static Public Member Functions

static diplomat::result< ICU4XSentenceSegmenter, ICU4XErrorcreate (const ICU4XDataProvider &provider)
 

Detailed Description

An ICU4X sentence-break segmenter, capable of finding sentence breakpoints in strings.

See the Rust documentation for SentenceSegmenter for more information.

Constructor & Destructor Documentation

◆ ICU4XSentenceSegmenter() [1/3]

ICU4XSentenceSegmenter::ICU4XSentenceSegmenter ( capi::ICU4XSentenceSegmenter * i)
inlineexplicit

◆ ICU4XSentenceSegmenter() [2/3]

ICU4XSentenceSegmenter::ICU4XSentenceSegmenter ( )
default

◆ ICU4XSentenceSegmenter() [3/3]

ICU4XSentenceSegmenter::ICU4XSentenceSegmenter ( ICU4XSentenceSegmenter && )
defaultnoexcept

Member Function Documentation

◆ create()

diplomat::result< ICU4XSentenceSegmenter, ICU4XError > ICU4XSentenceSegmenter::create ( const ICU4XDataProvider & provider)
inlinestatic

Construct an [ICU4XSentenceSegmenter].

See the Rust documentation for new for more information.

◆ operator=()

ICU4XSentenceSegmenter & ICU4XSentenceSegmenter::operator= ( ICU4XSentenceSegmenter && other)
defaultnoexcept

◆ segment_latin1()

ICU4XSentenceBreakIteratorLatin1 ICU4XSentenceSegmenter::segment_latin1 ( const diplomat::span< const uint8_t > input) const
inline

Segments a Latin-1 string.

See the Rust documentation for segment_latin1 for more information.

Lifetimes: this, input must live at least as long as the output.

◆ segment_utf16()

ICU4XSentenceBreakIteratorUtf16 ICU4XSentenceSegmenter::segment_utf16 ( const std::u16string_view input) const
inline

Segments a string.

Ill-formed input is treated as if errors had been replaced with REPLACEMENT CHARACTERs according to the WHATWG Encoding Standard.

See the Rust documentation for segment_utf16 for more information.

Lifetimes: this, input must live at least as long as the output.

◆ segment_utf8()

ICU4XSentenceBreakIteratorUtf8 ICU4XSentenceSegmenter::segment_utf8 ( const std::string_view input) const
inline

Segments a string.

Ill-formed input is treated as if errors had been replaced with REPLACEMENT CHARACTERs according to the WHATWG Encoding Standard.

See the Rust documentation for segment_utf8 for more information.

Lifetimes: this, input must live at least as long as the output.


The documentation for this class was generated from the following file: