Class WordSegmenter

An ICU4X word-break segmenter, capable of finding word breakpoints in strings.

See the Rust documentation for WordSegmenter for more information.

Index

Constructors

Accessors

Methods

segment createAuto createAutoWithContentLocale createAutoWithContentLocaleAndProvider createDictionary createDictionaryWithContentLocale createDictionaryWithContentLocaleAndProvider createLstm createLstmWithContentLocale createLstmWithContentLocaleAndProvider

Constructors

constructor

new WordSegmenter(): WordSegmenter
Returns WordSegmenter

Accessors

ffiValue

get ffiValue(): number
Returns number
- Defined in WordSegmenter.d.ts:17

Methods

segment

segment(input: string): WordBreakIteratorUtf16
Segments a string.

Ill-formed input is treated as if errors had been replaced with REPLACEMENT CHARACTERs according to the WHATWG Encoding Standard.

See the Rust documentation for segment_utf16 for more information.
Parameters
- input: string
Returns WordBreakIteratorUtf16
- Defined in WordSegmenter.d.ts:127

`Static`createAuto

createAuto(): WordSegmenter
Construct an [WordSegmenter] with automatically selecting the best available LSTM or dictionary payload data, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_auto for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:29

`Static`createAutoWithContentLocale

createAutoWithContentLocale(locale: Locale): WordSegmenter
Construct an [WordSegmenter] with automatically selecting the best available LSTM or dictionary payload data, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_auto for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:40

`Static`createAutoWithContentLocaleAndProvider

createAutoWithContentLocaleAndProvider(
provider: DataProvider,
locale: Locale,
): WordSegmenter
Construct an [WordSegmenter] with automatically selecting the best available LSTM or dictionary payload data, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_auto for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:51

`Static`createDictionary

createDictionary(): WordSegmenter
Construct an [WordSegmenter] with with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_dictionary for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:95

`Static`createDictionaryWithContentLocale

createDictionaryWithContentLocale(locale: Locale): WordSegmenter
Construct an [WordSegmenter] with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_dictionary for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:106

`Static`createDictionaryWithContentLocaleAndProvider

createDictionaryWithContentLocaleAndProvider(
provider: DataProvider,
locale: Locale,
): WordSegmenter
Construct an [WordSegmenter] with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_dictionary for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:117

`Static`createLstm

createLstm(): WordSegmenter
Construct an [WordSegmenter] with LSTM payload data for Burmese, Khmer, Lao, and Thai, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_lstm for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:62

`Static`createLstmWithContentLocale

createLstmWithContentLocale(locale: Locale): WordSegmenter
Construct an [WordSegmenter] with LSTM payload data for Burmese, Khmer, Lao, and Thai, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_lstm for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:73

`Static`createLstmWithContentLocaleAndProvider

createLstmWithContentLocaleAndProvider(
provider: DataProvider,
locale: Locale,
): WordSegmenter
Construct an [WordSegmenter] with LSTM payload data for Burmese, Khmer, Lao, and Thai, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_lstm for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:84

Class WordSegmenter

Index

Constructors

Accessors

Methods

Constructors

constructor

Returns WordSegmenter

Accessors

ffiValue

Returns number

Methods

segment

Parameters

Returns WordBreakIteratorUtf16

StaticcreateAuto

Returns WordSegmenter

StaticcreateAutoWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateAutoWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

StaticcreateDictionary

Returns WordSegmenter

StaticcreateDictionaryWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateDictionaryWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

StaticcreateLstm

Returns WordSegmenter

StaticcreateLstmWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateLstmWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

Settings

On This Page

`Static`createAuto

`Static`createAutoWithContentLocale

`Static`createAutoWithContentLocaleAndProvider

`Static`createDictionary

`Static`createDictionaryWithContentLocale

`Static`createDictionaryWithContentLocaleAndProvider

`Static`createLstm

`Static`createLstmWithContentLocale

`Static`createLstmWithContentLocaleAndProvider