Class WordSegmenter

An ICU4X word-break segmenter, capable of finding word breakpoints in strings.

See the Rust documentation for WordSegmenter for more information.

Index

Methods

createAuto createAutoWithContentLocale createAutoWithContentLocaleAndProvider createDictionary createDictionaryWithContentLocale createDictionaryWithContentLocaleAndProvider createLstm createLstmWithContentLocale createLstmWithContentLocaleAndProvider segment

Methods

`Static`createAuto

createAuto(): WordSegmenter
Construct an WordSegmenter with automatically selecting the best available LSTM or dictionary payload data, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_auto for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:31

`Static`createAutoWithContentLocale

createAutoWithContentLocale(locale): WordSegmenter
Construct an WordSegmenter with automatically selecting the best available LSTM or dictionary payload data, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_auto for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:42

`Static`createAutoWithContentLocaleAndProvider

createAutoWithContentLocaleAndProvider(provider, locale): WordSegmenter
Construct an WordSegmenter with automatically selecting the best available LSTM or dictionary payload data, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_auto for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:53

`Static`createDictionary

createDictionary(): WordSegmenter
Construct an WordSegmenter with with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_dictionary for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:97

`Static`createDictionaryWithContentLocale

createDictionaryWithContentLocale(locale): WordSegmenter
Construct an WordSegmenter with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_dictionary for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:108

`Static`createDictionaryWithContentLocaleAndProvider

createDictionaryWithContentLocaleAndProvider(provider, locale): WordSegmenter
Construct an WordSegmenter with dictionary payload data for Chinese, Japanese, Burmese, Khmer, Lao, and Thai, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and dictionary for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_dictionary for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:119

`Static`createLstm

createLstm(): WordSegmenter
Construct an WordSegmenter with LSTM payload data for Burmese, Khmer, Lao, and Thai, using compiled data. This does not assume any content locale.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for new_lstm for more information.

Returns WordSegmenter
- Defined in WordSegmenter.d.ts:64

`Static`createLstmWithContentLocale

createLstmWithContentLocale(locale): WordSegmenter
Construct an WordSegmenter with LSTM payload data for Burmese, Khmer, Lao, and Thai, using compiled data.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_lstm for more information.
Parameters
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:75

`Static`createLstmWithContentLocaleAndProvider

createLstmWithContentLocaleAndProvider(provider, locale): WordSegmenter
Construct an WordSegmenter with LSTM payload data for Burmese, Khmer, Lao, and Thai, using a particular data source.

Note: currently, it uses dictionary for Chinese and Japanese, and LSTM for Burmese, Khmer, Lao, and Thai.

See the Rust documentation for try_new_lstm for more information.
Parameters
- provider: DataProvider
- locale: Locale
Returns WordSegmenter
- Defined in WordSegmenter.d.ts:86

segment

segment(input): WordBreakIteratorUtf16
Segments a string.

Ill-formed input is treated as if errors had been replaced with REPLACEMENT CHARACTERs according to the WHATWG Encoding Standard.

See the Rust documentation for segment_utf16 for more information.
Parameters
- input: string
Returns WordBreakIteratorUtf16
- Defined in WordSegmenter.d.ts:129

Class WordSegmenter

Index

Methods

Methods

`Static`createAuto

Returns WordSegmenter

`Static`createAutoWithContentLocale

Parameters

Returns WordSegmenter

`Static`createAutoWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

`Static`createDictionary

Returns WordSegmenter

`Static`createDictionaryWithContentLocale

Parameters

Returns WordSegmenter

`Static`createDictionaryWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

`Static`createLstm

Returns WordSegmenter

`Static`createLstmWithContentLocale

Parameters

Returns WordSegmenter

`Static`createLstmWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

segment

Parameters

Returns WordBreakIteratorUtf16

Settings

On This Page

Class WordSegmenter

Index

Methods

Methods

StaticcreateAuto

Returns WordSegmenter

StaticcreateAutoWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateAutoWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

StaticcreateDictionary

Returns WordSegmenter

StaticcreateDictionaryWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateDictionaryWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

StaticcreateLstm

Returns WordSegmenter

StaticcreateLstmWithContentLocale

Parameters

Returns WordSegmenter

StaticcreateLstmWithContentLocaleAndProvider

Parameters

Returns WordSegmenter

segment

Parameters

Returns WordBreakIteratorUtf16

Settings

On This Page

`Static`createAuto

`Static`createAutoWithContentLocale

`Static`createAutoWithContentLocaleAndProvider

`Static`createDictionary

`Static`createDictionaryWithContentLocale

`Static`createDictionaryWithContentLocaleAndProvider

`Static`createLstm

`Static`createLstmWithContentLocale

`Static`createLstmWithContentLocaleAndProvider