Using Wordbreakers with Product Catalogs

The Product Catalog System uses a neutral-language wordbreaker for searching language-insensitive content and a language-specific wordbreaker for each language in the catalog. You can change these wordbreakers to suit your needs.

To specify the wordbreaker to be used for a language, use the UpdateLanguageInfo.vbs script in the Program Files\Microsoft Commerce Server 2002\Support directory.

SQL Server 2000 supports the following wordbreakers:

Wordbreaker Locale ID
Neutral 0
Chinese_Simplified 0x0804
Chinese_Traditional 0x0404
Dutch 0x0413
English_UK 0x0809
English_US 0x0409
French 0x040c
German 0x0407
Italian 0x0410
Japanese 0x0411
Korean 0x0412
Spanish_Modern 0x0c0a
Swedish_Default 0x041d

By default, Commerce Server 2002 is configured to use the following wordbreakers for the following languages.

Language code Wordbreaker
Neutral Neutral
Chs, zh-CN Chinese simplified
cht, zh-TW Chinese traditional
Deu, de German
Eng, en-GB English_UK
Enu, en-US English_US
Ens, es Spanish
Fra, fr French
Ita, it Italian
Jpn, ja Japanese
Kor, ko Korean
nld, nl Dutch
Sve, sv Swedish

If you want to change this default mapping or associate a wordbreaker with a language, modify the languages.xml file in the Program Files\Microsoft Commerce Server 2002 folder as follows:

<?xml version = '1.0' encoding='utf-8' ?>
<Languages>
<Language culture_name="en-ZW"    display_name="English - Zimbabwe" culture_code="0x3009" iso639x="ENW" collation="" word_breaker="0x0409"/>
</Languages>

This will use the English wordbreaker for the culture en-ZW.

Run the following script: UpdateLanguageInfo <path of the languages.xml file>.

By default, the language-neutral data in the catalog uses the Neutral language wordbreaker. To use a different wordbreaker for the language-neutral data, add the following entry in languages.xml:

<Languages>
<Language culture_name="Neutral"    display_name="Neutral" culture_code="" iso639x="Neutral" collation="" word_breaker="0x0411"/>
</Languages>

This will use the Japanese wordbreaker for searching the language-neutral data and can be helpful on a Japanese installation.

Run the following script: UpdateLanguageInfo <path of the languages.xml file>.

Copyright © 2005 Microsoft Corporation.
All rights reserved.