WoodWing Help Center

Setting up words to ignore when using Solr in Enterprise Server 9

The default installation of Solr contains a list of very common words that are ignored when they appear in a search.

Example: Such words include 'a', 'the', 'on', 'at', 'can', and 'go'.

Including these words in a search would return files that may not be related to what the user is actually searching for.
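
The effect of ignoring such words can be illustrated with a small Python sketch. This is a simplified illustration only and not how Solr is implemented: the function name remove_stop_words is hypothetical, and the word set contains just the examples named above rather than the list shipped with Solr.

```python
# Illustrative stop words only; the list shipped with Solr is longer.
STOP_WORDS = {"a", "the", "on", "at", "can", "go"}


def remove_stop_words(query: str) -> list[str]:
    """Return the query terms that are not stop words (case-insensitive)."""
    return [term for term in query.split() if term.lower() not in STOP_WORDS]


print(remove_stop_words("the cat sat on a mat"))  # ['cat', 'sat', 'mat']
```

Only the terms that carry meaning remain, so the search is not diluted by matches on words that occur in almost every file.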

The stop words list

This list is referred to as the 'stop words' list. Because the list that is shipped with Solr is not very comprehensive and is for English only, Enterprise Server provides alternative lists. These lists are more comprehensive and available in the following languages:

  • American English
  • Brazilian Portuguese
  • Czech
  • Dutch
  • French
  • German
  • Italian
  • Japanese
  • Polish
  • Russian
  • Spanish
  • Traditional Chinese

Installation

Step 1. Unpack the file SolrEnterprise_9.x.x_Buildyz.zip that is delivered together with the Enterprise Server installation.

Step 2. Copy the stopwords.txt file for the required language from

Solr/languages/<language code>

to

Solr/conf

where <language code> is one of the following:

Language code    Language
csCZ             Czech
deDE             German
enUS             American English
frFR             French
itIT             Italian
jaJP             Japanese
nlNL             Dutch
plPL             Polish
ptBR             Brazilian Portuguese
ruRU             Russian
zhTW             Traditional Chinese
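
The copy in Step 2 can also be scripted. The following is a minimal sketch, assuming the package was unpacked so that the Solr/languages and Solr/conf folders sit under one directory. The function name install_stop_words and the path in the usage comment are hypothetical and not part of Enterprise Server.

```python
import shutil
from pathlib import Path


def install_stop_words(solr_dir: Path, language_code: str) -> Path:
    """Copy languages/<language code>/stopwords.txt into conf and return the new path."""
    source = solr_dir / "languages" / language_code / "stopwords.txt"
    if not source.is_file():
        raise FileNotFoundError(f"No stop words list found at {source}")
    # Replaces any stopwords.txt that is already present in Solr/conf.
    return Path(shutil.copy2(source, solr_dir / "conf" / "stopwords.txt"))


# Hypothetical usage; adjust the path to where the package was unpacked:
# install_stop_words(Path("/path/to/Solr"), "nlNL")
```

Because the copy overwrites the existing stopwords.txt file, it may be worth keeping a backup of the original before running it.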

Step 3. (Optional) Edit the copied stopwords.txt file in a plain-text editor that supports UTF-8, for example to add or remove words.
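
As an alternative to a text editor, words can be added with a short script, as long as the file is read and written as UTF-8. The sketch below assumes the usual layout of one word per line with comment lines starting with '#'. Check this against the actual file first. The function name add_stop_words is hypothetical.

```python
from pathlib import Path


def add_stop_words(stopwords_file: Path, new_words: list[str]) -> list[str]:
    """Append words that are not yet listed and return the words that were added."""
    # Read explicitly as UTF-8 so non-ASCII words are preserved.
    lines = stopwords_file.read_text(encoding="utf-8").splitlines()
    # Assumption: one word per line, comment lines start with '#'.
    existing = {line.strip() for line in lines if line.strip() and not line.startswith("#")}
    # dict.fromkeys removes duplicates while keeping the given order.
    added = [word for word in dict.fromkeys(new_words) if word not in existing]
    if added:
        stopwords_file.write_text("\n".join(lines + added) + "\n", encoding="utf-8")
    return added


# Hypothetical usage:
# add_stop_words(Path("/path/to/Solr/conf/stopwords.txt"), ["hierbij", "aldus"])
```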
