The default installation of Solr contains a list of very commonly used words that are ignored when users use them in a search.
Example: Such words include: 'a', 'the', 'on', 'at', 'can', 'go', and so on.
Including these words in a search would result in files that may not be related to what the user is actually searching for.
The stop words list
This list is referred to as the 'stop words' list. Because the list that is shipped with Solr is not very comprehensive and is for English only, Enterprise Server provides alternative lists. These lists are more comprehensive and available in the following languages:
- American English
- Brazilian Portuguese
- Czech
- Dutch
- French
- German
- Italian
- Japanese
- Polish
- Russian
- Spanish
- Traditional Chinese
Installation
Step 1. Unpack the file SolrEnterprise_9.x.x_Buildyz.zip that is delivered together with the Enterprise Server installation.
Step 2. Copy one of the stopwords.txt file from
Solr/languages/<language code>
to
Solr/conf
where <language code> is one of the following:
Language code | Language |
---|---|
csCZ | Czech |
deDE | German |
enUS | American English |
frFR | French |
itIT | Italian |
jaJP | Japanese |
nlNL | Dutch |
plPL | Polish |
ptBR | Brazilian Portuguese |
ruRU | Russian |
zhTW | Traditional Chinese |
Step 3. (Optional) Edit the file in a plain-text editor that supports UTF-8.
Related Information
Related Tasks
Enabling sorting on Placed On columns in Enterprise Server 9 with Solr installed
Adding custom Enterprise metadata properties to the Solr search
Changing the time-out period for Solr 4.5
Setting up synonyms to include when using Solr in Enterprise Server 9
Integrating Solr 4.5 in Enterprise Server 9
Reference Materials
Comment
Do you have corrections or additional information about this article? Leave a comment! Do you have a question about what is described in this article? Please contact Support.
0 comments
Please sign in to leave a comment.