Indexing Japanese Content for DITA WebHelp Pages

To optimize the indexing of Japanese content in WebHelp pages generated from DITA map transformations, the Kuromoji analyzer can be used. This analyzer in not included in the Oxygen XML Editor installation kit and must be downloaded and added.

To use the Kuromoji analyzer to index Japanese content in your WebHelp system, follow these steps:
  1. Download the analyzer jar file from http://mvnrepository.com/artifact/org.apache.lucene/lucene-analyzers-kuromoji/4.0.0.
  2. Place the Kuromoji analyzer jar file in the following directory: DITA_OT_DIR/plugins/com.oxygenxml.webhelp/lib.
  3. For the analyzer to work properly, search terms that are entered into your WebHelp pages must be separated by spaces.

Optionally a Japanese user dictionary can be set with the webhelp.search.japanese.dictionary parameter.