Class DictionaryCompoundWordTokenFilterFactory

java.lang.Object
org.apache.lucene.analysis.AbstractAnalysisFactory
org.apache.lucene.analysis.TokenFilterFactory
org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
All Implemented Interfaces:
ResourceLoaderAware

public class DictionaryCompoundWordTokenFilterFactory extends TokenFilterFactory implements ResourceLoaderAware
Factory for DictionaryCompoundWordTokenFilter.
 <fieldType name="text_dictcomp" class="solr.TextField" positionIncrementGap="100">
   <analyzer>
     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
     <filter class="solr.DictionaryCompoundWordTokenFilterFactory" dictionary="dictionary.txt"
         minWordSize="5" minSubwordSize="2" maxSubwordSize="15" onlyLongestMatch="true"/>
   </analyzer>
 </fieldType>
Since:
3.1
  • Field Details

    • NAME

      public static final String NAME
      SPI name
      See Also:
    • dictionary

      private CharArraySet dictionary
    • dictFile

      private final String dictFile
    • minWordSize

      private final int minWordSize
    • minSubwordSize

      private final int minSubwordSize
    • maxSubwordSize

      private final int maxSubwordSize
    • onlyLongestMatch

      private final boolean onlyLongestMatch
  • Constructor Details

    • DictionaryCompoundWordTokenFilterFactory

      public DictionaryCompoundWordTokenFilterFactory(Map<String,String> args)
      Creates a new DictionaryCompoundWordTokenFilterFactory
    • DictionaryCompoundWordTokenFilterFactory

      public DictionaryCompoundWordTokenFilterFactory()
      Default ctor for compatibility with SPI
  • Method Details