Why doesn't Solr's RemoveDuplicatesTokenFilterFactory work?

China☆狼群, posted on 2019-12-07 11:56:41

Question


My schema.xml splits the product name into tokens and then uses RemoveDuplicatesTokenFilterFactory to remove the duplicated words left after the split.

    <fieldType name="type_name" class="solr.TextField">
        <analyzer type="index">
            <tokenizer class="solr.PatternTokenizerFactory" pattern="\|| " />
            <filter class="solr.LowerCaseFilterFactory"/>
            <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
        </analyzer>
    </fieldType>

But in the query analyzer I see that RemoveDuplicatesTokenFilterFactory did absolutely nothing to the duplicated words. Why?


Answer 1:


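According to the Solr wiki, RemoveDuplicatesTokenFilterFactory only removes duplicate tokens that sit at the same position. That is not the case here: every token produced by the split gets its own position, so the filter finds nothing to remove.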
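
To make the "same position" rule concrete, here is a rough Python model of the analyzer chain from the question. It is only a simulation and not Solr or Lucene code: the function names (`tokenize`, `lowercase`, `remove_duplicates_same_position`) are made up for illustration, and tokens are represented as hypothetical `(term, position_increment)` pairs, where an increment of 0 means "stacked on the previous token's position".

```python
import re

def tokenize(text, pattern=r"\|| "):
    """Split on the pattern from the question; every token advances the position by 1.
    Assumption of this sketch: empty strings between adjacent delimiters are dropped."""
    return [(term, 1) for term in re.split(pattern, text) if term]

def lowercase(tokens):
    """Lowercase each term and leave its position increment untouched."""
    return [(term.lower(), inc) for term, inc in tokens]

def remove_duplicates_same_position(tokens):
    """Drop a token only when the same term was already seen at the SAME position."""
    out = []
    seen_at_position = set()
    for term, inc in tokens:
        if inc > 0:
            # A positive increment starts a new position, so the memory is reset.
            seen_at_position = set()
        if term in seen_at_position:
            continue  # duplicate stacked on the same position: removed
        seen_at_position.add(term)
        out.append((term, inc))
    return out

# Case from the question: each split token has its own position, nothing is removed.
split_tokens = lowercase(tokenize("Red Shoes|red|SHOES"))
print(remove_duplicates_same_position(split_tokens))

# Hypothetical stacked tokens (increment 0), e.g. after a synonym or stemming filter:
# only here does the duplicate "shoes" disappear.
stacked = [("shoes", 1), ("shoes", 0), ("shoe", 0), ("red", 1)]
print(remove_duplicates_same_position(stacked))
```

In this model the first call returns all four tokens unchanged, because the repeated "red" and "shoes" are at different positions. The second call drops only the extra "shoes" that shares a position with the first one, which is the situation the filter is meant for.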



Source: https://stackoverflow.com/questions/10329470/why-solr-removeduplicatestokenfilterfactory-dont-work
