Phonetic filter factory for Hindi

Posted by a 夏天 on 2019-12-12 05:51:44

Question


I am working with Apache Solr and trying to use a phonetic filter factory. I have tried all the encoders available with solr.PhoneticFilterFactory, but none of them supports Indian languages. Is there any other filter or method that would give me a phonetic representation for Indian languages such as Hindi, Tamil, and Bengali?

If not, how can the existing filters be modified to support these languages?


Answer 1:


Have you tried the new Beider Morse Filter Factory, which was just added in version 3.6 and is (alas) not yet well-documented?

https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.BeiderMorseFilterFactory

It was developed for phonetic searching of Central and Eastern European surnames, but maybe it would work for other languages too. I have personally found that it works much better than Soundex or the other older soundalike methods.
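For reference, a field type using this filter might be declared in the schema roughly as below. This is only a sketch: the field type name text_phonetic_bm and the choice of tokenizer are assumptions for illustration, and the attribute values shown are just one plausible starting point. Whether the resulting codes are useful for Hindi or other Indian languages is untested here.

```xml
<!-- Hypothetical field type; the name "text_phonetic_bm" is an assumption. -->
<fieldType name="text_phonetic_bm" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split the input into word tokens before phonetic encoding. -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- GENERIC name type with approximate matching; languageSet="auto"
         lets the filter guess the language of each token. -->
    <filter class="solr.BeiderMorseFilterFactory"
            nameType="GENERIC" ruleType="APPROX"
            concat="true" languageSet="auto"/>
  </analyzer>
</fieldType>
```

Running a few sample terms through the Analysis screen of the Solr admin UI is a quick way to see what codes, if any, the filter emits for a given script.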



Source: https://stackoverflow.com/questions/13023424/phonetic-filter-factory-for-hindi
