In a Cassandra SASI custom index, I need to change the analyzer class from StandardAnalyzer to NonTokenizingAnalyzer. Does this have any impact?

放肆的年华 posted on 2019-12-13 03:56:10

Question


Which analyzer is costlier in terms of time and disk space, given the search criteria applied to the data?

Note: I'm using NonTokenizingAnalyzer for its case-sensitivity feature.


Answer 1:


analyzer_class: specifies the analyzer that will process the text in the indexed column.

  • The NonTokenizingAnalyzer is used for cases where the text is not analyzed, but case normalization or sensitivity is required.
  • The StandardAnalyzer is used for analysis that involves stemming, case normalization, case sensitivity, skipping common words like "and" and "the", and localization of the language used to complete the analysis.

So by moving from StandardAnalyzer to NonTokenizingAnalyzer you lose the ability to skip common words, apply stemming, use localization, and so on. Whether the switch makes sense depends on the queries you need to support.

In terms of disk space, StandardAnalyzer generally uses more because it does more processing on each value, but it also provides more functionality. So again, it depends on your use case.
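
To make the trade-off concrete, here is a toy Python sketch of the two analysis styles. It is not SASI's implementation: the function names, the stop-word list, and the sample value are all made up for illustration, and stemming and localization are left out. It only shows that a non-tokenizing analyzer treats the whole column value as a single term, while a tokenizing analyzer can produce several terms per value.

```python
import re

# Hypothetical, tiny stop-word list for illustration only;
# the real analyzer ships its own language-specific lists.
STOP_WORDS = {"and", "the", "a", "of"}


def non_tokenizing_terms(value, case_sensitive=True):
    """Model of a non-tokenizing analyzer: the whole value is one term."""
    return [value if case_sensitive else value.lower()]


def standard_like_terms(value, skip_stop_words=True):
    """Rough model of a tokenizing analyzer: lowercase word tokens,
    optionally with stop words removed (no stemming in this sketch)."""
    tokens = re.findall(r"\w+", value.lower())
    if skip_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    return tokens


value = "Quick Brown Fox and the Lazy Dog"
print(non_tokenizing_terms(value))         # one term, original case kept
print(non_tokenizing_terms(value, False))  # one term, lowercased
print(standard_like_terms(value))          # several lowercase word terms
```

In this toy model the tokenizing path yields five terms for the sample value while the non-tokenizing path yields one. Actual index size and query cost in Cassandra also depend on the index mode, the data, and the analyzer options, so treat this as a way to reason about the difference rather than as a measurement.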



Source: https://stackoverflow.com/questions/48828354/in-cassandra-sasi-custom-index-need-to-change-analyzer-class-from-standardanaly
