Components not indexed in Sitecore Lucene search indexes

Submitted by 狂风中的少年 on 2019-12-08 02:47:51

Question


I have configured a Lucene search index in the configuration and tested it with the lukeall tool. It searches all fields of the defined templates. However, the content on the pages comes from separate, external components, and that content is not searchable; only the data in the page's own fields is. Is there a way to index something like the rendered HTML, so that all the data on a page gets indexed?

Thanks guys.


Answer 1:


It's a common requirement.

This screencast outlines an approach where the crawler loops through each of the page's components (at about 38 minutes in).

http://www.techphoria414.com/Blog/2012/May/Sitecore_Page_Editor_Unleashed

The above example uses the old Advanced Database Crawler, but the principle is sound.
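
To illustrate the principle only, here is a minimal sketch in plain Python rather than the Sitecore API. The `ITEMS` dictionary, its keys, and `build_page_content` are hypothetical stand-ins: a real crawler would read the page's layout details and resolve each component's data source item through Sitecore.

```python
# Hypothetical in-memory model of a content tree (NOT the Sitecore API).
# Each page lists the data source items used by its components.
ITEMS = {
    "page-1": {
        "fields": {"Title": "About us"},
        "datasources": ["promo-1", "text-1"],
    },
    "promo-1": {"fields": {"Heading": "Spring sale", "Body": "20% off"}},
    "text-1": {"fields": {"Text": "We build websites."}},
}


def build_page_content(page_id, items):
    """Combine the page's own fields with the fields of every component data source."""
    page = items[page_id]
    parts = list(page["fields"].values())
    for datasource_id in page.get("datasources", []):
        datasource = items.get(datasource_id)
        if datasource is None:
            continue  # broken data source reference: nothing to index
        parts.extend(datasource["fields"].values())
    return " ".join(parts)


# The combined string is what would be stored in one searchable field for the page.
content = build_page_content("page-1", ITEMS)
```

The point of the design is that the page's index document ends up containing the component text, so a search hit on component content resolves to the page itself.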

Another common approach is to create a computed field in your index that makes the application request the page, so its HTML can be scraped.

https://github.com/hermanussen/sitecore-html-crawler
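
As a rough sketch of the scraping step only (not the code from the linked project), the snippet below reduces a page's HTML to its visible text using Python's standard `html.parser` module. The `TextExtractor` class, `html_to_text` function, and sample markup are made up for illustration; in a real setup the HTML would come from an HTTP request to the page and the result would be returned as the computed field's value.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, ignoring the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped element.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def html_to_text(html):
    """Return the visible text of an HTML document as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Sample markup standing in for a rendered page.
sample = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<h1>News</h1><p>Component text</p><script>var x=1;</script>"
    "</body></html>"
)
indexed_value = html_to_text(sample)
```

Because the text is taken from the rendered output, it includes whatever the components put on the page, regardless of where their data is stored.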

My preference is the second option because it's more accurate.




Answer 2:


Or, if you want your crawled content completely separated, you could go for https://github.com/efocus-nl/sitecorewebsearch

It also offers some extra options, like skipping parts of the page (e.g. the menu, footer, and header).
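
To show what skipping parts of a page can look like in general, here is a small Python sketch. It is not how sitecorewebsearch implements or configures the feature. It assumes the site chrome is marked up with the HTML5 `header`, `nav`, and `footer` elements; `SKIPPED_TAGS`, `MainContentExtractor`, and `extract_main_text` are hypothetical names.

```python
from html.parser import HTMLParser

# Assumption: menus, headers, and footers use these HTML5 elements.
SKIPPED_TAGS = {"header", "nav", "footer"}


class MainContentExtractor(HTMLParser):
    """Collect page text while ignoring everything inside SKIPPED_TAGS."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_main_text(html):
    """Return the text of the page without its header, menu, and footer."""
    parser = MainContentExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = (
    "<body><header>Logo<nav><a href='/'>Home</a></nav></header>"
    "<main><p>Article body</p></main>"
    "<footer>Copyright</footer></body>"
)
main_text = extract_main_text(page)
```

Leaving out the repeated chrome keeps every page from matching queries for words that only appear in the menu or footer.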



Source: https://stackoverflow.com/questions/30343211/components-not-indexed-in-sitecore-lucene-search-indexes
