google-search-appliance

How can I get MediaWiki to ignore page views from a Google Search Appliance?

两盒软妹~` submitted on 2020-01-14 20:41:52
Question: The page view counter on each MediaWiki page seems like a great way to identify popular pages that are worth putting more effort into keeping up to date and useful, but I've hit a problem. We use a Google Search Appliance to index our MediaWiki installation, and the GSA increments the page view counter each time it crawls a page. This completely dominates the statistics, swamping the views made by real users. I know how to reset the page counters to start again. But

Google Search Appliance index content from database

喜欢而已 submitted on 2020-01-05 06:37:29
Question: Instead of using the Google Search Appliance crawler to index content, I'm using a query against a SQL Server database. The query runs fine, but I assume that all these records now belong to the default collection (the complete index of the Google Search Appliance). Do you know how I could create a new collection containing only the data from the query? Thanks in advance. Best regards, Jose Answer 1: Hi, try adding the following: Add the following to your Crawl URLs Start Crawling from the Following URLs

GSA 7.4 default_frontend and suggest and user-added results

淺唱寂寞╮ submitted on 2019-12-25 08:50:13
Question: The team in charge of the GSA at my company seems to have accidentally overwritten our collections and frontends; we now have 4 of each. I am a new developer on the application that uses GSA search, and I have been told that the search functionality was working fine at the beginning of this year. What surprises me is the following: in the code we make this call to the GSA: http://go-yyy/suggest?token=as&access=p&client=yyy_frontend&output=xml_no_dtd&proxystylesheet=yyy
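When frontends may have been overwritten, a quick first check is which frontend the application is actually requesting. The sketch below only parses the query string of the URL quoted above, using the standard library; it does not contact the appliance. The helper name `suggest_params` is made up for illustration, and the truncated `proxystylesheet` value is deliberately left out.

```python
from urllib.parse import urlsplit, parse_qs


def suggest_params(url):
    """Return the query parameters of a URL as a flat dict (first value wins)."""
    query = urlsplit(url).query
    return {name: values[0] for name, values in parse_qs(query).items()}


# URL taken from the question, without the truncated proxystylesheet parameter.
url = "http://go-yyy/suggest?token=as&access=p&client=yyy_frontend&output=xml_no_dtd"
params = suggest_params(url)
print(params["client"])  # the frontend name the application asks for
```

Comparing the `client` value with the frontend names that currently exist on the appliance shows whether the code still points at a frontend that is there.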

GSA OneBox - Internal Provider will account for the overall QPS

雨燕双飞 submitted on 2019-12-13 21:13:00
Question: I think the title is self-explanatory. Suppose I create a GSA OneBox that is powered by an internal collection, and then make a GSA search request that triggers the OneBox. Will the number of search requests in that second be 1 or 2? All I'm trying to figure out is whether a OneBox with an internal provider counts toward the appliance's QPS. Answer 1: It will be two while the OneBox search is executing. Source: https://stackoverflow.com/questions/32641517/gsa-onebox-internal-provider-will

Result Grouping in GSA - Aggregating results based on a metadata or collection

妖精的绣舞 submitted on 2019-12-11 21:21:34
Question: This is a tricky question to explain, but let me attempt it. Today I have 8 different portlets on my page, and when the global search box is triggered I fire 8 GSA queries to show each result set in its respective portlet. The only difference between the GSA calls is the collection name. So far I have not managed to think of a solution that replaces the 8 different GSA calls with one call. Each collection has at least 100k documents and the max result a GSA search response can give is only
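For reference, the request shape described above can be sketched as follows. This is a minimal illustration, not a solution to the grouping problem: the host name, collection names, and the helper `build_search_url` are hypothetical, and the code assumes the `site` parameter accepts several collections joined with `|` (OR). A combined request is still subject to the per-response result limit the question mentions, and it does not by itself group results per collection.

```python
from urllib.parse import urlencode


def build_search_url(host, query, collections, frontend="default_frontend"):
    """Build a GSA search URL restricted to one or more collections."""
    params = {
        "q": query,
        "site": "|".join(collections),  # assumption: "|" means OR across collections
        "client": frontend,
        "output": "xml_no_dtd",
    }
    return "http://" + host + "/search?" + urlencode(params)


collections = ["col_a", "col_b", "col_c"]  # hypothetical collection names

# Current approach from the question: one request per collection.
per_collection = [build_search_url("gsa.example.com", "report", [c]) for c in collections]

# Single request across all collections.
combined = build_search_url("gsa.example.com", "report", collections)
print(combined)
```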

Search in default_collection minus a specific collection

蹲街弑〆低调 submitted on 2019-12-11 07:24:06
Question: In our GSA index of 500K documents, half of the documents come from an internal bug tracking system. We have been hearing some power users complain that results from the bug tracking system push down other useful results from many other sources. We discussed using result biasing to lower the importance of bug tracking documents, but I am not very keen on this approach, as I believe we should let the GSA do its magic and decide on the relevancy of the results. Instead what I want to

BadStatusLine exception raised when returning reply from server in Python 3

情到浓时终转凉″ submitted on 2019-11-29 07:21:23
I am trying to port a script to Python 3 that submits XML feeds, found here: https://developers.google.com/search-appliance/documentation/files/pushfeed_client.py.txt After running 2to3.py and making a few minor adjustments to remove any syntax errors, the script fails with this: (py33dev) d:\dev\workspace>python pushfeed_client.py --datasource="TEST1" --feedtype="full" --url="http://gsa:19900/xmlfeed" --xmlfilename="test.xml" Traceback (most recent call last): File "pushfeed_client.py", line 108, in <module> main(sys.argv) File "pushfeed_client.py", line 56, in main result = urllib.request
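The excerpt is cut off before the cause is shown, so the following is only a sketch of one thing worth checking in a Python 3 port of a feed client: the multipart/form-data body has to be assembled as `bytes`, not `str`. The function name `build_feed_request` is invented here, and the field names `datasource`, `feedtype`, and `data` are assumed from the command-line options above. Whether this addresses the `BadStatusLine` in the question is not confirmed by the source.

```python
import urllib.request
import uuid


def build_feed_request(url, datasource, feedtype, xml_bytes):
    """Build a multipart/form-data POST request whose body is bytes."""
    boundary = uuid.uuid4().hex
    delimiter = b"--" + boundary.encode("ascii")
    fields = (
        ("datasource", datasource.encode("utf-8")),
        ("feedtype", feedtype.encode("utf-8")),
        ("data", xml_bytes),
    )
    parts = []
    for name, value in fields:
        parts.append(delimiter)
        parts.append(b'Content-Disposition: form-data; name="' + name.encode("ascii") + b'"')
        parts.append(b"")  # blank line separates part headers from the value
        parts.append(value)
    parts.append(delimiter + b"--")  # closing delimiter
    parts.append(b"")
    body = b"\r\n".join(parts)

    request = urllib.request.Request(url, data=body, method="POST")
    request.add_header("Content-Type", "multipart/form-data; boundary=" + boundary)
    request.add_header("Content-Length", str(len(body)))
    return request


# Build (but do not send) a request using the values from the command line above.
req = build_feed_request("http://gsa:19900/xmlfeed", "TEST1", "full", b"<gsafeed/>")
print(type(req.data).__name__, req.get_method())
```

Sending would then be `urllib.request.urlopen(req)`; reading the XML file in binary mode (`open(path, "rb")`) keeps the payload as bytes end to end.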

BadStatusLine exception raised when returning reply from server in Python 3

徘徊边缘 submitted on 2019-11-28 01:07:37
Question: I am trying to port a script to Python 3 that submits XML feeds, found here: https://developers.google.com/search-appliance/documentation/files/pushfeed_client.py.txt After running 2to3.py and making a few minor adjustments to remove any syntax errors, the script fails with this: (py33dev) d:\dev\workspace>python pushfeed_client.py --datasource="TEST1" --feedtype="full" --url="http://gsa:19900/xmlfeed" --xmlfilename="test.xml" Traceback (most recent call last): File "pushfeed_client.py", line