search-engine | 易学教程

Recommend a linux-based Site Search Engine? [closed]

Question: As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 7 years ago. I need a site search engine to provide search for my members-only content. I've previously used Fluid Dynamics Search Engine but was

how can i make google cache delete old webpages and start to index new ones

Question: Hello, I have a problem with Google caching my old content URLs after I created a new website. I have an old website whose pages are now dead, and I have created a new website with new pages. When people search Google for the old content, the old URLs appear in the search results (because they were cached) instead of the new ones, which should appear but are not indexed yet. This is because the old content is already indexed by Google and the new pages are not

Fastest service for crawling web pages or invoking APIs (iTunes in particular)?

Question: We need to download metadata for all iOS apps on a daily basis. We plan on extracting the information by crawling the iTunes website and by using the iTunes search API. Since there are 700K+ apps, we need an efficient way to do this. One approach is to set up a bunch of scripts on EC2 and run them in parallel. Before we embark down this path, are there services like 80legs that people have used to accomplish a similar task? Essentially, we want something to help us crawl hundreds of thousands
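The "run scripts in parallel" approach from the question can be sketched in Python as a batched, thread-pooled fetch. This is only an illustration under stated assumptions: `fetch_batch` is a hypothetical callable supplied by the caller (it would wrap whatever HTTP request is actually used), and the batch size and worker count are arbitrary placeholders, not values from the question.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(items, size):
    """Yield consecutive slices of `items`, each holding at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def fetch_all(app_ids, fetch_batch, batch_size=100, workers=8):
    """Fetch metadata for every id by running `fetch_batch` on batches in parallel.

    `fetch_batch` is a hypothetical caller-supplied function that takes a list
    of ids and returns a list of metadata records for them.
    """
    batches = list(chunked(app_ids, batch_size))
    results = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input batches.
        for batch_result in pool.map(fetch_batch, batches):
            results.extend(batch_result)
    return results


def fake_fetch(batch):
    """Stand-in fetcher for demonstration; performs no network access."""
    return [{"id": app_id} for app_id in batch]


records = fetch_all(list(range(250)), fake_fetch, batch_size=100, workers=4)
```

Threads suit this kind of work because it is dominated by waiting on the network. A real fetcher would also need retries and rate limiting, which this sketch leaves out.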

php, mysql search for the website

Question: I need a search engine for a website I am building. I decided to try building my own using PHP and MySQL. Currently it looks like the viable option is to create three tables: one for words, one for pages, and one reference table. Then, when I insert a new article, I would scan the text, put the separate words in the words table, and reference those words in the third table. In the end, when a search is made, the script should return the pages with the most indexed occurrences of a given word. However
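The three-table layout described in the question is a basic inverted index. A minimal sketch follows, written in Python with the standard-library `sqlite3` module rather than the PHP and MySQL the asker uses. The table and column names (`pages`, `words`, `page_words`, `occurrences`) and the simple tokenizer are assumptions made for illustration, not part of the original question.

```python
import re
import sqlite3


def create_schema(conn):
    """Create the three tables: pages, words, and the reference table."""
    conn.executescript("""
        CREATE TABLE pages (id INTEGER PRIMARY KEY, title TEXT NOT NULL);
        CREATE TABLE words (id INTEGER PRIMARY KEY, word TEXT NOT NULL UNIQUE);
        CREATE TABLE page_words (
            page_id INTEGER NOT NULL REFERENCES pages(id),
            word_id INTEGER NOT NULL REFERENCES words(id),
            occurrences INTEGER NOT NULL,
            PRIMARY KEY (page_id, word_id)
        );
    """)


def index_page(conn, title, text):
    """Insert a page, then record how often each word occurs in its text."""
    page_id = conn.execute(
        "INSERT INTO pages (title) VALUES (?)", (title,)
    ).lastrowid
    counts = {}
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        counts[word] = counts.get(word, 0) + 1
    for word, n in counts.items():
        # Add the word only if it is not already in the words table.
        conn.execute("INSERT OR IGNORE INTO words (word) VALUES (?)", (word,))
        word_id = conn.execute(
            "SELECT id FROM words WHERE word = ?", (word,)
        ).fetchone()[0]
        conn.execute(
            "INSERT INTO page_words (page_id, word_id, occurrences) "
            "VALUES (?, ?, ?)",
            (page_id, word_id, n),
        )
    return page_id


def search(conn, term):
    """Return (title, occurrences) for pages containing `term`, best match first."""
    rows = conn.execute("""
        SELECT p.title, pw.occurrences
        FROM page_words pw
        JOIN words w ON w.id = pw.word_id
        JOIN pages p ON p.id = pw.page_id
        WHERE w.word = ?
        ORDER BY pw.occurrences DESC, p.id ASC
    """, (term.lower(),))
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
create_schema(conn)
index_page(conn, "Article A", "mysql search with mysql")
index_page(conn, "Article B", "php and mysql")
hits = search(conn, "MySQL")
```

Ranking by raw occurrence count is the simplest reading of "pages with the most indexed words". It ignores page length and how common the word is, so a production engine would likely use a weighted score or a built-in full-text index instead.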

Block some of dynamic pages from search engines

Question: I need to block some of my pages from search engines. How can I do that? The app was developed using ASP.NET MVC and AngularJS. Thanks in advance. These are the URLs which I want to block from search engines: http://localhost:12534/myurl123-event?participant=12957 http://localhost:12534/myurl123-event Note: The last part of the URL is dynamic (i.e. myurl123-event?participant=12957 and myurl123-event). Answer 1: You can use a robots.txt with a disallow setting: User-agent: * Disallow:

Including a Search Plugin in my Firefox Extension

Question: I am developing a Firefox extension (Firefox 3.6 and up), and when the user installs my extension I would like to: 1. Add my own search engine to the list of available search engines in Firefox. 2. Change the default search engine to my newly installed search engine. Strange as it is, I couldn't find much help on the Mozilla Developer Network or by Googling it. Thanks. Aviv Answer 1: Just put your search plugin definition in the searchplugins/ subdirectory of your extension (see https://developer

How do search engines crawl Javascript?

Question: If I add random keywords as alt attributes to images using jQuery's document.ready (on the assumption that the page is already loaded), how does it affect search engines? Will search engines see alt attributes that I add with JavaScript at all? If not, how can they understand Ajax calls sent via JavaScript? I want to add alt attributes to images that don't have any on my client's site, in case they forget to provide alt text. jQuery would replace the empty ones with keywords. Is this possible? Answer 1:

Does Google crawl HTTPS Links?

Question: I have a secured site. Will Google (and other search engines) crawl the pages on the site? Answer 1: Yes, Google will index both HTTP and HTTPS pages. It's evident if you try to use search, although I can't seem to find confirmation from any "official" source at the moment. Source: https://stackoverflow.com/questions/9980613/does-google-crawl-https-links

building safe search engine for kids?

Question: My project this year is to build a safe search engine for kids, so I need some information about: 1. Where can I find studies and papers on this topic? 2. How can I make use of open source tools? 3. Which technologies do I need to learn for my project? Any advice would be appreciated, thanks. Update: "safe" means that when kids search for anything, they don't see results about pornography. Answer 1: Your solution will be a generalization of spam filters for email. You can either write a rule-based solution or
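The rule-based option mentioned in the answer could look roughly like the Python sketch below, which drops any result whose title or snippet contains a blocked term. The blocklist contents, the result fields (`title`, `snippet`) and the function names are all hypothetical, chosen only to illustrate the idea.

```python
import re

# Hypothetical blocklist; a real one would be far larger and curated.
BLOCKED_TERMS = frozenset({"adult", "xxx"})


def is_safe(text, blocked_terms=BLOCKED_TERMS):
    """Return True if no whole word in `text` appears in the blocklist."""
    tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
    return tokens.isdisjoint(blocked_terms)


def filter_results(results, blocked_terms=BLOCKED_TERMS):
    """Keep only results whose title and snippet both pass the rule."""
    return [
        r for r in results
        if is_safe(r["title"] + " " + r["snippet"], blocked_terms)
    ]


results = [
    {"title": "Dinosaur facts", "snippet": "Learn about dinosaurs"},
    {"title": "XXX videos", "snippet": "Not for children"},
]
safe_results = filter_results(results)
```

Matching whole tokens avoids flagging innocent words that merely contain a blocked substring, but a fixed word list is easy to evade and will miss a lot.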

Create search engine for site in asp .net c#

Question: Newbie alert! ASP.NET | C# | VS2008 | MS SQL Server 2008 R2 Express. I have a database with file names, file paths and file categories. I also have lots of static pages in the website. I want to develop a custom search engine (since I am not allowed to use third-party tools; basically they want me to reinvent the wheel). When the user searches for something, it should search the file names and also the content of the static pages. File names can be searched straightforwardly from the