search-engine

programmer-friendly search engine? [duplicate]

我们两清 submitted on 2019-12-03 06:38:47
Google is unfriendly to searching for verbatim strings containing characters like $ and #. Is there a search engine that supports searching for verbatim strings?

Try http://www.google.com/codesearch. Just remember that its query language is regular expressions, so to search for $value, use \$value. For example: http://www.google.com/codesearch?hl=en&lr=&q=\%24value&sbtn=Search

This one would be cool if it covered more: http://start.gotapi.com/ or http://www.gotapi
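The escaping advice above can be sketched with Python's `re` module. This is only an illustration of why `$` needs a backslash in a regex-based search; the regex dialect used by Code Search may differ, and the `source` string and the `verbatim_pattern` helper are made up for the example.

```python
import re
from urllib.parse import quote

def verbatim_pattern(text: str) -> str:
    """Return a regex that matches `text` literally (hypothetical helper)."""
    return re.escape(text)

# Made-up snippet of source code to search in.
source = "echo $value; $values = 1; # comment"

# Unescaped, `$` is an end-of-input anchor, so the literal text is never found.
unescaped_hit = re.search("$value", source)

# Escaped, the pattern becomes \$value and matches the literal characters.
pattern = verbatim_pattern("$value")
matches = [m.start() for m in re.finditer(pattern, source)]

# In a URL query string, `$` is percent-encoded as %24.
encoded = quote("$value", safe="")

print(unescaped_hit)  # None
print(pattern)        # \$value
print(matches)        # [5, 13]
print(encoded)        # %24value
```

Note that the escaped pattern also matches the prefix of `$values`; add a word boundary (`\b`) after the name if only whole identifiers are wanted.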

Do search engines respect the HTTP header field “Content-Location”?

南楼画角 submitted on 2019-12-03 06:26:01
I was wondering whether search engines respect the HTTP header field Content-Location. This could be useful, for example, when you want to remove the session ID argument from the URL:

    GET /foo/bar?sid=0123456789 HTTP/1.1
    Host: example.com
    …

    HTTP/1.1 200 OK
    Content-Location: http://example.com/foo/bar
    …

Clarification: I don’t want to redirect the request, as removing the session ID would lead to a completely different request and thus probably also a different response. I just want to state that the enclosed response is also available under its “main URL”. Maybe my example was not a good

Is it possible to develop a powerful web search engine using Erlang, Mnesia & Yaws?

岁酱吖の submitted on 2019-12-03 06:25:28
Question: I am thinking of developing a web search engine using Erlang, Mnesia & Yaws. Is it possible to build a powerful web search engine, even the fastest one, with this software? What would it take to accomplish this, and where do I start?

Answer 1: Erlang can make the most powerful web crawler today. Let me take you through my simple crawler. Step 1. I create a simple parallelism module, which I call mapreduce:

    -module(mapreduce).
    -export([compute/2]).
    %%=================================================

how to configure the synonyms_path in elasticsearch

梦想与她 submitted on 2019-12-03 06:06:02
I'm pretty new to Elasticsearch and I want to use synonyms. I added these lines to the configuration file:

    index :
        analysis :
            analyzer :
                synonym :
                    type : custom
                    tokenizer : whitespace
                    filter : [synonym]
            filter :
                synonym :
                    type : synonym
                    synonyms_path: synonyms.txt

Then I created an index test:

    "mappings" : {
        "test" : {
            "properties" : {
                "text_1" : { "type" : "string", "analyzer" : "synonym" },
                "text_2" : { "search_analyzer" : "standard", "index_analyzer" : "synonym", "type" : "string" },
                "text_3" : { "type" : "string", "analyzer" : "synonym" }
            }
        }
    }

and inserted a document of type test with this data: {

Meaning of parameters in a Google query?

自作多情 submitted on 2019-12-03 05:45:32
Are there any resources on what the parameters in a Google query mean? Is there any analysis of how the Google search pages work internally? Examples would be http://www.google.com/#hl=en&source=hp&q=lol&aq=f&aqi=&aql=&oq=&fp=45675624562456 or http://www.google.com/url?sa=t&source=web&ct=res&cd=11&ved=KJSGHFKSDJF&url=sfdgagasdgasdgasgasg&rct=j&q=fghthwrteghedgf&ei=asdfasdfsa&usg=asdfasdfasf

    q=searchstring is the search string
    source=something is where the search originated (www.google.com webpage, Google Toolbar, Quick Search of various browsers)
    ie=utf8 is the input encoding
    oe=utf8 is the output encoding
    rls=en
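To inspect such URLs yourself, the parameters can be split out with Python's standard `urllib.parse`. This is a minimal sketch: it only separates names from values and says nothing about what Google does with them. The `query_params` helper is a name invented for this example; it falls back to the fragment because the first example URL carries its parameters after `#` rather than `?`.

```python
from urllib.parse import urlsplit, parse_qs

def query_params(url: str) -> dict:
    """Return the URL's parameters as {name: [values]} (hypothetical helper).

    Reads the query string, or the fragment when there is no query string.
    """
    parts = urlsplit(url)
    raw = parts.query or parts.fragment
    # keep_blank_values keeps empty parameters such as "aqi=".
    return parse_qs(raw, keep_blank_values=True)

search_url = ("http://www.google.com/#hl=en&source=hp&q=lol"
              "&aq=f&aqi=&aql=&oq=&fp=45675624562456")
click_url = ("http://www.google.com/url?sa=t&source=web&ct=res&cd=11"
             "&ved=KJSGHFKSDJF&url=sfdgagasdgasdgasgasg&rct=j"
             "&q=fghthwrteghedgf&ei=asdfasdfsa&usg=asdfasdfasf")

search_params = query_params(search_url)
click_params = query_params(click_url)

for name, values in search_params.items():
    print(name, "=", values[0])
```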

List of JSON search engine APIs without quotas, like Bing? [closed]

末鹿安然 submitted on 2019-12-03 05:41:58
Question: I'd like to display some custom search results. I've looked at the JSON APIs of both Google and Microsoft (Bing). Unfortunately, Google limits the number of queries per day ($50 for a maximum of ten thousand queries). However, Bing allows an "unlimited" number of queries a day, for free. Are there other

I wonder how reverse image search services like tineye.com work …?

橙三吉。 submitted on 2019-12-03 05:27:04
How do reverse image search engines like TinEye work? I mean, what parameters are required to do an image search?

I don't know if TinEye uses exactly this one, but SURF is a commonly used algorithm for this purpose. Here you can see a usage example in Mathematica where partial matching of images is used to compose a landscape:

Database: Generally, you have a set of images collected from web sites. For each image, extract key features (SURF, SIFT, whatever) in the form of numerical vectors associated with that image. The vectors are stored in a searchable database. When you give an image to
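The database idea described above can be sketched in plain Python. This is a toy model under loud assumptions: the three-dimensional vectors are invented stand-ins for real SURF or SIFT descriptors (which are much longer and come from a feature extractor), the file names and the 0.2 distance threshold are arbitrary, and a real service would use an approximate nearest-neighbour index rather than a brute-force scan. Nothing here is known to be what TinEye does.

```python
import math

def distance(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def match_score(query_descriptors, image_descriptors, max_distance=0.2):
    """Count query descriptors that have a close neighbour in the image."""
    score = 0
    for q in query_descriptors:
        nearest = min(distance(q, d) for d in image_descriptors)
        if nearest <= max_distance:
            score += 1
    return score

def search(database, query_descriptors):
    """Rank stored images by how many query descriptors they match."""
    ranked = sorted(
        ((match_score(query_descriptors, descs), image_id)
         for image_id, descs in database.items()),
        reverse=True,
    )
    return [(image_id, score) for score, image_id in ranked]

# Hypothetical database: image id -> descriptor vectors extracted beforehand.
database = {
    "beach.jpg": [[0.1, 0.9, 0.3], [0.8, 0.2, 0.5]],
    "city.jpg": [[0.9, 0.9, 0.9], [0.4, 0.1, 0.7]],
}

# Descriptors of the query image (a slightly altered copy of beach.jpg).
query = [[0.12, 0.88, 0.31], [0.79, 0.22, 0.48]]

results = search(database, query)
print(results)  # [('beach.jpg', 2), ('city.jpg', 0)]
```

Counting descriptor matches instead of comparing whole images is what lets a cropped or edited copy still score well against the original.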

What does percolator mean/do in elasticsearch?

徘徊边缘 submitted on 2019-12-03 04:01:01
Question: Even though I read the Elasticsearch documentation to understand what a percolator is, I still have difficulty understanding what it means and where it is used, in simple terms. Can anyone provide me with more details?

Answer 1: What you usually do is index documents and get them back by querying. What the percolator allows you to do, in a nutshell, is index your queries and percolate documents against the indexed queries to know which queries they match. It's also called reversed search, as what
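The inversion the answer describes (store the queries, then run each incoming document against them) can be modelled in a few lines of Python. This is a conceptual sketch only, not the Elasticsearch API: `ToyPercolator`, its methods, and the query ids are invented for illustration, and a "query" here is simply a set of terms that must all appear in the document.

```python
class ToyPercolator:
    """Conceptual model of percolation: queries are stored, documents are tested."""

    def __init__(self):
        self._queries = {}

    def register(self, query_id, required_terms):
        """Store a query, here just a set of terms that must all be present."""
        self._queries[query_id] = {term.lower() for term in required_terms}

    def percolate(self, document_text):
        """Return the ids of all stored queries that the document matches."""
        tokens = set(document_text.lower().split())
        return sorted(
            query_id
            for query_id, terms in self._queries.items()
            if terms <= tokens
        )

percolator = ToyPercolator()
percolator.register("alert-erlang", ["erlang"])
percolator.register("alert-search-engine", ["search", "engine"])
percolator.register("alert-python", ["python"])

matched = percolator.percolate("Building a search engine in Erlang")
print(matched)  # ['alert-erlang', 'alert-search-engine']
```

A typical use is alerting: each user saves a query, and every newly arriving document is percolated to find out whose alert should fire.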

Is there any free unlimited album artwork search API service? [closed]

大城市里の小女人 submitted on 2019-12-03 03:49:36
Question: Google's custom search API is limited to 100 queries per day. That is far fewer than I expected. I want to add an artwork-search function to my app. Thanks a lot.

Answer 1: How about Discogs or Amazon, or seeing what Cover Fetcher does?

Answer 2: Musicbrainz and the Internet Archive offer the Cover Art

Are HTML Meta Tags still important? [closed]

风流意气都作罢 submitted on 2019-12-03 03:31:50
I read some articles on the Internet saying that search engines like Google and Bing don't care about HTML meta tags any more. Do I still need to maintain the HTML meta tags on my website properly? Thanks!

Are meta tags critical? Every search engine weights meta tags differently. Google doesn't let the public know how it uses meta tags. I have noticed that a lot of websites show up on the first page of Google for specific subjects and yet they don't have any keyword or description tags. So don't let anybody fool you by exaggerating how important they are for search engine optimization.