search-engine

How do I do a partial field match using Haystack?

廉价感情. submitted on 2019-12-28 16:40:59
Question: I needed a simple search tool for my Django-powered web site, so I went with Haystack and Solr. I have set everything up correctly and can find the correct search results when I type in the exact phrase, but I can't get any results when typing in a partial phrase. For example: "John" returns "John Doe" but "Joh" doesn't return anything. Model: class Person(models.Model): first_name = models.CharField(max_length=50) last_name = models.CharField(max_length=50) Search Index: class PersonIndex
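
The behaviour described above (whole words match, prefixes do not) is typical of an index that stores only complete tokens. One common way to support prefix matches is to index edge n-grams of each word. The following is a minimal, library-free sketch of that idea; the helper names `edge_ngrams`, `build_index` and `search`, and the `min_gram`/`max_gram` defaults, are hypothetical and are not Haystack or Solr APIs. Haystack does ship an `EdgeNgramField` field type built on the same idea, but whether it fits this particular setup is an assumption.

```python
def edge_ngrams(term, min_gram=2, max_gram=15):
    """Return the leading substrings of a term, e.g. 'john' -> ['jo', 'joh', 'john']."""
    term = term.lower()
    longest = min(len(term), max_gram)
    return [term[:n] for n in range(min_gram, longest + 1)]


def build_index(docs):
    """Map every edge n-gram of every word to the ids of the documents containing it."""
    index = {}
    for doc_id, text in docs.items():
        for word in text.split():
            for gram in edge_ngrams(word):
                index.setdefault(gram, set()).add(doc_id)
    return index


def search(index, query):
    """Look up a single query term; a prefix matches because it was indexed as a gram."""
    return index.get(query.lower(), set())


# Hypothetical sample data mirroring the example in the question.
people = {1: "John Doe", 2: "Jane Roe"}
index = build_index(people)
partial_hits = search(index, "Joh")
exact_hits = search(index, "John")
```

The trade-off in this sketch is index size: every word contributes several entries instead of one, in exchange for prefix lookups that cost a single dictionary access.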

Search engine using python for bookmarked sites [closed]

那年仲夏 submitted on 2019-12-25 18:44:20
Question: The idea I have is to build a search engine based on my bookmarks file which I have in CSV format. The motivation behind this idea is that I have a large number of bookmarks related to the educational resources which I want to be able to search and find related content for a

configuring nutch regex-normalize.xml

丶灬走出姿态 submitted on 2019-12-25 15:22:08
Question: I am using the Java-based Nutch web-search software. In order to prevent duplicate (URL) results from being returned in my search query results, I am trying to remove (a.k.a. normalize) the expressions of 'jsessionid' from the URLs being indexed when running the Nutch crawler to index my intranet. However, my modifications to $NUTCH_HOME/conf/regex-normalize.xml (prior to running my crawl) do not seem to be having any effect. How can I ensure that my regex-normalize.xml configuration is being
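
To make the intended normalization concrete, here is a minimal sketch of the rewrite itself, outside of Nutch. It assumes the session id appears as a `;jsessionid=...` path parameter that ends at the query string or fragment; the function name `strip_jsessionid` and the sample URL are hypothetical, and the pattern is an illustration, not the contents of any shipped regex-normalize.xml.

```python
import re

# Assumption: the session id is a ";jsessionid=<value>" path parameter that
# runs until the start of the query string ("?") or the fragment ("#").
_JSESSIONID = re.compile(r";jsessionid=[^?#]*", re.IGNORECASE)


def strip_jsessionid(url):
    """Return the URL with any ;jsessionid=... path parameter removed."""
    return _JSESSIONID.sub("", url)


# Hypothetical intranet URL used only for illustration.
normalized = strip_jsessionid(
    "http://intranet.example/app/page.jsp;jsessionid=ABC123?x=1"
)
```

Testing a candidate pattern like this against a handful of real crawled URLs first can help separate "the regex is wrong" from "the configuration file is not being loaded".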

Multiple shards on single machine performance

|▌冷眼眸甩不掉的悲伤 submitted on 2019-12-25 08:23:18
Question: Does it make sense to have multiple shards in Elasticsearch if I am going to use only a single machine? Will it improve performance in any way? Same question for Apache Solr: does it make sense to use Solr Cloud with ZooKeeper for a single server instance, or just create one core without any sharding? Let's assume I am not going to use other machines in the future, so the main point is how sharding on a single machine influences search engine performance. Answer 1: There are certain parts of Lucene that's

How to remove page element from search results / keywords?

谁都会走 submitted on 2019-12-25 08:09:36
Question: Here is the problem. I have a simple page with a form and not much other text content. In the form I have, for example: <h1>It should be my search result</h1> <form> <select name="Months"> <option>January</option> <option>February</option> ... <option>December</option> </select> </form> In the search results/page keywords I get more month names than other important content, whereas the keywords should be IT SHOULD BE MY SEARCH RESULT only. How can I exclude the select content from the results? Is it possible? Answer 1:

MySQL query - weight-based search engine

给你一囗甜甜゛ submitted on 2019-12-25 07:52:27
Question: Here is the database structure for a medical MIS table: CREATE TABLE IF NOT EXISTS `ab_med` ( `med_id` bigint(20) NOT NULL AUTO_INCREMENT, `med_title` varchar(200) DEFAULT NULL, `med_posted_date` varchar(200) DEFAULT NULL, `med_company_name` varchar(200) DEFAULT NULL, `med_experience` varchar(200) DEFAULT NULL, `med_salary` varchar(200) DEFAULT NULL, `med_city` varchar(200) DEFAULT NULL, `med_description_short` text, `med_details_link` varchar(500) DEFAULT NULL, `med_from_city_type` int(1)

SPHINX SEARCH: Does searchd have to run as a service to make it work with PHP?

拥有回忆 submitted on 2019-12-25 04:55:22
Question: I've been working on this all day but can't get it to work. I can search through CMD and get results on my indexed tables, but I have trouble understanding the sphinxapi for PHP. I don't even fully know how to write a query and get the results. Can someone help me see whether this works? HOW DO WE WRITE A PHP QUERY FOR SPHINX DATABASE = searchtest TABLE = documents Overview of my sphinx.conf file: type = mysql sql_host = localhost sql_user = root sql_pass = sqlpass sql_db =

Mapping and indexing Path hierarchy in Elastic NEST to search within directory paths

耗尽温柔 submitted on 2019-12-25 04:27:43
Question: I need to search for files and folders within specific directories. In order to do that, Elastic asks us to create the analyzer and set the tokenizer to path_hierarchy: PUT /fs { "settings": { "analysis": { "analyzer": { "paths": { "tokenizer": "path_hierarchy" } } } } } Then, create the mapping as illustrated below with two properties: name (holding the name of the file) and path (to store the directory path): PUT /fs/_mapping/file { "properties": { "name": { "type": "string", "index": "not
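
To illustrate why the path_hierarchy tokenizer suits "search within a directory" queries, here is a small Python emulation of the tokens it emits for a path with the default `/` delimiter: one token per ancestor directory, plus the full path. The function `path_hierarchy_tokens` is a hypothetical stand-in written for this illustration, not Elasticsearch or NEST code, and it ignores the tokenizer's other options.

```python
def path_hierarchy_tokens(path, delimiter="/"):
    """Emit each ancestor prefix of a path, shortest first, ending with the full path."""
    tokens = []
    current = ""
    for i, part in enumerate(path.split(delimiter)):
        # The first segment starts the prefix; later ones are joined with the delimiter.
        current = part if i == 0 else current + delimiter + part
        if current:  # skip the empty prefix produced by a leading delimiter
            tokens.append(current)
    return tokens


# Hypothetical file location used only for illustration.
tokens = path_hierarchy_tokens("/one/two/three")
```

Because every ancestor directory is stored as its own token, an exact term query for a directory such as `/one/two` matches every document located anywhere beneath it.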

Angularjs website not indexing on google

本秂侑毒 submitted on 2019-12-25 03:17:24
Question: I developed a large AngularJS application with more than 20 pages and added some SEO tags, but it is not working well. Most of the content of the pages is also populated using JavaScript. This is how I implemented the SEO tags for each page. app.js (function() { var app = angular.module('myApp', ['ngRoute', 'ngSanitize', 'angular-flexslider']); // config route app.config(['$routeProvider', '$locationProvider', function($routeProvider, $locationProvider) { var baseUrl = "partials/"; $routeProvider.when

Design Question for Notification System

╄→гoц情女王★ submitted on 2019-12-25 02:22:26
Question: The original post was posted at https://stackoverflow.com/questions/6007097/design-question-for-notification-system Here is more clarification of the problem: The purpose of the notification system is to notify users (via email for now) when content of the site has changed or been updated, or a new posting is made. This could be treated as a notification system where people define a rule or keyword for a 3rd-party site, and the notification system goes out to crawl the 3rd-party site and create a search inverted