Django Site Search with Solr and Haystack

Solr is written in Java, so first confirm that Java is installed by running java -version:

java version "1.7.0_79"

Java(TM) SE Runtime Environment (build 1.7.0_79-b15)

Java HotSpot(TM) 64-Bit Server VM (build 24.79-b02, mixed mode)

Download Solr:

http://archive.apache.org/dist/lucene/solr/

Download the archive, unpack it, and change into solr-4.10.4/example:

SimilarFacedeMacBook-Pro:example similarface$ java -jar start.jar

Solr prints a very long startup log.

Once it is running, the graphical admin interface is pleasant to look at.

SimilarFacedeMacBook-Pro:solr similarface$ pwd

/Users/similarface/Downloads/solr-4.10.4/example/solr

SimilarFacedeMacBook-Pro:solr similarface$ tree newblog/

newblog/

├── conf

│   ├── _rest_managed.json

│   ├── lang

│   │   └── stopwords_en.txt

│   ├── protwords.txt

│   ├── schema.xml

│   ├── solrconfig.xml

│   ├── stopwords.txt

│   └── synonyms.txt

├── core.properties

└── data

SimilarFacedeMacBook-Pro:solr similarface$ cat newblog/conf/solrconfig.xml

<?xml version="1.0" encoding="utf-8" ?>

<config>
    <luceneMatchVersion>LUCENE_36</luceneMatchVersion>
    <requestHandler name="/select" class="solr.StandardRequestHandler" default="true"/>
    <requestHandler name="/update" class="solr.UpdateRequestHandler"/>
    <requestHandler name="/admin" class="solr.admin.AdminHandlers"/>
    <requestHandler name="/admin/ping" class="solr.PingRequestHandler">
        <lst name="invariants">
            <str name="qt">search</str>
            <str name="q">*:*</str>
        </lst>
    </requestHandler>
</config>

For now, schema.xml contains only this minimal default:

<?xml version="1.0" ?>

   <schema name="default" version="1.5">

   </schema>

How to create the layout above: by hand, with mkdir and vim.
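The same layout can also be created with a short script instead of by hand. This is only a minimal sketch based on the directory names in the tree listing above. The function name create_core_skeleton and the choice to write a name= line into core.properties are illustrative assumptions. solrconfig.xml and the stopword, synonym and protwords files still have to be added with their real contents.

```python
import tempfile
from pathlib import Path

# The minimal default schema shown above.
MINIMAL_SCHEMA = (
    '<?xml version="1.0" ?>\n'
    '<schema name="default" version="1.5">\n'
    '</schema>\n'
)


def create_core_skeleton(solr_home, core_name):
    """Create <solr_home>/<core_name> with conf/, conf/lang/ and data/."""
    core_dir = Path(solr_home) / core_name
    (core_dir / "conf" / "lang").mkdir(parents=True, exist_ok=True)
    (core_dir / "data").mkdir(exist_ok=True)
    # Assumption: the core is discovered through this core.properties file.
    (core_dir / "core.properties").write_text("name=%s\n" % core_name)
    schema = core_dir / "conf" / "schema.xml"
    if not schema.exists():
        # Never overwrite a schema that is already in place.
        schema.write_text(MINIMAL_SCHEMA)
    return core_dir


if __name__ == "__main__":
    # Demonstrate in a throwaway directory rather than a real Solr home.
    with tempfile.TemporaryDirectory() as home:
        core = create_core_skeleton(home, "newblog")
        for path in sorted(core.rglob("*")):
            print(path.relative_to(home))
```

Pointing solr_home at the real example/solr directory would create the core directory there. The throwaway directory is used here only so the sketch has no side effects.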

pip install django-haystack==2.4.0

pip install pysolr==3.3.2

In settings.py, add haystack to INSTALLED_APPS and define HAYSTACK_CONNECTIONS:

INSTALLED_APPS = [

    ...

    'haystack',

    ...

]

HAYSTACK_CONNECTIONS = {

       'default': {

           'ENGINE': 'haystack.backends.solr_backend.SolrEngine',

           'URL': 'http://127.0.0.1:8983/solr/myblog'

       },

}

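search_indexes.py (in the blog app, where Haystack looks for index classes):

# coding: utf-8

__author__ = 'similarface'

from haystack import indexes

from .models import Post


class PostIndex(indexes.SearchIndex, indexes.Indexable):
    """Search index for the Post model."""

    # Main document field; its content is rendered from
    # templates/search/indexes/myblog/post_text.txt.
    text = indexes.CharField(document=True, use_template=True)
    publish = indexes.DateTimeField(model_attr='publish')

    def get_model(self):
        return Post

    def index_queryset(self, using=None):
        # Index only the posts returned by Post's "published" manager.
        return self.get_model().published.all()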

SimilarFacedeMacBook-Pro:templates similarface$ tree search/

search/

└── indexes

    └── myblog

        └── post_text.txt

SimilarFacedeMacBook-Pro:templates similarface$ pwd
/Users/similarface/PycharmProjects/StudyDjango/myblog/templates
SimilarFacedeMacBook-Pro:templates similarface$ cat search/indexes/myblog/post_text.txt
{{ object.title }}
{{ object.tags.all|join:", " }}
{{ object.content }}
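For each Post, Haystack renders this template and indexes the result as the text field. The sketch below only imitates that output in plain Python, to show roughly what the indexed document looks like. FakePost and render_post_text are hypothetical stand-ins, not Haystack or Django APIs, and the real rendering is done by the Django template engine.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FakePost:
    """Hypothetical stand-in for Post; holds only the templated attributes."""
    title: str
    content: str
    tags: List[str] = field(default_factory=list)


def render_post_text(post):
    """Imitate post_text.txt: title, comma-joined tags, content, one per line."""
    return "\n".join([post.title, ", ".join(post.tags), post.content])


if __name__ == "__main__":
    post = FakePost("Solr notes", "Setting up a core.", ["solr", "django"])
    print(render_post_text(post))
```

Because title, tags and content all end up in the single text field, a query against that field can match on any of them.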

SimilarFacedeMacBook-Pro:StudyDjango similarface$ python manage.py build_solr_schema

/System/Library/Frameworks/Python.framework/Versions/2.7/Extras/lib/python/pytz/__init__.py:29: UserWarning: Module email was already imported from /System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/email/__init__.pyc, but /Library/Python/2.7/site-packages is being added to sys.path

  from pkg_resources import resource_stream

/Library/Python/2.7/site-packages/django/core/management/base.py:265: RemovedInDjango110Warning: OptionParser usage for Django management commands is deprecated, use ArgumentParser instead

  RemovedInDjango110Warning)

/Library/Python/2.7/site-packages/haystack/management/commands/build_solr_schema.py:56: RemovedInDjango110Warning: render() must be called with a dict, not a Context.

  return t.render(c)

Save the following output to 'schema.xml' and place it in your Solr configuration directory.

--------------------------------------------------------------------------------------------

<?xml version="1.0" ?>

<!--

 Licensed to the Apache Software Foundation (ASF) under one or more

 contributor license agreements.  See the NOTICE file distributed with

 this work for additional information regarding copyright ownership.

 The ASF licenses this file to You under the Apache License, Version 2.0

 (the "License"); you may not use this file except in compliance with

 the License.  You may obtain a copy of the License at

     http://www.apache.org/licenses/LICENSE-2.0

 Unless required by applicable law or agreed to in writing, software

 distributed under the License is distributed on an "AS IS" BASIS,

 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

 See the License for the specific language governing permissions and

 limitations under the License.

-->

<schema name="default" version="1.5">

  <types>

    <fieldtype name="string"  class="solr.StrField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="boolean" class="solr.BoolField" sortMissingLast="true" omitNorms="true"/>

    <fieldtype name="binary" class="solr.BinaryField"/>

    <!-- Numeric field types that manipulate the value into

         a string value that isn't human-readable in its internal form,

         but with a lexicographic ordering the same as the numeric ordering,

         so that range queries work correctly. -->

    <fieldType name="int" class="solr.TrieIntField" precisionStep="0" omitNorms="true" sortMissingLast="true" positionIncrementGap="0"/>

    <fieldType name="float" class="solr.TrieFloatField" precisionStep="0" omitNorms="true" sortMissingLast="true" positionIncrementGap="0"/>

    <fieldType name="long" class="solr.TrieLongField" precisionStep="0" omitNorms="true" sortMissingLast="true" positionIncrementGap="0"/>

    <fieldType name="double" class="solr.TrieDoubleField" precisionStep="0" omitNorms="true" sortMissingLast="true" positionIncrementGap="0"/>

    <fieldType name="sint" class="solr.SortableIntField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="slong" class="solr.SortableLongField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="sfloat" class="solr.SortableFloatField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="sdouble" class="solr.SortableDoubleField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="tint" class="solr.TrieIntField" precisionStep="8" omitNorms="true" positionIncrementGap="0"/>

    <fieldType name="tfloat" class="solr.TrieFloatField" precisionStep="8" omitNorms="true" positionIncrementGap="0"/>

    <fieldType name="tlong" class="solr.TrieLongField" precisionStep="8" omitNorms="true" positionIncrementGap="0"/>

    <fieldType name="tdouble" class="solr.TrieDoubleField" precisionStep="8" omitNorms="true" positionIncrementGap="0"/>

    <fieldType name="date" class="solr.TrieDateField" omitNorms="true" precisionStep="0" positionIncrementGap="0"/>

    <!-- A Trie based date field for faster date range queries and date faceting. -->

    <fieldType name="tdate" class="solr.TrieDateField" omitNorms="true" precisionStep="6" positionIncrementGap="0"/>

    <fieldType name="point" class="solr.PointType" dimension="2" subFieldSuffix="_d"/>

    <fieldType name="location" class="solr.LatLonType" subFieldSuffix="_coordinate"/>

    <fieldtype name="geohash" class="solr.GeoHashField"/>

    <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">

      <analyzer type="index">

        <tokenizer class="solr.StandardTokenizerFactory"/>

        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />

        <!-- in this example, we will only use synonyms at query time

        <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>

        -->

        <filter class="solr.LowerCaseFilterFactory"/>

      </analyzer>

      <analyzer type="query">

        <tokenizer class="solr.StandardTokenizerFactory"/>

        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />

        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>

        <filter class="solr.LowerCaseFilterFactory"/>

      </analyzer>

    </fieldType>

    <fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">

      <analyzer type="index">

        <tokenizer class="solr.StandardTokenizerFactory"/>

        <filter class="solr.StopFilterFactory"

                ignoreCase="true"

                words="lang/stopwords_en.txt"

                enablePositionIncrements="true"

                />

        <filter class="solr.LowerCaseFilterFactory"/>

        <filter class="solr.EnglishPossessiveFilterFactory"/>

        <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>

        <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:

          <filter class="solr.EnglishMinimalStemFilterFactory"/>

        -->

        <filter class="solr.PorterStemFilterFactory"/>

      </analyzer>

      <analyzer type="query">

        <tokenizer class="solr.StandardTokenizerFactory"/>

        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>

        <filter class="solr.StopFilterFactory"

                ignoreCase="true"

                words="lang/stopwords_en.txt"

                enablePositionIncrements="true"

                />

        <filter class="solr.LowerCaseFilterFactory"/>

        <filter class="solr.EnglishPossessiveFilterFactory"/>

        <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>

        <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:

          <filter class="solr.EnglishMinimalStemFilterFactory"/>

        -->

        <filter class="solr.PorterStemFilterFactory"/>

      </analyzer>

    </fieldType>

    <fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">

      <analyzer>

        <tokenizer class="solr.WhitespaceTokenizerFactory"/>

      </analyzer>

    </fieldType>

    <fieldType name="ngram" class="solr.TextField" >

      <analyzer type="index">

        <tokenizer class="solr.KeywordTokenizerFactory"/>

        <filter class="solr.LowerCaseFilterFactory"/>

        <filter class="solr.NGramFilterFactory" minGramSize="3" maxGramSize="15" />

      </analyzer>

      <analyzer type="query">

        <tokenizer class="solr.KeywordTokenizerFactory"/>

        <filter class="solr.LowerCaseFilterFactory"/>

      </analyzer>

    </fieldType>

    <fieldType name="edge_ngram" class="solr.TextField" positionIncrementGap="1">

      <analyzer type="index">

        <tokenizer class="solr.WhitespaceTokenizerFactory" />

        <filter class="solr.LowerCaseFilterFactory" />

        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/>

        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15" side="front" />

      </analyzer>

      <analyzer type="query">

        <tokenizer class="solr.WhitespaceTokenizerFactory" />

        <filter class="solr.LowerCaseFilterFactory" />

        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/>

      </analyzer>

    </fieldType>

  </types>

  <fields>

    <!-- general -->

    <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>

    <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false"/>

    <field name="django_id" type="string" indexed="true" stored="true" multiValued="false"/>

    <field name="_version_" type="long" indexed="true" stored ="true"/>

    <dynamicField name="*_i"  type="int"    indexed="true"  stored="true"/>

    <dynamicField name="*_s"  type="string"  indexed="true"  stored="true"/>

    <dynamicField name="*_l"  type="long"   indexed="true"  stored="true"/>

    <dynamicField name="*_t"  type="text_en"    indexed="true"  stored="true"/>

    <dynamicField name="*_b"  type="boolean" indexed="true"  stored="true"/>

    <dynamicField name="*_f"  type="float"  indexed="true"  stored="true"/>

    <dynamicField name="*_d"  type="double" indexed="true"  stored="true"/>

    <dynamicField name="*_dt" type="date" indexed="true" stored="true"/>

    <dynamicField name="*_p" type="location" indexed="true" stored="true"/>

    <dynamicField name="*_coordinate"  type="tdouble" indexed="true"  stored="false"/>

    <field name="text" type="text_en" indexed="true" stored="true" multiValued="false" />

    <field name="publish" type="date" indexed="true" stored="true" multiValued="false" />

  </fields>

  <!-- field to use to determine and enforce document uniqueness. -->

  <uniqueKey>id</uniqueKey>

  <!-- field for the QueryParser to use when an explicit fieldname is absent -->

  <defaultSearchField>text</defaultSearchField>

  <!-- SolrQueryParser configuration: defaultOperator="AND|OR" -->

  <solrQueryParser defaultOperator="AND"/>

</schema>

Save the output above as schema.xml in the Solr core's conf directory:

vi ~/Downloads/solr-4.10.4/example/solr/myblog/conf/schema.xml
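Before rebuilding the index, it can help to confirm that the saved schema really declares the fields that PostIndex uses. This is a minimal sketch using only the standard library. The helper name declared_fields is an assumption, and SAMPLE is a trimmed excerpt of the generated schema above, inlined so that the example runs on its own.

```python
import xml.etree.ElementTree as ET

# Trimmed excerpt of the generated schema; read the real schema.xml in practice.
SAMPLE = """<?xml version="1.0" ?>
<schema name="default" version="1.5">
  <fields>
    <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
    <field name="text" type="text_en" indexed="true" stored="true" multiValued="false" />
    <field name="publish" type="date" indexed="true" stored="true" multiValued="false" />
  </fields>
  <uniqueKey>id</uniqueKey>
</schema>
"""


def declared_fields(schema_xml):
    """Return {field name: field type} for every <field> element."""
    root = ET.fromstring(schema_xml)
    return {f.get("name"): f.get("type") for f in root.iter("field")}


if __name__ == "__main__":
    fields = declared_fields(SAMPLE)
    # The two fields defined on PostIndex.
    missing = {"text", "publish"} - set(fields)
    print("missing fields:", sorted(missing))
```

To check the real file, pass the contents of schema.xml to declared_fields instead of SAMPLE.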

SimilarFacedeMacBook-Pro:StudyDjango similarface$ python manage.py rebuild_index

 from pkg_resources import resource_stream

/Library/Python/2.7/site-packages/django/core/management/base.py:265: RemovedInDjango110Warning: OptionParser usage for Django management commands is deprecated, use ArgumentParser instead

  RemovedInDjango110Warning)

WARNING: This will irreparably remove EVERYTHING from your search index in connection 'default'.

Your choices after this are to restore from backups or rebuild via the `rebuild_index` command.

Are you sure you wish to continue? [y/N] y

Removing all documents from your index because you said so.

Failed to clear Solr index: [Reason: Error 404 Not Found]

All documents removed.

Indexing 8 posts

/Library/Python/2.7/site-packages/haystack/fields.py:137: RemovedInDjango110Warning: render() must be called with a dict, not a Context.

  return t.render(Context({'object': obj}))

Failed to add documents to Solr: [Reason: Error 404 Not Found]
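The 404 responses suggest that the URL in HAYSTACK_CONNECTIONS did not resolve to a usable core. One thing worth checking is whether a core with the URL's name exists under the Solr home, since the tree listing above shows a newblog directory while the URL and the schema path use myblog. The sketch below is one possible check, not a definitive diagnosis. The helper names are hypothetical, and it assumes cores are discovered through core.properties files whose optional name= line overrides the directory name.

```python
import tempfile
from pathlib import Path
from urllib.parse import urlparse


def core_name_from_url(url):
    """Return the last path segment of a Solr core URL, e.g. 'myblog'."""
    return urlparse(url).path.rstrip("/").rsplit("/", 1)[-1]


def discovered_cores(solr_home):
    """Map core name -> directory for every */core.properties under solr_home."""
    cores = {}
    for props in Path(solr_home).glob("*/core.properties"):
        name = props.parent.name  # default when no name= line is present
        for line in props.read_text().splitlines():
            key, sep, value = line.partition("=")
            if sep and key.strip() == "name" and value.strip():
                name = value.strip()
        cores[name] = props.parent
    return cores


if __name__ == "__main__":
    # Throwaway Solr home holding only a "newblog" core, as in the tree listing.
    with tempfile.TemporaryDirectory() as home:
        (Path(home) / "newblog").mkdir()
        (Path(home) / "newblog" / "core.properties").write_text("")
        wanted = core_name_from_url("http://127.0.0.1:8983/solr/myblog")
        status = "found" if wanted in discovered_cores(home) else "missing"
        print(wanted, status)
```

If the name from the URL is missing, either the URL or the core directory needs to change so that the two agree.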
