solr 3.5安裝 及分詞器配置

1、搭建說明java

環境:web

1.系統:windows7 x86_64apache

2.java 版本 :jdk1.8windows

3.tomcat 版本:tomcat 6瀏覽器

4.solr:solr3.5.0tomcat

5.lucene:lucene 3.5.0app

6.中文分詞起mmseg4jwebapp

2、下載jsp

solr 3.5下載地址:http://www.apache.org/dist//lucene/solr/ide

3、solr 3.5的安裝

1.修改tomcat http服務監聽端口

<Connector port="8080" protocol="HTTP/1.1" connectionTimeout="20000" redirectPort="8443" URIEncoding="UTF-8"/>

 修改默認端口8080,能夠不修改,添加字符編碼  URIEncoding="UTF-8" 不然中文會亂碼

    注:若是是你一個系統啓動多個tomcat你要同時修改三個端口,以保證每一個tomcat都不一樣,才能同時啓動

    1.tomcat http端口

    2.tomcat shutdown 端口

2.在apache-solr-3.5.0\dist目錄下找到apache-solr-3.5.0.war把重命名爲solr.war 並上傳到 tomcat-6/webapps下。

3.建立一個solr.xml文件,在tomcat-7.0.14\conf\Catalina\localhost\下建立一個solr.xml文件。內容爲:      

<?xml version="1.0" encoding="UTF-8"?>

<Context docBase="F:\apache-tomcat-6.0.37\webapps\solr"   debug="0" crossContext="true">

    <Environment name="solr/home" type="java.lang.String"  value="F:\apache-tomcat-6.0.37\solr" override="true" />

</Context>

注:docBase 指向應用所在目錄,主要用來爲應用設置環境變量

4.把apache-solr-3.5.0\example目錄下的solr文件夾上傳到e:/tomcat-6目錄下(即tomcat根目錄下,同bin在一個目錄下).索引文件會默認會放到 e:/tomcat-6\solr\data下.

5.從新tomcat 在瀏覽器中輸入:http://localhost:8080/  會出現:

        welcome to solr  

        solr admin  說明發布已成功!

 

 

 

4、分詞器配置

   1.整合 漢字分詞器 IKAnalyzer 3.2.8

        1).添加IKAnalyzer3.2.8.jar

           把IKAnalyzer3.2.8.jar放到 tomcat-7.0.14\webapps\solr\WEB-INF\lib下;

        2).schema.xml 增長信息

            進入目錄編輯schema.xml文件,在<Types>下添加如下內容: 

            

<fieldType name="text" class="solr.TextField" >  

               <analyzer class="org.wltea.analyzer.lucene.IKAnalyzer"/>  

               <analyzer type="index">  

                   <tokenizer class="org.wltea.analyzer.solr.IKTokenizerFactory" isMaxWordLength="false"/>  

                   <filter class="solr.StopFilterFactory"  

                           ignoreCase="true" words="stopwords.txt"/>  

                   <filter class="solr.WordDelimiterFilterFactory"  

                           generateWordParts="1"  

                           generateNumberParts="1"  

                           catenateWords="1"  

                           catenateNumbers="1"  

                           catenateAll="0"  

                           splitOnCaseChange="1"/>  

                   <filter class="solr.LowerCaseFilterFactory"/>  

                   <filter class="solr.EnglishPorterFilterFactory"  

                       protected="protwords.txt"/>  

                   <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>  

               </analyzer>  

                 <analyzer type="query">  

                   <tokenizer class="org.wltea.analyzer.solr.IKTokenizerFactory" isMaxWordLength="false"/>  

                   <filter class="solr.StopFilterFactory"  

                           ignoreCase="true" words="stopwords.txt"/>  

                   <filter class="solr.WordDelimiterFilterFactory"  

                           generateWordParts="1"  

                           generateNumberParts="1"  

                           catenateWords="1"  

                           catenateNumbers="1"  

                           catenateAll="0"  

                           splitOnCaseChange="1"/>  

                   <filter class="solr.LowerCaseFilterFactory"/>  

                   <filter class="solr.EnglishPorterFilterFactory"  

                       protected="protwords.txt"/>  

                   <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>  

               </analyzer>  

    </fieldType>

 

而後在<fields>下添加:

 

<field name="name1" type="text" indexed="true" stored="true" required="true" />

        3)、重起tomcat 在瀏覽器中輸入:http://localhost:8080/solr/admin/analysis.jsp進去 solr admin 顯示:

相關文章
相關標籤/搜索