2011-03-11 36 views
0

我正在使用tika和dataimporthandler。在执行完全导入时,我收到以下错误。solr tika提取问题

SEVERE: Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load EntityProcessor implementation for entity:tika-test Processing Document # 1 
      at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72) 
      at org.apache.solr.handler.dataimport.DocBuilder.getEntityProcessor(DocBuilder.java:817) 
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:547) 
      at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:267) 
      at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:186) 
      at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:353) 
      at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:411) 
      at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:392) 
    Caused by: java.lang.ClassNotFoundException: Unable to load TikaEntityProcessor or org.apache.solr.handler.dataimport.TikaEntityProcessor 
      at org.apache.solr.handler.dataimport.DocBuilder.loadClass(DocBuilder.java:1010) 
      at org.apache.solr.handler.dataimport.DocBuilder.getEntityProcessor(DocBuilder.java:814) 
      ... 6 more 
    Caused by: org.apache.solr.common.SolrException: Error loading class 'TikaEntityProcessor' 
      at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:389) 
      at org.apache.solr.handler.dataimport.DocBuilder.loadClass(DocBuilder.java:1000) 
      ... 7 more 
    Caused by: java.lang.ClassNotFoundException: TikaEntityProcessor 
      at java.net.URLClassLoader$1.run(URLClassLoader.java:217) 
      at java.security.AccessController.doPrivileged(Native Method) 
      at java.net.URLClassLoader.findClass(URLClassLoader.java:205) 
      at java.lang.ClassLoader.loadClass(ClassLoader.java:319) 
      at java.net.FactoryURLClassLoader.loadClass(URLClassLoader.java:615) 
      at java.lang.ClassLoader.loadClass(ClassLoader.java:264) 
      at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:332) 
      at java.lang.Class.forName0(Native Method) 
      at java.lang.Class.forName(Class.java:264) 
      at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:373) 

solrconfig.xml中我写的配置参数,这样

</requestHandler> 
<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler"> 
       <lst name="defaults"> 
       <str name="config">/var/solr/data-config.xml</str> 
      </lst> 
     </requestHandler> 

我哪里错了?你们能帮助我吗?我正在寻找这个很长一段时间。

回答

1

您正在使用哪个Solr版本? Solr 1.4?
1.4后添加了TikaEntityProcessor:
http://www.mail-archive.com/[email protected]/msg35902.html

+0

我也在使用1.4.1和3.2。在这两个版本中它都显示相同的错误。ticka版本是8.0 – 2011-03-12 05:56:49

+0

org.apache.solr.handler.dataimport.TikaEntityProcessor应该位于** apache-solr-dataimporthandler - ** _version_ **。jar **中。如果你解压(rar)jar,你能找到类TikaEntityProcessor吗?也许你的v3.2安装使用1.4.1版本的jar包? – 2011-03-12 16:00:48