2013-11-01 46 views
0

我在尝试将我的抓取数据从索引索引到索引,并收到以下错误。任何帮助将不胜感激。索引nutch数据到索引时出错

SOLRIndexWriter 
solr.server.url : URL of the SOLR instance (mandatory) 
solr.commit.size : buffer size when sending to SOLR (default 1000) 
solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml) 
solr.auth : use authentication (default false) 
solr.auth.username : use authentication (default false) 
solr.auth : username for authentication 
solr.auth.password : password for authentication 


Exception in thread "main" java.io.IOException: Job failed! 
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:81) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:65) 
at org.apache.nutch.crawl.Crawl.run(Crawl.java:155) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
at org.apache.nutch.crawl.Crawl.main(Crawl.java:55) 
+0

logs/hadoop.log文件的内容是什么? – nimeshjm

回答

0

您是否看到过solr日志?那些日志记录错误原因。 我曾经在nutch遇到过同样的问题,并在solr的日志中发现了一条消息“unknown field host”。 编辑完scheme.xml后,问题消失了。