2016-10-01 76 views
2

,下面的错误来了,在mongodb日志中一个连接接受并立即结束。解决它?

$ bin/crawl conf/urls/seeds.txt tuto 1 未指定SOLRURL。跳过索引。 注入种子URL /e/apache-nutch/apache-nutch-2.3.1/runtime/local/bin/nutch注入conf/urls/seeds.txt -crawlId tuto InjectorJob:从2016-10-01开始18: 15:14 InjectorJob:注入urlDir:conf/urls/seeds.txt InjectorJob:使用类org.apache.gora.mongodb.store.MongoStore作为Gora存储类。 InjectorJob:java.lang.NullPointerException at java.lang.ProcessBuilder.start(ProcessBuilder.java:1010) at org.apache.hadoop.util.Shell.runCommand(Shell.java:482) at org.apache。 hadoop.util.Shell.run(Shell.java:455) at org.apache.hadoop.util.Shell $ ShellCommandExecutor.execute(Shell.java:702) at org.apache.hadoop.util.Shell.execCommand( (org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:646) at org。)。 apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:434) at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:281) 在org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(JobSubmissionFiles.java:125) 在org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:348) 在org.apache.hadoop.mapreduce。作业$ 10.run(Job.java:1285) at org.apache.hadoop.mapreduce.Job $ 10.run(Job.java:1282) at java.security.AccessController.doPrivileged(Native Method) at javax.security .auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614) at org.apache.hadoop.mapreduce.Job.submit(Job.java :1282) at org.apache.hatil.Nutc org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1303) at org.apache.nutch.util.Nutc hJob.waitForCompletion(NutchJob.java:115) at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:231) at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:252) at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:275) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.nutch.crawl。 InjectorJob.main(InjectorJob.java:284)错误:当我尝试运行时,启动Apache nutch与mongodb

回答

0

相信Nutch的最新版本的使用

# bin/nutch inject seedDirectory/ 

这至少为我工作。

0

我刚好从gora-mongodb-mapping.xml文件中的Nutch的conf文件夹中删除两行即:

[field name="sitemaps" docfield="sitemaps" type="document"] 

[field name="stmPriority" docfield="stmPriority" type="int32"] 

解决了这个问题。希望它能帮助你..

相关问题