2015-09-03 68 views
0

我试图在Hadoop集群上运行MR作业。一旦我成功尝试,但是从昨天我不能运行相同的命令,因为它无法找到输入路径Hadoop mapreduce输入路径不存在(MR Job寻找什么文件系统?)

这里是跟踪:

[[email protected] ~]$ yarn jar /tmp/mihaylov-temp/MR.jar 
ru.sbt.hadoop.mr.data.index.DayDataIndexStart -D ru.sbt.hadoop.mr.map.mb=3072 -D ru.sbt.hadoop.mr.reduce.count=8 -D ru.sbt.hadoop.mr.map.count=80 /nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18/ /nameservices/supercluster/esbLogs/year/2015/months/7/days/27/sort/ 1440622800966 1440709200966 

15/09/03 14:33:37 INFO client.RMProxy: Connecting to ResourceManager at SBT-IPO-077.ca.sbrf.ru/10.116.6.195:8032 
15/09/03 14:33:38 INFO mapreduce.JobSubmitter: Cleaning up the staging area /nameservices/supercluster/user/esb99usr/.staging/job_1439318575767_0078 
15/09/03 14:33:38 WARN security.UserGroupInformation: PriviledgedActionException as:esb99usr (auth:SIMPLE) cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: viewfs://cluster16/nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18 
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: viewfs://cluster16/nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18 

at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.singleThreadedListStatus(FileInputFormat.java:321) 
     .......... 


[[email protected] ~]$ hdfs dfs -ls /nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18/ 
Found 2 items 
-rw-r--r-- 3 wasadmin supergroup 1417233459 2015-08-27 21:39 /nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18/log1.seq 
-rw-r--r-- 3 wasadmin supergroup 6020096 2015-08-27 21:39 /nameservices/supercluster/esbLogs/year/2015/months/7/days/27/hrs/18/log1.seq.filter 

所以,当我运行MR的工作,它声称输入路径不存在,但我可以通过hadoop dfs -ls /....看到它。它直到昨天才完美运行,并且没有证据表明集群已经关闭,Cloudera说它已经启动,集群的健康状况良好。这个问题如何解决?

+0

您可以检查''esb99usr属于'supergroup'组? –

+0

@ Mr.Chowdary是的,它的确如此。 – Dmitry

+0

@ Mr.Chowdary btw可以看到,该文件的权限允许任何用户读取hdfs文件,所以我怀疑,该chmod是问题 – Dmitry

回答

0

的命令来运行作业

]$ yarn jar /tmp/mihaylov-temp/MR.jar (Main Class Name) (input file path) (output directoru path) 
相关问题