2013-05-17 65 views
0

我试图在CDH4环境中配置RHive。 在R中读取包'RHive'时,下面的错误被返回。 我猜这是由于错误的家园。 如果是这样,那么正确的是什么? 或者如果这不是原因,那有什么问题?CDH4中的Hadoop和Hive Homes

任何帮助将不胜感激。

谢谢。

> Sys.setenv(HIVE_HOME="/etc/hive") 
> Sys.setenv(HADOOP_HOME="/etc/hadoop") 
> library(RHive) 
Loading required package: rJava 
Loading required package: Rserve 
This is RHive 0.0-7. For overview type '?RHive'. 
HIVE_HOME=/etc/hive 
[1] "there is no slaves file of HADOOP. so you should pass hosts argument when you call rhive.connect()." 
Error : .onLoad failed in loadNamespace() for 'RHive', details: 
    call: .jnew("org/apache/hadoop/conf/Configuration") 
    error: java.lang.ClassNotFoundException 
In addition: Warning message: 
In file(file, "rt") : 
    cannot open file '/etc/hadoop/conf/slaves': No such file or directory 
Error: package/namespace load failed for 'RHive' 
+0

似乎很自我解释。您缺少指定集群中哪些节点的“从属”文件。 – greedybuddha

回答

0

您应该单独设置HADOOP_CONF_DIR。 尝试导出$ HADOOP_CONF_DIR =/etc/hadoop/conf/conf.pseudo

conf.pseudo具有从属文件。

虽然我很想知道您是否可以让CDH4使用RHive。

+0

感谢您的回应!我会给它一个镜头并发布结果。 –

+0

似乎像RHive可以使用CDH4。你可以看看http://stackoverflow.com/questions/16783549/rhive-not-working-with-cdh4。虽然我没有机会再看一遍。如果你确实得到它的工作,那么请张贴你是如何做到的。谢谢。 –

2
Had the problems but solved it. Downside is that I have to keep track of a bunch of sym links 

After struggling with install RHive_0.0-7.tar.gz on CDH 4.7.x and getting: 
Warning in file(file, "rt") : 
cannot open file '/etc/hadoop/conf/slaves': No such file or directory 
[1] "there is no slaves file of HADOOP. so you should pass hosts argument when you call rhive.connect()." 

In /etc/hadoop/conf 
I added a the following sym link ----> ln -s /opt/cloudera/parcels/CDH-4.4.0-1.cdh4.4.0.p0.39/etc/hadoop/conf.empty/slaves slaves 
(why Cloudera CHD 4.7 installs in /opt without creating the proper sym links from /usr/lib is puzzling) 

I also defined the followingin /usr/lib64/R/etc/Renviron 
## set hive paths 
HIVE_HOME='/opt/cloudera/parcels/CDH-4.4.0-1.cdh4.4.0.p0.39/lib/hive' 
HADOOP_HOME='/opt/cloudera/parcels/CDH-4.4.0-1.cdh4.4.0.p0.39/lib/hadoop' 
LD_LIBRARY_PATH='/opt/cloudera/parcels/CDH-4.4.0-1.cdh4.4.0.p0.39/lib/hadoop' 

At a shell prompt I ran R CMD INSTALL RHive_0.0-7.tar.gz 
Installation Happiness!! 

++++++ 
Inside R-Studio (server) 

> 
> library(RHive) 
Loading required package: rJava 
Loading required package: Rserve 
This is RHive 0.0-7. For overview type ‘?RHive’. 
HIVE_HOME=/opt/cloudera/parcels/CDH-4.4.0-1.cdh4.4.0.p0.39/lib/hive 
call rhive.init() because HIVE_HOME is set. 
rhive.init() 
> 
+++++++