2012-09-15 31 views
5

我正尝试使用AWS上的集群计算进行一些实验。我在这方面是全新的,并且有一些问题。我试图按照这里找到的教程:http://star.mit.edu/cluster/docs/latest/plugins/ipython.html#using-the-ipython-cluster。我使用星群启动具有以下内容的群集实例:在AWS上使用starcluster和ipython进行集群计算

starcluster start mycluster 

一切都如预期的那样出现,它显示ipython插件已加载。然后我尝试执行以下命令,如图教程:但是

starcluster sshmaster mycluster -u myuser 

连接失败,并告诉我

Permission denied (publickey). 

我能够登录使用

starcluster sshmaster mycluster 

所以我试图继续教程登录到主,但当我尝试创建客户端我收到并出现错误:

AssertionError: Not a valid connection file or url: 
u'/root/.ipython/profile_default/security/ipcontroller-client.json' 

,我看到的唯一的看上去与众不同的是,当群集已启动此出现:

>>> Running plugin ipcluster 
>>> Writing IPython cluster config files 
>>> Starting IPython cluster with 7 engines 
>>> Waiting for JSON connector file... 
>>> Creating IPCluster cache directory: /Users/username/.starcluster/ipcluster 
>>> Saving JSON connector file to '/Users/username/.starcluster/ipcluster/mycluster-us-east-1.json' 
!!! ERROR - Error occurred while running plugin 'ipcluster': 
Traceback (most recent call last): 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/cluster.py", line 1506, in run_plugin 
    func(*args) 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/plugins/ipcluster.py", line 276, in run 
    plug.run(nodes, master, user, user_shell, volumes) 
    File "<string>", line 2, in run 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/utils.py", line 87, in wrap_f 
    res = func(*arg, **kargs) 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/plugins/ipcluster.py", line 228, in run 
    cfile = self._start_cluster(master, n, profile_dir) 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/plugins/ipcluster.py", line 173, in _start_cluster 
    master.ssh.get(json, local_json) 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/sshutils/__init__.py", line 431, in get 
    self.scp.get(remotepaths, localpath, recursive=recursive) 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/sshutils/scp.py", line 141, in get 
    self._recv_all() 
    File "/Library/Python/2.7/site-packages/StarCluster-0.93.3-py2.7.egg/starcluster/sshutils/scp.py", line 242, in _recv_all 
    msg = self.channel.recv(1024) 
    File "build/bdist.macosx-10.8-intel/egg/ssh/channel.py", line 611, in recv 
    raise socket.timeout() 
timeout 

有什么想法?

回答

6

本教程假定CLUSTER_USER = myuser~/.starcluster/config即使默认CLUSTER_USER = sgeadmin