2014-02-19 41 views

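I have a pandas DataFrame of 153895 rows x 644 columns (read from a csv file). Several columns are strings; the others are integers and floats. I am trying to save it as an Rda file, but convert_to_r_dataframe fails with an error saying there is no attribute dtype.

I tried: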

import pandas.rpy.common as com 
myDFinR = com.convert_to_r_dataframe(myDF) 

I get the following error:

Traceback (most recent call last):
  File "C:\PF\WinPython-64bit-3.3.3.3\python-3.3.3.amd64\lib\site-packages\IPython\core\interactiveshell.py", line 2828, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-101-7d2a8ae98ea4>", line 1, in <module>
    dDataR=com.convert_to_r_dataframe(dData)
  File "C:\PF\WinPython-64bit-3.3.3.3\python-3.3.3.amd64\lib\site-packages\pandas\rpy\common.py", line 305, in convert_to_r_dataframe
    value_type = value.dtype.type
  File "C:\PF\WinPython-64bit-3.3.3.3\python-3.3.3.amd64\lib\site-packages\pandas\core\generic.py", line 1815, in __getattr__
    (type(self).__name__, name))
AttributeError: 'DataFrame' object has no attribute 'dtype'

I checked myDF.dtypes, and the output shows nothing unusual:

col1  object 
col2  object 
col3  int64 
... 
col642  float64 
col643  float64 
col644  float64 
Length: 644, dtype: object 

When I tried for i,j in enumerate(myDF.columns): print(i,":",myDF[j].dtype), it raised an error at column 359. However, myDF[[359]].dtypes gives me

col359  float64 
dtype: object 

What could be the problem?

Answer

I can reproduce the error message when myDF has non-unique column names:

import pandas as pd 
import pandas.rpy.common as com 

myDF = pd.DataFrame([[1,2],[3,4]], columns=['A','B']) 
myDFinR = com.convert_to_r_dataframe(myDF) 
print(myDFinR) # 1 

myDF2 = pd.DataFrame([[1,2],[3,4]], columns=['A','A']) 
myDFinR2 = com.convert_to_r_dataframe(myDF2) 
print(myDFinR2) # 2 
  1. prints

    A B 
    0 1 2 
    1 3 4 
    
  2. raises AttributeError

    AttributeError: 'DataFrame' object has no attribute 'dtype' 
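
The mechanism can be checked with pandas alone, without rpy. The following is a minimal sketch, not taken from the converter's source: selecting a duplicated label returns a DataFrame rather than a Series, and a DataFrame has dtypes but no dtype, which appears to be what the line value.dtype.type in the traceback runs into. The variable names selected and dup_names are made up for this illustration.

```python
import pandas as pd

# Same shape as the failing example: two columns that share the name 'A'.
df = pd.DataFrame([[1, 2], [3, 4]], columns=['A', 'A'])

# Selecting a duplicated label returns every matching column, i.e. a DataFrame.
selected = df['A']
print(type(selected).__name__)     # DataFrame
print(hasattr(selected, 'dtype'))  # False

# Index.duplicated() flags the repeats of a label; use it to list the offenders.
dup_names = df.columns[df.columns.duplicated()].unique().tolist()
print(dup_names)                   # ['A']
```

Running the last two lines against the real 644-column frame should list whichever names occur more than once.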
    

If this is indeed the source of your problem, you can fix it by renaming the columns to something unique:

myDF.columns = ['col{i}'.format(i=i) for i in range(len(myDF.columns))] 
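
Renaming everything to col0, col1, ... discards the original names. If those matter, one possible alternative is to rename only the repeats. This is a sketch under the assumption that appending a numeric suffix is acceptable; make_unique is a hypothetical helper written for this example, not a pandas function.

```python
import pandas as pd

def make_unique(names):
    """Return names with _1, _2, ... appended to repeats; first occurrences are kept."""
    used = set()    # every name handed out so far
    counts = {}     # last suffix tried for each original name
    result = []
    for name in names:
        candidate = name
        n = counts.get(name, 0)
        # Keep bumping the suffix until the candidate collides with nothing.
        while candidate in used:
            n += 1
            candidate = '{0}_{1}'.format(name, n)
        counts[name] = n
        used.add(candidate)
        result.append(candidate)
    return result

df = pd.DataFrame([[1, 2, 3, 4]], columns=['A', 'A', 'B', 'A'])
df.columns = make_unique(df.columns)
print(df.columns.tolist())  # ['A', 'A_1', 'B', 'A_2']
```

After this, df['A'] is a Series again, so code that reads its dtype no longer fails for that reason.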

Cool. Thanks, let me take a look. – uday

Yes, that was the problem. – uday
