2017-06-29 41 views
0

我想使用ETL将文件导入到Orientdb中的文件模型中csv文件 我不知道这是否作为新手正确,而不是文档模型上的大量文档,但我试过是:Orientdb将CSV文件导入到文档模型

{ 
    "config": { 
    "log": "debug" 
    }, 
    "begin": [], 
    "source": { 
    "file": { 
     "path": "C:/Users/M/Desktop/files/lact.csv" 
    } 
    }, 
    "extractor": 
{ "csv": 
     { "separator": ",", 
     "nullValue": "NULL" 
     } 
    }, 
    "transformers": [ 
    { 
     "log": {} 
    } 
    ], 
    "loader": { 
    "orientdb": { 
     "dbURL": "plocal:../databases/Model_doc", 



     "dbType": "document", 
     "classes": [ 
     { 
      "name": "Annotations" 
     }, 


     ] 
    } 
    }, 
    "end": [] 
} 

显示该文件的内容的解析之后,我的这句话: [orientdb] DEBUG orientdb:发现在课堂上“空”

的CSV文件0文件

"Entry","Entry_name","Status","Protein_names","Gene_names","Organism","Length","Cross_reference(STRING)" 
"Q29836","1B67_HUMAN","reviewed","HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67)","HLA-B HLAB","Homo sapiens (Human)","362","9606.ENSP00000399168;" 
"P30501","1C02_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"P30508","1C12_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"Q29960","1C16_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"Q29865","1C18_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18)","HLA-C HLAC","Homo sapiens (Human)","366","" 
+0

嗨,你能发布csv文件吗?你使用什么版本? Thx –

+0

@MichelaBonizzi我正在使用2.2.22 oientdb公社版本。至于csv我编辑了问题 –

+0

@MichelaBonizzi有什么想法吗? –

回答

1

我想你的代码,我也有同样的消息:

[orientdb] DEBUG orientdb: found 0 documents in class 'null' 

但我已经能够导入所有的数据,正如你从截图中看到的那样。

enter image description here

做到这一点作为@RobertoFranchini说,你要补充一点:

"transformers": [ 
{ 
    "log": {} 
}, 
{ 
    "field": { 
    "fieldName": "@class", 
    "value": "Annotations" 
    } 
} 
], 

我做了这个小改变您的CSV文件:

Entry,Entry_name,Status,Protein_names,Gene_names,Organism,Length,Cross_reference(STRING) 
Q29836,1B67_HUMAN,reviewed,HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67),HLA-B HLAB,Homo sapiens (Human),362,9606.ENSP00000399168 
P30501,1C02_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2),HLA-C HLAC,Homo sapiens (Human),366, 
P30508,1C12_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12),HLA-C HLAC,Homo sapiens (Human),366, 
Q29960,1C16_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16),HLA-C HLAC,Homo sapiens (Human),366, 
Q29865,1C18_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18),HLA-C HLAC,Homo sapiens (Human),366, 

和所有的数据已导入。

希望它有帮助。

问候。

+0

谢谢你已经修好了 –

1

你需要一个类分配给文件,现场变压器加入连锁,只要登录后

"transformers": [ 
{ 
    "log": {} 
}, 
{ 
    "field": { 
    "fieldName": "@class", 
    "value": "Annotations" 
    } 
} 
], 
+0

Franchini,没有工作我添加了该行,但问题仍然存在 –

+0

在您的重复问题中,您使用class而不是@class。请更新这个问题。 –