2014-04-08 57 views
0

我想索引csv文件中的数据到elasticsearch服务器。问题是该字符串本身包含多个“,”所以在建立索引期间,它给了indexoutofbound异常。在csv文件中处理逗号csv-river插件

如何使用csv-river plugin处理逗号。

编辑:

示例文件是:

MESSAGE_ID,PARENT_MESSAGE_ID,THREAD_ID,FORUM_ID,FORUMINDEX,USER_ID,SUBJECT,BODY,MODVALUE,FORUM_NAME,CATEGORY_NAME,LIKES,DISLIKES,IS_ROOT_MESSAGE,IS_QUESTION 
244,195,103,4,3,341,Re: The most stupidest program I've ever seen--Amazon,"I know nothing of your case, but I do know that throwing around terms like ""stupid idiot"" doesn't exactly help your side any.",1,"Order Management, Shipping, Feedback & Returns",Sell on Amazon,,,no,no 

回答

0

你需要附上领域中的报价。如果该字段包含引号,则需要使用前面的引号将其转义。 例如:

"field1","field2","field3 with, commas","field4","field ""5"" with quotes","field6" 
+0

请参阅示例文件。这不行吗? –

+0

你需要引用所有字段 – Donal

+0

我测试过,它的工作使得我给出的示例行成功建立索引。我认为问题在别的地方。 –