2016-09-15 98 views
2

带有一个joinFieldName并查找Edge变压器的工作原理。但是,现在需要两个键,即查找中的复合索引。如何指定两个joinFieldNames?OrientDB ETL边缘变换器2 joinFieldName(s)

这是脚本(后处理)版本: 创建边缘扩展从(从MC选取样品= 1和MKEY = 6)〜(从事件选择其中样品= 1和Mcl = 6)

这个工作,但不适合生产。

任何人都可以帮忙吗?

回答

3

,你可以简单地添加2 joinFieldName(S),如

{ "edge": { "class": "Conn", 
       "joinFieldName": "b1", 
       "lookup": "A.a1", 
       "joinFieldName": "b2", 
       "lookup": "A.a2", 
       "direction": "out" 
      }} 

见下面我的测试数据:

json1.json

{ 
    "source": { "file": { "path": "/home/ivan/Scrivania/cose/etl/stak39517796/data1.csv" } }, 
    "extractor": { "csv": {} }, 
    "transformers": [ 
    { "vertex": { "class": "A" } } 
    ], 
    "loader": { 
    "orientdb": { 
     "dbURL": "plocal:/home/ivan/OrientDB/db_installati/enterprise/orientdb-enterprise-2.2.10/databases/stack39517796", 
     "dbType": "graph", 
     "dbAutoCreate": true, 
     "classes": [ 
     {"name": "A", "extends": "V"}, 
     {"name": "B", "extends": "V"}, 
     {"name": "Conn", "extends": "E"} 
     ] 
    } 
    } 
} 

json2.json

{ 
    "source": { "file": { "path": "/home/ivan/Scrivania/cose/etl/stak39517796/data2.csv" } }, 
    "extractor": { "csv": {} }, 
    "transformers": [ 
    { "vertex": { "class": "B" } }, 
    { "edge": { "class": "Conn", 
       "joinFieldName": "b1", 
       "lookup": "A.a1", 
       "joinFieldName": "b2", 
       "lookup": "A.a2", 
       "direction": "out" 
      }} 
    ], 
    "loader": { 
    "orientdb": { 
     "dbURL": "plocal:/home/ivan/OrientDB/db_installati/enterprise/orientdb-enterprise-2.2.10/databases/stack39517796", 
     "dbType": "graph", 
     "dbAutoCreate": true, 
     "classes": [ 
     {"name": "A", "extends": "V"}, 
     {"name": "B", "extends": "V"}, 
     {"name": "Conn", "extends": "E"} 
     ] 
    } 
    } 
} 

data1.csv

a1,a2 
1,1 
1,2 
2,3 

data2.csv

b1,b2 
1,1 
2,3 
1,2 

执行顺序:

  1. json1
  2. json2

这里是最终结果:

orientdb {db=stack39517796}> select from v           

+----+-----+------+----+----+-------+----+----+--------+ 
|# |@RID |@CLASS|a1 |a2 |in_Conn|b2 |b1 |out_Conn| 
+----+-----+------+----+----+-------+----+----+--------+ 
|0 |#17:0|A  |1 |1 |[#25:0]| | |  | 
|1 |#18:0|A  |1 |2 |[#27:0]| | |  | 
|2 |#19:0|A  |2 |3 |[#26:0]| | |  | 
|3 |#21:0|B  | | |  |1 |1 |[#25:0] | 
|4 |#22:0|B  | | |  |3 |2 |[#26:0] | 
|5 |#23:0|B  | | |  |2 |1 |[#27:0] | 
+----+-----+------+----+----+-------+----+----+--------+ 
+0

优秀的帮助伊万! Mille Grazie – Tore

+0

@ ivan-mainetti我不知道这是否工作...如果你追加'2,1'到data1.csv和data2.csv,结果会产生额外的边缘从b(1,1)到a(1,1)和(2,1)。我认为它只匹配最后一个连接字段,而不是执行** AND **操作。 – TxAG98

+0

@ ivan-mainetti我对它进行了更多的研究,我认为这只适用于你的例子,因为a2和b2是唯一的。 json2.json中的第二个“joinFieldName”条目将[不会导致错误而破坏第一个...](http://stackoverflow.com/questions/5306741/do-json-keys-need-to-be-唯一的)如果你使用调试日志记录运行它,你会发现它只匹配a2到b2。 – TxAG98