0
我有80,000个XML文件,它们应该使用相同的格式。但是,情况显然不是这样。因此,我试图识别文件中的所有现有节点和子节点。确定列表中所有可能的父母和孩子
我已经使用XML包将XML文件导入为列表,并在下面描述了我的输入和我所需的输出。
输入(名单列表):
XML1 <- list(name = "Company Number 1",
adress = list(street = "JP Street", number = "12"),
product = "chicken")
XML2 <- list(name = "Company Number 2",
company_adress = list(street = "House Street", number = "93"),
invoice = list(quantity = "2", product = "phone"))
XML3 <- list(company_name = "Company Number 3",
adress = list(street = "Lake Street", number = "1"),
invoice = list(quantity = "2", product = "phone", list(note = "Phones are refurbished")))
输出(树形结构跨文件与出现的次数在叶子):
List of 5
$ name : num 2
$ company_name : num 1
$ adress :List of 2
..$ street: num 2
..$ number: num 2
$ company_adress:List of 2
..$ street: num 1
..$ number: num 1
$ invoice :List of 3
..$ quantity: num 2
..$ product : num 2
..$ :List of 1
.. ..$ note: num 1
$ product : num 1
是否有一个包,可以沿着这条线做一些事情,还是我需要写一个自己做这个的函数?