如何使用java从xml文件中提取详细信息？

我有以下类型的XML文件，如何使用java从xml文件中提取详细信息？

<?xml version="1.0" encoding="UTF-8"?> 
<!DOCTYPE eSummaryResult PUBLIC "-//NLM//DTD eSummaryResult, 29 October 2004//EN" "http://www.ncbi.nlm.nih.gov/entrez/query/DTD/eSummary_041029.dtd"> 
<eSummaryResult> 
<DocSum> 
    <Id>224589801</Id> 
    <Item Name="Caption" Type="String">NC_000010</Item> 
    <Item Name="Title" Type="String">Homo sapiens chromosome 10, GRCh37.p10 Primary Assembly</Item> 
    <Item Name="Extra" Type="String">gi|224589801|gnl|ASM:GCF_000001305|10|ref|NC_000010.10||gpp|GPC_000000034.1||gnl|NCBI_GENOMES|10[224589801]</Item> 
    <Item Name="Gi" Type="Integer">224589801</Item> 
    <Item Name="CreateDate" Type="String">2002/08/29</Item> 
    <Item Name="UpdateDate" Type="String">2012/10/30</Item> 
    <Item Name="Flags" Type="Integer">544</Item> 
    <Item Name="TaxId" Type="Integer">9606</Item> 
    <Item Name="Length" Type="Integer">135534747</Item> 
    <Item Name="Status" Type="String">live</Item> 
    <Item Name="ReplacedBy" Type="String"/> 
    <Item Name="Comment" Type="String"><![CDATA[ ]]></Item> 
</DocSum> 

</eSummaryResult>

如何提取节点=“项目”基于它的名称为价值的细节？为了达到这个目的，使用标准的java dom xml或更好的方式来使用任何其他的xml解析器库也是好的吗？

来源

2012-12-07 Balaram26

，试试这个（javax.xml.stream *）

XMLInputFactory f = XMLInputFactory.newInstance(); 
    XMLStreamReader rdr = f.createXMLStreamReader(new FileReader("test.xml")); 
    while (rdr.hasNext()) { 
     if (rdr.next() == XMLStreamConstants.START_ELEMENT) { 
      if (rdr.getLocalName().equals("Item")) { 
       System.out.println(rdr.getAttributeValue("", "Name")); 
       System.out.println(rdr.getElementText()); 
      } 
     } 
    }

的StAX必须始终是首先要考虑的事情。见http://en.wikipedia.org/wiki/StAX你会知道为什么

来源

2012-12-07 13:05:39

试试下面的代码

/* Create a Document object (doc) from the xml */ 
NodeList list = doc.getElementsByTagName("Item"); 

for(int i=0;i<list.getLength();i++) 
{ 
    Node node = list.item(i); 
    NamedNodeMap namedNodeMap = node.getAttributes(); 
    if(namedNodeMap.getNamedItem("Name").getTextContent().equalsIgnoreCase("Caption")) 
    { 
     System.out.println(node.getTextContent()); 
    } 
}

输出应该NC_000010

来源

2012-12-07 12:41:24 DDK

也许使用XPath？

Document dom = ...; 
XPath xpath = XPathFactory.newInstance().newXPath(); 
String result = xpath.evaluate("/eSummaryResult/DocSum/Item[@Name='Title']", dom);

来源

2012-12-07 12:45:57 mgaert

如果仅仅使用标准的Java，XPath是要走的路：我建议StAX的

private URL xml = getClass().getResource("/example.xml"); 

@Test 
public void testExamples() throws Exception { 
    //assertEquals("NC_000010", extractUsingDom("Caption")); 
    assertEquals("NC_000010", extractUsingXPath("Caption")); 
} 

public String extractUsingXPath(final String name) throws XPathExpressionException, IOException { 
    // XPathFactory class is not thread-safe so we do not store it 
    XPath xpath = XPathFactory.newInstance().newXPath(); 
    return xpath.evaluate(
     String.format("/eSummaryResult/DocSum/Item[@Name='%s']", name), // xpath expression 
     new InputSource(xml.openStream()));        // the XML Document 
}

来源

2012-12-07 12:58:49 rmuller

如何使用java从xml文件中提取详细信息？

回答

相关问题