2012-03-26 57 views

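DocumentBuilderFactory and the & operator

I want to parse the 'text' line of the input, but for some reason the parser drops the &quot;. It looks as if every & is ignored. How can I get that part?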

DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
DocumentBuilder builder = factory.newDocumentBuilder();
String input = "<status> <id>000000014</id> <created_at>2012/03/07</created_at><text>&quot;Resolutionary&quot; ...</text> <retweet_count>2</retweet_count> <user> <name>Siggi Eggertsson</name> <location>Berlin, Germany</location> <description></description> <url>http://www.siggieggertsson.com</url> </user> </status> ";
Document document = builder.parse(new InputSource(new StringReader(input)));
String value = document.getElementsByTagName("id").item(0).getTextContent();
System.out.println(value);

Your "id" element doesn't *have* any quotes. If you look up the 'text' element instead of the 'id' element, you'll see the double quotes just fine. – 2012-03-26 05:11:25

Answer

2

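The parser decodes entity references, so &quot; is not lost: it is turned into a literal double quote. When you print the content of

<text>&quot;Resolutionary&quot; ...</text> 

you get "Resolutionary" ...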
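To see this for yourself, here is a minimal sketch based on the code in the question. The class name EntityDemo and the helper firstText are made up for illustration, and the input is shortened to the two elements that matter. It reads the 'id' element (which has no quotes) and then the 'text' element (where &quot; comes back as a real double quote).

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

final class EntityDemo {
    // Hypothetical helper (not from the question): parses the XML string and
    // returns the text content of the first element with the given tag name.
    public static String firstText(String xml, String tag) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document document = builder.parse(new InputSource(new StringReader(xml)));
        return document.getElementsByTagName(tag).item(0).getTextContent();
    }

    public static void main(String[] args) throws Exception {
        // Shortened version of the input from the question.
        String input = "<status><id>000000014</id>"
                + "<text>&quot;Resolutionary&quot; ...</text></status>";

        // The 'id' element contains no entity references at all.
        System.out.println(firstText(input, "id"));   // 000000014

        // In the 'text' element, &quot; is decoded to a double quote.
        System.out.println(firstText(input, "text")); // "Resolutionary" ...
    }
}
```

So the only change needed in the original code is to look up "text" instead of "id". The decoded string no longer contains the literal characters &quot; because those exist only in the serialized XML, not in the parsed document.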