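Excel: querying an attribute from an HTML heading

I want to use Excel VBA to extract an attribute value from a heading element in a web page. The page I want to scrape has the following structure: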
<div class="index-detail">
<h5><a href="/indices/equity/dow-jones-sustainability-chile-index-clp" title="DJSI Chile" contentIdentifier="2e9cb165-0cbf-4070-a5ef-dc20bf6219ba" contentType="web-page" contentTitle="Dow Jones Sustainability™ Chile Index (CLP)">DJSI Chile</a></h5>
<span class="return-value">917.08 </span>
<span class="daily-change down ">-0.1% ▼ </span>
</div>
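Using getElementsByClassName and getElementsByTagName I have already extracted the <h5> heading, but when I print its innerText I get DJSI Chile. What I want instead is the text of the contentTitle attribute, Dow Jones Sustainability™ Chile Index (CLP).
How can I do this?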
UPDATE
The code I am using is as follows:
Sub myConSP()
    ' Declare variables
    Dim oHtmlSP As HTMLDocument
    Dim tSPIndex As HTMLDivElement
    Dim tSPIdx As HTMLDivElement
    ' Load page inside HTMLDocument
    Set oHtmlSP = New HTMLDocument
    With CreateObject("WINHTTP.WinHTTPRequest.5.1")
        .Open "GET", "http://www.espanol.spindices.com", False
        .send
        oHtmlSP.body.innerHTML = .responseText
    End With
    ' Get indices
    Set tSPIndex = oHtmlSP.getElementById("all-indices-slider")
    Set objTitleTag = tSPIndex.getElementsByClassName("index-detail")(0).getElementsByTagName("h5")(0)
    MsgBox objTitleTag.getAttribute("contentTitle").innerText
End Sub
'objTitleTag.getAttribute("contentTitle")'
How do I define objTitleTag? – capm
It is whatever you are calling 'innerText' on. It is always best to show your actual code: that makes it easier to suggest what to add.
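Following the getAttribute suggestion in the comments, here is a minimal sketch of how the attribute might be read. It rests on two observations from the markup in the question: contentTitle sits on the <a> inside the <h5>, not on the <h5> itself, and getAttribute already returns the value, so .innerText should not be appended to it. The function name GetContentTitle is hypothetical, and the sketch assumes a reference to the Microsoft HTML Object Library is set, as the early-bound HTMLDocument in the question implies.

```vba
' Assumes Tools > References > "Microsoft HTML Object Library" is checked.
' GetContentTitle is a hypothetical helper name, not part of any library.
Function GetContentTitle(ByVal html As String) As String
    Dim doc As HTMLDocument
    Dim nodes As Object
    Dim attrValue As Variant

    ' Parse the supplied markup into an in-memory document
    Set doc = New HTMLDocument
    doc.body.innerHTML = html

    ' Find the first <h5>, then the first <a> inside it:
    ' the attribute lives on the anchor, not on the heading
    Set nodes = doc.getElementsByTagName("h5")
    If nodes.Length = 0 Then Exit Function
    Set nodes = nodes(0).getElementsByTagName("a")
    If nodes.Length = 0 Then Exit Function

    ' getAttribute returns the value itself (Null when the attribute is absent)
    attrValue = nodes(0).getAttribute("contentTitle")
    If Not IsNull(attrValue) Then GetContentTitle = attrValue
End Function
```

In the Sub from the question, the same idea would amount to taking getElementsByTagName("a")(0) from the <h5> element and passing the result of getAttribute("contentTitle") straight to MsgBox. With made-up sample markup, the helper could be tried from the Immediate window like this:

```vba
Debug.Print GetContentTitle("<h5><a href=""#"" contentTitle=""Example Index"">Short</a></h5>")
```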