<div data-projects-path="/pt/projects" id="explore_results">
<div class="results">
<div class="project-box" itemscope="" itemtype="http://schema.org/CreativeWork">
<meta content="2014-08-30" itemprop="dateCreated">
<div class="image">
<a href="/pt/ospassosdabia" target="" title="Os passos da Bia">
<img alt="Project thumb bia" height="172" src="http://s3.amazonaws.com/cdn.catarse/uploads/project/uploaded_image/7229/project_thumb_Bia.png" width="220">
</a>
<div class="project-box" itemscope="" itemtype="http://schema.org/CreativeWork">
<meta content="2014-09-19" itemprop="dateCreated">
<div class="image">
<a href="/pt/livrepartida" target="" title="Livre Partida">
<img alt="Project thumb logo colorido" height="172" src="http://s3.amazonaws.com/cdn.catarse/uploads/project/uploaded_image/7613/project_thumb_logo_colorido.jpg" width="220">
</a>
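This is what I want to scrape, and I want to do it with R. I only need all of the /pt/.... paths,
such as /pt/livrepartida
and /pt/ospassosdabia.
The code above is an example of the website's HTML.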
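When I scroll down the page, more code like this appears, along with more paths of the same kind ("/pt/....").
I want to get all of these "/pt/...." paths from the website. How can I do that?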
Could you post more examples of the '/pt/..' paths? That would help with testing. – akrun 2014-10-09 15:14:08
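Please look at my question again. The '/pt/...' paths are the ones in the code above. However, each '/pt/..' entry only stays in the HTML until its deadline, and new '/pt/....' entries are added every day; I want to get them all. – Gabriel 2014-10-09 16:07:54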
When I use the code, I get 'unname(xpathSApply(doc1, "//a/@href"))  #[1] "/pt/ospassosdabia" "/pt/livrepartida"' – akrun 2014-10-09 16:25:26
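The XPath call in akrun's comment can be expanded into a small self-contained sketch. It assumes the XML package is installed, and it parses a trimmed, properly closed copy of the HTML from the question rather than the live site. The `html` string and the names `hrefs` and `pt_links` are illustrative choices, not part of the original thread.

```r
# Assumption: the XML package is installed (install.packages("XML"))
library(XML)

# Trimmed copy of the HTML from the question, with tags closed so it stands alone
html <- '
<div data-projects-path="/pt/projects" id="explore_results">
<div class="results">
<div class="project-box">
<div class="image">
<a href="/pt/ospassosdabia" title="Os passos da Bia"></a>
</div>
</div>
<div class="project-box">
<div class="image">
<a href="/pt/livrepartida" title="Livre Partida"></a>
</div>
</div>
</div>
</div>'

# Parse the string as HTML (asText = TRUE means "this is markup, not a file name")
doc1 <- htmlParse(html, asText = TRUE)

# Collect every href attribute of every <a> element as a plain character vector
hrefs <- as.character(xpathSApply(doc1, "//a/@href"))

# Keep only the unique paths that start with /pt/
pt_links <- unique(hrefs[grepl("^/pt/", hrefs)])
print(pt_links)
```

This only finds links that are present in the HTML handed to `htmlParse`. The entries that appear while scrolling are presumably loaded by separate requests, so they would have to be fetched and parsed in the same way before their paths show up in `pt_links`.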