在网页中搜索字符串并使用python打印包含该字符串的整行

我想搜索一个字符串的网页并打印包含该字符串的整个行。在网页中搜索字符串并使用python打印包含该字符串的整行

我有一个输入文件，其中包含我想要搜索该字符串的链接。要搜索

字符串： “VCORE”

我的输入文件：

http://abc/cluster/app/application_1447334090028_225490 
http://abc/cluster/app/application_1447334090028_228858

预期输出文件：

http://abc/cluster/app/application_1447334090028_225490 12434 vcore, 123 mb 
http://abc/cluster/app/application_1447334090028_228858 12132 vcore, 131 mb

代码到目前为止：

import sys 
import re 
import urllib 

Links = [Link.strip() for Link in open ('/home/try/Input.txt','r').readlines()] 

for link in Links: 
     webPage = urllib.urlopen(link).read() 
     print webPage

然后，我使用grep来搜索字符串并将其存储在另一个文件中。但我希望它由代码本身完成，并且该行出现在相应链接旁边。谁可以帮我这个事？

来源

2015-11-20 blackfury

“线”不存在HTML，只有节点和文本。 –

你可以给一个小样本输入和输出吗？ –

所以，有没有办法让我从没有bs4的HTML页面中找到什么？ – blackfury

lines = urllib.urlopen(link).readlines() 
for line in lines: 
    if "vcore" in line: 
     print line

来源

2015-11-20 08:12:16

工程就像一个魅力....只需在输出空白行。谢谢史蒂夫:)我已经更新了你的代码，以及链接 – blackfury

import re 
import urllib 

Links = [Link.strip() for Link in open ('/home/try/Urls.txt','r').readlines()] 

for link in Links: 
     lines = urllib.urlopen(link).readlines() 
     for line in lines: 
       if "vcore" in line: 
         print link,line

只要有每个打印语句后的空白行

来源

2015-11-20 09:02:21 blackfury

，每一行都以'\ n'结尾，因此，要删除空白行，请尝试：print line.strip（） –

在网页中搜索字符串并使用python打印包含该字符串的整行

回答

相关问题