Python网页抓取，符号含义

在下面的代码中，符号字符串re.sub('<[^>]*>|[\n]|\[[0-9]*\]', '', htmlread)的每个元素是什么意思？Python网页抓取，符号含义

import urllib2 
import re 

htmltext = urllib2.urlopen("https://en.wikipedia.org/wiki/Linkin_Park") 
htmlread = htmltext.read() 
htmlread = re.sub('<[^>]*>|[\n]|\[[0-9]*\]', '', htmlread) 
regex = '(?<=Linkin Park was founded)(.*)(?=the following year.)' 
pattern = re.compile(regex) 
htmlread = re.findall(pattern, htmlread) 
print "Linkin Park was founded" + htmlread[0] + "the following year."

来源

2016-08-10 Kernel2710

http://stackoverflow.com/questions/22937618/参考 - 什么 - 做 - 这正则表达式均值 –