BeautifulSoup4: FileNotFoundError when opening a URL

I'm using BeautifulSoup4 to scrape a website. Here is a stripped-down version of what I have:
from bs4 import BeautifulSoup

def getTeamRoster(teamURL):
    soup = BeautifulSoup(open(teamURL))

def main():
    getTeamRoster("http://modules.ussquash.com/ssm/pages/leagues/Team_Information.asp?id=11325")
I've pulled up the page, and it loads fine in my browser (Chrome). For some reason, though, I get the following error:
Traceback (most recent call last):
  File "SquashScraper.py", line 61, in <module>
    main()
  File "SquashScraper.py", line 58, in main
    getTeamRoster("http://modules.ussquash.com/ssm/pages/leagues/Team_Information.asp?id=11325")
  File "SquashScraper.py", line 21, in getTeamRoster
    soup = BeautifulSoup(open(teamURL))
FileNotFoundError: [Errno 2] No such file or directory: 'http://modules.ussquash.com/ssm/pages/leagues/Team_Information.asp?id=11325'
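Any ideas what might be going on?
I've looked at other people's BeautifulSoup4 code and thought that what I'm doing is the idiomatic way to access a page's HTML.
Thanks, bclayman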
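
A likely reading of the traceback: Python's built-in open() only reads files on the local filesystem, so it treats the URL string as a file path and fails with FileNotFoundError. The page has to be downloaded first and the resulting HTML handed to BeautifulSoup. Below is a minimal sketch of that idea using the standard library's urllib.request.urlopen. The helper name fetch_html, the snake_case get_team_roster, the 10-second timeout, and the choice of "html.parser" are illustrative assumptions, not part of the original script.

```python
from urllib.request import urlopen


def fetch_html(url, timeout=10):
    """Download the resource at `url` and return its body as bytes.

    Hypothetical helper: unlike the built-in open(), urlopen() actually
    performs the request instead of looking for a local file.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def get_team_roster(team_url):
    """Return a BeautifulSoup tree for the page at `team_url`."""
    # Imported here so fetch_html has no third-party dependency.
    from bs4 import BeautifulSoup

    html = fetch_html(team_url)
    # Naming the parser explicitly keeps results consistent across machines.
    return BeautifulSoup(html, "html.parser")
```

Calling get_team_roster("http://modules.ussquash.com/ssm/pages/leagues/Team_Information.asp?id=11325") should then give a soup object to search, assuming the server answers a plain request the same way it answers Chrome. The third-party requests library is a common alternative for the download step.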