2014-02-23 41 views
import mechanize
br = mechanize.Browser()
r = br.open("http://www.drugs.com/search-wildcard-phonetic.html")
br.select_form(nr=0)
br.form['searchterm'] = 'panadol'
br.submit()
print br.response().read()

Error raised by the code above:
Traceback (most recent call last): 
    File "mech2.py", line 6, in <module> 
    br.submit() 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 541, in submit 
    return self.open(self.click(*args, **kwds)) 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 203, in open 
    return self._mech_open(url, data, timeout=timeout) 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 255, in _mech_open 
    raise response 
mechanize._response.httperror_seek_wrapper: HTTP Error 403: request disallowed by robots.txt 

Please help me fix the code above. I get this error when scraping the web page with mechanize.

Answers


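It doesn't look like there is anything wrong with your code. Your error message

mechanize._response.httperror_seek_wrapper: HTTP Error 403: request disallowed by robots.txt

says that the request you are making is disallowed by the site's robots.txt file. If you don't want to see this error message, stop abusing the site, and perhaps contact its operators to ask about an acceptable way of using their data.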
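To see how a robots.txt rule can block one URL but not another, here is a minimal sketch using the standard library's robots.txt parser. It is written for Python 3 (`urllib.robotparser`), whereas the question uses Python 2.7. The rules in `SAMPLE_ROBOTS_TXT`, the `my-scraper` user agent, the `is_allowed` helper and the `search.php` URL are all hypothetical and chosen only for illustration; they are not the real robots.txt or form target of drugs.com.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, NOT the real file served by drugs.com.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /search.php
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The page opened in the question is not covered by the sample rule.
form_page_ok = is_allowed(
    SAMPLE_ROBOTS_TXT,
    "my-scraper",
    "http://www.drugs.com/search-wildcard-phonetic.html",
)

# A hypothetical form target that the sample rule does disallow.
results_page_ok = is_allowed(
    SAMPLE_ROBOTS_TXT,
    "my-scraper",
    "http://www.drugs.com/search.php?searchterm=panadol",
)

print(form_page_ok, results_page_ok)
```

Under these assumed rules the form page may be fetched but the results page may not, which is the same pattern as in the question: `br.open()` succeeds and `br.submit()` is refused. Checking a URL this way before requesting it tells you in advance whether the site has asked crawlers to stay away from it.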


Does this code work? Did you try running it? – FathimaBeevi