2013-08-25 76 views
0

因此,我正在用C#中的HtmlAgilityPack制作一个程序(用于自己的目的),以便在某个点加载一个网页。加载大量的页面后 ,我得到这个错误:无法从传输连接读取数据:C#HtmlAgilityPack

Unhandled Exception: System.IO.IOException: Unable to read data from the transpo 
rt connection: An existing connection was forcibly closed by the remote host. -- 
-> System.Net.Sockets.SocketException: An existing connection was forcibly close 
d by the remote host 
    at System.Net.Sockets.Socket.Receive(Byte[] buffer, Int32 offset, Int32 size, 
SocketFlags socketFlags) 
    at System.Net.Sockets.NetworkStream.Read(Byte[] buffer, Int32 offset, Int32 s 
ize) 
    --- End of inner exception stack trace --- 
    at System.Net.ConnectStream.Read(Byte[] buffer, Int32 offset, Int32 size) 
    at System.IO.StreamReader.ReadBuffer() 
    at System.IO.StreamReader.ReadToEnd() 
    at HtmlAgilityPack.HtmlDocument.Load(TextReader reader) in d:\Source\htmlagil 
itypack.new\Trunk\HtmlAgilityPack\HtmlDocument.cs:line 612 
    at HtmlAgilityPack.HtmlWeb.Get(Uri uri, String method, String path, HtmlDocum 
ent doc, IWebProxy proxy, ICredentials creds) in d:\Source\htmlagilitypack.new\T 
runk\HtmlAgilityPack\HtmlWeb.cs:line 1422 
    at HtmlAgilityPack.HtmlWeb.LoadUrl(Uri uri, String method, WebProxy proxy, Ne 
tworkCredential creds) in d:\Source\htmlagilitypack.new\Trunk\HtmlAgilityPack\Ht 
mlWeb.cs:line 1479 
    at HtmlAgilityPack.HtmlWeb.Load(String url, String method) in d:\Source\htmla 
gilitypack.new\Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1103 
    at HtmlAgilityPack.HtmlWeb.Load(String url) in d:\Source\htmlagilitypack.new\ 
Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1061 
    at ConsoleApplication1.Program.Main(String[] args) in 
c:\Users\...ConsoleApplication1\Program.c 
s:line 37 

在第37行我加载一个for循环内页:

for (var i = 0; i< 5000; i++) 
    var page = web.Load(url+Convert.ToString(i+1)+"/"); 

我试图做一些研究上的错误,但是在那里没有太多的组建。

+2

这与Html Agility Pack库无关。错误来自HTTP/TCP/Socket层。这只是意味着服务器有问题或者只是拒绝你的电话。 –

+0

好的,谢谢你的回答,但我该如何解决这个错误? – breght

+0

它可能由许多事情引起。如果你不拥有这台服务器,你真的不知道。例如,他们可能会将您检测为黑客。 –

回答

0

我下载了1000多个网页后出现同样的错误。在循环中用一个额外的IOException解决它。 这是我的代码:

HtmlWeb web = new HtmlWeb(); 
web.PreRequest = delegate(HttpWebRequest webRequest) 
{ 
    webRequest.Timeout = 15000; 
    return true; 
}; 

try { doc = web.Load(yUrl); } 
catch (WebException ex) 
{ 
    reTryCounter++; 
    if (reTryCounter == 19) { MessageBox.Show("Error Program 1121 , Download webpage \n" + ex.ToString()); } 
} 
catch (IOException ex2) 
{ 
    MessageBox.Show("Error Program 1125 , IOException Download webpage \n" + ex2.ToString()); 
    return null; 
} 
相关问题