2017-10-16 93 views
0

我的目标是使用java.net.HttpURLConnection来解析给定URL的所有302重定向跃点。Java HttpURLConnection - 列举所有302重定向跳转

这是我的代码片段(实际上,我用它在Talend SW):

String url = row2.url; 
java.net.HttpURLConnection con = (java.net.HttpURLConnection) new java.net.URL(url).openConnection(); 
con.setInstanceFollowRedirects(false); 
con.connect(); 
String realURL = con.getHeaderField("Location"); 

System.out.println(realURL); 

它工作得很好,如果只有一个302的答复。 例如,如果我填充row2.ulr =“https://jigsaw.w3.org/HTTP/300/302.html”代码将输出 - >https://jigsaw.w3.org/HTTP/300/Overview.html女巫是100%正确的。

我的问题是,我无法分析几个302的回复,一个典型的例子就是Facebook的网址: www.facebook.com/ID_account - > redirect-> www.facebook.com/ACCOUNT_NAME

使用重定向检查网上我发现了2 - 302响应(我需要第二个): enter image description here

这是卷曲输出:

> >>> http://www.facebook.com/123456789 
> 
> > -------------------------------------------- 
> > 302 Found 
> > -------------------------------------------- 
> 
> Status: 302 Found Code: 302 
> Location: https://www.facebook.com/123456789 Vary: Accept-Encoding 
> Content-Type: text/html; charset=UTF-8 
> X-FB-Debug: Muf4PfCP9TRKCO17QUf7SV2vsdnrCu6Gw2+sjWAKe0QPGdAToJPcmgH5LHv3NIAhzsJXfPB3a9/mVtuhiiEihA== 
> Date: Mon, 16 Oct 2017 16:02:55 GMT Connection: close 
> Content-Length: 0 
> 
> 
> 
> 
> >>> https://www.facebook.com/123456789 
> 
> > -------------------------------------------- 
> > 302 Found 
> > -------------------------------------------- 
> 
> Status: 302 Found Code: 302 X-XSS-Protection: 0 
> public-key-pins-report-only: max-age=600; 
> pin-sha256="WoiWRyIOVNa9ihaBciRSC7XHjliYS9VwUGOIud4PB18="; 
> pin-sha256="k2v657xBsOVe1PQRwOsHsw3bsGT2VzIqz5K+59sNQws="; 
> pin-sha256="gMxWOrX4PMQesK9qFNbYBxjBfjUvlkn/vN1n+L9lE5E="; 
> pin-sha256="q4PO2G2cbkZhZ82+JgmRUyGMoAeozA+BSXVXQWB8XWQ="; 
> report-uri="http://reports.fb.com/hpkp/" Pragma: no-cache 
> Location: https://www.facebook.com/a_name_account/ 
> Cache-Control: private, no-cache, no-store, must-revalidate 
> X-Frame-Options: DENY Strict-Transport-Security: max-age=15552000; 
> preload X-Content-Type-Options: nosniff Expires: Sat, 01 Jan 2000 
> 00:00:00 GMT Vary: Accept-Encoding Content-Type: text/html; 
> charset=UTF-8 
> X-FB-Debug: j2KCBNZ1poIJ0xUeeQYbinpcqq2avoI4z8eWb9Dx/yUUg98uyGYGadydia7en1s5X4DJeaJB7VjxYaRvP+psCw== 
> Date: Mon, 16 Oct 2017 16:02:55 GMT Connection: close 
> Content-Length: 0 
> 
> 
> 
> 
> >>> https://www.facebook.com/a_name_account/ 

如何找到第二个302的任何建议“要价较高离子“ - >https://www.facebook.com/a_name_account/

在此先感谢 的REG S.

回答

1

这将是一个伟大的Recursive Method行为。

如果您确定响应头中仍有位置,则可以继续调用您的方法。

public void recurseLocation(String url) { 

    URL url = new URL(location); 
    HttpURLConnection connection = (HttpURLConnection) url.openConnection(); 
    connection.setInstanceFollowRedirects(false); 
    connection.connect(); 
    String newLocation = connection.getHeaderField("Location"); 

    if (newLocation != null) { 
     newLocation = recurseLocation(newLocation); 
    } else { 
     newLocation = location; 
    } 

    return newLocation; 
} 
+0

Thx,但是有没有办法简单得到总计302的索引而不递归调用函数? – user3925023

+0

也可以解释一下这个行的URL url = new URL(location);? thx – user3925023

+0

我相信你将不得不为每个重定向提出请求。只有一个http请求获得所有重定向并不是一种神奇的方式。我们只需要通过所有这些,并找到最后一个... – hooknc