2
我希望我的代码能够读取每分钟生成的json文本文件(它是来自Citibike的站点提要数据),并且我尝试使用Spark Streaming。但我不断收到未知的主机异常错误。Spark Streaming中的UnknownHostExceptionError
我的代码:
String url = "http://citibikenyc.com/stations/json";
SparkConf conf = new SparkConf().setMaster("local[2]").setAppName("Streaming");
JavaSparkContext sc = new JavaSparkContext(conf);
JavaStreamingContext jssc = new JavaStreamingContext(sc, new Duration(60000));
JavaDStream<String> lines = jssc.socketTextStream(url, 9999);
lines.print();
jssc.start();
jssc.awaitTermination();
和错误:
14/11/22 15:32:54 ERROR scheduler.ReceiverTracker: Deregistered receiver for stream 0: Restarting receiver with delay 2000ms: Error receiving data - java.net.UnknownHostException: http://citibikenyc.com/stations/json
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:579)
at java.net.Socket.connect(Socket.java:528)
at java.net.Socket.<init>(Socket.java:425)
at java.net.Socket.<init>(Socket.java:208)
at org.apache.spark.streaming.dstream.SocketReceiver.receive(SocketInputDStream.scala:71)
at org.apache.spark.streaming.dstream.SocketReceiver$$anon$2.run(SocketInputDStream.scala:57)
14/11/22 15:32:54 INFO receiver.ReceiverSupervisorImpl: Stopped receiver 0
Google“java.net.UnknownHostException”? – vzamanillo 2014-11-24 14:25:34