2016-03-02 52 views

回答

3

默认情况下,所有的RDD都是懒惰的。直到你打电话,他们才会真正做任何事情。所以不要添加懒惰,因为这只会延迟围绕RDD创建元数据,而实际上并不影响执行。

val table = sparkContext.cassandraTable[Type](keyspace,tableName) 
val tableWithWhere = table.where("x = 5") 
val tableTransformed = table.map(x:Type => turnXIntoY(x)) 
//nothing has happened in C* or Spark on executors yet 
tableTransformed.collect // This causes spark to start doing work