2016-11-15 26 views

What is the difference between the sort and orderBy functions on a Spark DataFrame?

scala> zips.printSchema 
root 
|-- _id: string (nullable = true) 
|-- city: string (nullable = true) 
|-- loc: array (nullable = true) 
| |-- element: double (containsNull = true) 
|-- pop: long (nullable = true) 
|-- state: string (nullable = true) 

Both of the following commands produce the same result:

zips.sort(desc("pop")).show 
zips.orderBy(desc("pop")).show 

Answer

orderBy is simply an alias for the sort function, so the two calls behave identically.

From the Spark documentation:

/**
 * Returns a new Dataset sorted by the given expressions.
 * This is an alias of the `sort` function.
 *
 * @group typedrel
 * @since 2.0.0
 */
@scala.annotation.varargs
def orderBy(sortCol: String, sortCols: String*): Dataset[T] = sort(sortCol, sortCols : _*)
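
To make the aliasing pattern concrete, here is a minimal sketch in plain Scala that does not depend on Spark. `Zip`, `MiniDataset`, and `AliasDemo` are hypothetical names invented for illustration and are not part of Spark's API; the sketch only mirrors how `orderBy` forwards its varargs to `sort`, and it sorts ascending only (it does not model `desc`).

```scala
// Hypothetical row type, loosely modelled on two columns of the zips schema.
final case class Zip(city: String, pop: Long)

// Hypothetical stand-in for a Dataset; NOT Spark's real class.
final class MiniDataset(val rows: Seq[Zip]) {

  // Maps a column name to an ascending ordering on that column.
  private def orderingFor(col: String): Ordering[Zip] = col match {
    case "city" => Ordering.by[Zip, String](_.city)
    case "pop"  => Ordering.by[Zip, Long](_.pop)
    case other  => throw new IllegalArgumentException(s"unknown column: $other")
  }

  // Sorts by the first column, breaking ties with the remaining columns in order.
  def sort(sortCol: String, sortCols: String*): MiniDataset = {
    val orderings = (sortCol +: sortCols).map(orderingFor)
    val combined = new Ordering[Zip] {
      def compare(a: Zip, b: Zip): Int =
        orderings.iterator.map(_.compare(a, b)).find(_ != 0).getOrElse(0)
    }
    new MiniDataset(rows.sorted(combined))
  }

  // Pure alias: forwards every argument to sort, as in the quoted Spark source.
  def orderBy(sortCol: String, sortCols: String*): MiniDataset =
    sort(sortCol, sortCols: _*)
}

object AliasDemo {
  def main(args: Array[String]): Unit = {
    val zips = new MiniDataset(Seq(Zip("B", 2L), Zip("A", 3L), Zip("C", 1L)))
    // Both calls go through the same code path, so the rows are equal.
    println(zips.sort("pop").rows == zips.orderBy("pop").rows)
  }
}
```

Because `orderBy` contains no logic of its own, any difference between the two names is purely a matter of readability (for example, `orderBy` may feel more natural to readers coming from SQL).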