3
我想运行在pyspark代码(火花2.1.1):趴趴的类型必须为org.apache.spark.ml.linalg.VectorUDT
from pyspark.ml.feature import PCA
bankPCA = PCA(k=3, inputCol="features", outputCol="pcaFeatures")
pcaModel = bankPCA.fit(bankDf)
pcaResult = pcaModel.transform(bankDF).select("label", "pcaFeatures")
pcaResult.show(truncate= false)
但我得到这个错误:
requirement failed: Column features must be of type
org.apache.spark.ml.linalg.Vect [email protected]
but was actually[email protected]
.