Skip to content

Commit 4c7a6b3

Browse files
wangmiao1981mengxr
authored andcommitted
[SPARK-15363][ML][EXAMPLE] Example code shouldn't use VectorImplicits._, asML/fromML
## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In this DataFrame example, we use VectorImplicits._, which is private API. Since Vectors object has public API, we use Vectors.fromML instead of implicts. ## How was this patch tested? (Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests) Manually run the example. Author: [email protected] <[email protected]> Closes apache#13213 from wangmiao1981/ml.
1 parent 09a0051 commit 4c7a6b3

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

examples/src/main/scala/org/apache/spark/examples/ml/DataFrameExample.scala

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ import scopt.OptionParser
2525

2626
import org.apache.spark.examples.mllib.AbstractParams
2727
import org.apache.spark.ml.linalg.Vector
28-
import org.apache.spark.mllib.linalg.VectorImplicits._
28+
import org.apache.spark.mllib.linalg.Vectors
2929
import org.apache.spark.mllib.stat.MultivariateOnlineSummarizer
3030
import org.apache.spark.sql.{DataFrame, Row, SparkSession}
3131

@@ -81,7 +81,7 @@ object DataFrameExample {
8181
// Convert features column to an RDD of vectors.
8282
val features = df.select("features").rdd.map { case Row(v: Vector) => v }
8383
val featureSummary = features.aggregate(new MultivariateOnlineSummarizer())(
84-
(summary, feat) => summary.add(feat),
84+
(summary, feat) => summary.add(Vectors.fromML(feat)),
8585
(sum1, sum2) => sum1.merge(sum2))
8686
println(s"Selected features column with average values:\n ${featureSummary.mean.toString}")
8787

0 commit comments

Comments
 (0)