This tutorial shows how to add a new column to an existing Dataset/DataFrame.
First we create a dataset by reading a CSV file (here spark is an existing SparkSession):
Dataset<Row> data = spark.read().format("csv").load("path/to/your/CSV.csv");
Then we add a column with lit, a static method of org.apache.spark.sql.functions that turns a constant value into a column:
// Needs: import static org.apache.spark.sql.functions.lit;
data = data.withColumn("newColumn", lit(1)); // reassign: data is already declared above
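And we are done! Every row now has a newColumn holding the value 1. Note that withColumn does not modify the dataset in place; it returns a new one, which is why we keep the result.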
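To try the steps end to end without a CSV file at hand, here is a minimal sketch of a complete program. It is only an illustration under a few assumptions: the class name AddColumnExample, the helper buildWithNewColumn, the sample rows, and the local[*] master are all made up for this example, and the dataset is built in memory instead of being loaded from disk.

```java
import static org.apache.spark.sql.functions.lit;

import java.util.Arrays;
import java.util.List;

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.RowFactory;
import org.apache.spark.sql.SparkSession;
import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.StructType;

public class AddColumnExample {

    // Hypothetical helper: builds a tiny in-memory dataset and
    // returns a new dataset with a constant column appended.
    static Dataset<Row> buildWithNewColumn(SparkSession spark) {
        // Illustrative schema and rows standing in for the CSV file.
        StructType schema = new StructType()
                .add("name", DataTypes.StringType)
                .add("age", DataTypes.IntegerType);
        List<Row> rows = Arrays.asList(
                RowFactory.create("alice", 30),
                RowFactory.create("bob", 25));
        Dataset<Row> data = spark.createDataFrame(rows, schema);

        // lit(1) wraps the constant 1 as a column; withColumn returns
        // a new dataset and leaves the original one unchanged.
        return data.withColumn("newColumn", lit(1));
    }

    public static void main(String[] args) {
        // Local session for experimentation only (assumed setup).
        SparkSession spark = SparkSession.builder()
                .appName("AddColumnExample")
                .master("local[*]")
                .getOrCreate();

        Dataset<Row> result = buildWithNewColumn(spark);
        result.printSchema(); // lists name, age and newColumn
        result.show();

        spark.stop();
    }
}
```

Running this needs the spark-sql dependency on the classpath. Swapping the in-memory rows for the spark.read() call from the first step should give the same result for a real CSV file.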