A mungebit which affects multiple columns identically and independently can be abstracted into a column transformation. This function allows one to specify what happens to an individual column, and the mungebit will be the resulting column transformation applied to an arbitrary combination of columns.
column_transformation(transformation, nonstandard = FALSE)
transformation | function. The function's first argument will
receive an atomic vector derived from some |
---|---|
nonstandard | logical. If |
a function which takes a data.frame and a vector of column
names (or several other formats, see standard_column_format
)
and applies the transformation
.
The function produced by calling column_transformation
will
not run independently. It must be used a train or predict function for
a mungebit
.
multi_column_transformation
, standard_column_format
doubler <- column_transformation(function(x) { 2 * x }) # doubles the Sepal.Length column in the iris dataset iris2 <- mungebit$new(doubler)$run(iris, c("Sepal.Length"))